PAIR Lab: PKU Alignment and Interaction Research Lab
PAIR Lab: PKU Alignment and Interaction Research Lab
Open-Source Projects
People
Talks
Publications
Resources
Contact
Haifeng Zhang
Latest
Large Sequence Models for Sequential Decision-Making: A Survey
Contextual Transformer for Offline Meta Reinforcement Learning
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning
Offline Pre-trained Multi-agent Decision Transformer
Settling the Variance of Multi-Agent Policy Gradients
Cite
×