PAIR Lab: PKU Alignment and Interaction Research Lab
PAIR Lab: PKU Alignment and Interaction Research Lab
Open-Source Projects
People
Talks
Publications
Resources
Contact
Multi-Agent Reinforcement Learning
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Large sequence models (SM) such as GPT series and BERT have displayed outstanding performance and generalization capabilities in …
Muning Wen
,
Jakub Grudzien Kuba
,
Runji Lin
,
Weinan Zhang
,
Ying Wen
,
Jun Wang
,
Yaodong Yang
PDF
Cite
Neural Auto-Curricula in Two-Player Zero-Sum Games
When solving two-player zero-sum games, multi-agent reinforcement learning (MARL) algorithms often create populations of agents where, …
Xidong Feng
,
Oliver Slumbers
,
Ziyu Wan
,
Bo Liu
,
Stephen McAleer
,
Ying Wen
,
Jun Wang
,
Yaodong Yang
PDF
Cite
LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning
Efficient exploration is important for reinforcement learners to achieve high rewards. In multi-agent systems, coordinated exploration …
David Henry Mguni
,
Taher Jafferjee
,
Jianhong Wang
,
Oliver Slumbers
,
Nicolas Perez Nieves
,
Feifei Tong
,
Li Yang
,
Jiangcheng Zhu
,
Yaodong Yang
,
Jun Wang
PDF
Cite
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
Trust region methods rigorously enabled reinforcement learning (RL) agents to learn monotonically improving policies, leading to …
Jakub Grudzien Kuba
,
Ruiqing Chen
,
Muning Wen
,
Ying Wen
,
Fanglei Sun
,
Jun Wang
,
Yaodong Yang
PDF
Cite
Settling the Variance of Multi-Agent Policy Gradients
Policy gradient (PG) methods are popular reinforcement learning (RL) methods where a baseline is often applied to reduce the variance …
Jakub Grudzien Kuba
,
Muning Wen
,
Linghui Meng
,
Shangding Gu
,
Haifeng Zhang
,
David Henry Mguni
,
Jun Wang
,
Yaodong Yang
PDF
Cite
Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games
Measuring and promoting policy diversity is critical for solving games with strong non-transitive dynamics where strategic cycles …
Xiangyu Liu
,
Hangtian Jia
,
Ying Wen
,
Yujing Hu
,
Yingfeng Chen
,
Changjie Fan
,
Zhipeng Hu
,
Yaodong Yang
PDF
Cite
«
Cite
×