PAIR Lab: PKU Alignment and Interaction Research Lab
PAIR Lab: PKU Alignment and Interaction Research Lab
Open-Source Projects
People
Talks
Publications
Resources
Contact
Multi-Agent Reinforcement Learning
Safe multi-agent reinforcement learning for multi-robot control
A challenging problem in robotics is how to control multiple robots cooperatively and safely in real-world applications. Yet, …
Shangding Gu
,
Jakub Grudzien Kuba
,
Yuanpei Chen
,
Yali Du
,
Long Yang
,
Alois Knoll
,
Yaodong Yang
PDF
Cite
A Game-Theoretic Framework for Managing Risk in Multi-Agent Systems
In order for agents in multi-agent systems (MAS) to be safe, they need to take into account the risks posed by the actions of other …
Oliver Slumbers
,
David Henry Mguni
,
Stephen Marcus McAleer
,
Stefano B. Blumberg
,
Jun Wang
,
Yaodong Yang
PDF
Cite
MANSA: Learning Fast and Slow in Multi-Agent Systems
In multi-agent reinforcement learning (MARL), independent learning (IL) often shows remarkable performance and easily scales with the …
David Mguni
,
Haojun Chen
,
Taher Jafferjee
,
Jianhong Wang
,
Long Fei
,
Xidong Feng
,
Stephen McAleer
,
Feifei Tong
,
Jun Wang
,
Yaodong Yang
PDF
Cite
On the complexity of computing Markov perfect equilibrium in general-sum stochastic games
We introduce approximate Markov perfect equilibrium as a solution to the computational problem of finite-state stochastic games repeated in the infinite horizon and prove its PPAD-completeness.
Xiaotie Deng
,
Ningyuan Li
,
David Mguni
,
Jun Wang
,
Yaodong Yang
PDF
Cite
A game-theoretic approach to multi-agent trust region optimization
Trust region methods are widely applied in single-agent reinforcement learning problems due to their monotonic performance-improvement …
Ying Wen
,
Hui Chen
,
Yaodong Yang
,
Minne Li
,
Zheng Tian
,
Xu Chen
,
Jun Wang
PDF
Cite
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency
Multi-agent reinforcement learning (MARL) suffers from the non-stationarity problem, which is the ever-changing targets at every …
Chuming Li
,
Jie Liu
,
Yinmin Zhang
,
Yuhong Wei
,
Yazhe Niu
,
Yaodong Yang
,
Yu Liu
,
Wanli Ouyang
PDF
Cite
Scalable Model-based Policy Optimization for Decentralized Networked Systems
Reinforcement learning algorithms require a large amount of samples; this often limits their real-world applications on even simple …
Yali Du
,
Chengdong Ma
,
Yuchen Liu
,
Runji Lin
,
Hao Dong
,
Jun Wang
,
Yaodong Yang
PDF
Cite
A Unified Diversity Measure for Multiagent Reinforcement Learning
Promoting behavioural diversity is of critical importance in multi-agent reinforcement learning, since it helps the agent population …
Zongkai Liu
,
Chao Yu
,
Yaodong Yang
,
Peng Sun
,
Zifan Wu
,
Yuan Li
PDF
Cite
MATE: Benchmarking Multi-Agent Reinforcement Learning in Distributed Target Coverage Control
We introduce the Multi-Agent Tracking Environment (MATE), a novel multi-agent environment simulates the target coverage control …
Xuehai Pan
,
Mickel Liu
,
Fangwei Zhong
,
Yaodong Yang
,
Song-Chun Zhu
,
Yizhou Wang
PDF
Cite
Offline Pre-trained Multi-agent Decision Transformer
Offline reinforcement learning leverages previously collected offline datasets to learn optimal policies with no necessity to access …
Linghui Meng
,
Muning Wen
,
Chenyang Le
,
Xiyun Li
,
Dengpeng Xing
,
Weinan Zhang
,
Ying Wen
,
Haifeng Zhang
,
Jun Wang
,
Yaodong Yang
,
Bo Xu
PDF
Cite
»
Cite
×