PAIR Lab: PKU Alignment and Interaction Research Lab
PAIR Lab: PKU Alignment and Interaction Research Lab
Open-Source Projects
People
Talks
Publications
Resources
Contact
Shapley Value Approximation
STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning
Centralized Training with Decentralized Execution (CTDE) has been proven to be an effective paradigm in cooperative multi-agent …
Sirui Chen
,
Zhaowei Zhang
,
Yaodong Yang
,
Yali Du
PDF
Cite
Cite
×