PAIR Lab: PKU Alignment and Interaction Research Lab
Online Markov Decision Process
Online Markov Decision Processes with Non-oblivious Strategic Adversary
We study a novel setting in Online Markov Decision Processes (OMDPs) where the loss function is chosen by a non-oblivious strategic …
Le Cong Dinh, David Henry Mguni, Long Tran-Thanh, Jun Wang, Yaodong Yang