Heterogeneous-Agent Mirror Learning