Robust and Diverse Multi-Agent Learning via Rational Policy Gradient

12 views • 2 days ago