A Deeper Understanding of State-Based Critics in Multi-Agent Reinforcement Learning

Xueguang Lyu, Andrea Baisero, Yuchen Xiao, Christopher Amato

[AAAI-22] Main Track
Abstract: Centralized Training for Decentralized Execution, where training is done in a centralized offline fashion, has become a popular solution paradigm in Multi-Agent Reinforcement Learning.

Many such methods take the form of actor-critic with state-based critics, since centralized training allows access to the true system state, which can be useful during training despite not being available at execution time.
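As a concrete illustration of this setup, the sketch below (a minimal PyTorch example with assumed network shapes and synthetic data, not the paper's implementation) pairs decentralized actors that condition only on their local observation histories with a centralized critic that may condition on either the joint history or the true state:

```python
# Minimal sketch (assumed, not the paper's code): decentralized actors plus a
# centralized critic whose input is either the joint history or the true state.
import torch
import torch.nn as nn

N_AGENTS, OBS_DIM, STATE_DIM, N_ACTIONS = 2, 8, 12, 4

class Actor(nn.Module):
    """Decentralized policy pi_i(a_i | h_i); h_i is a flat encoding of agent i's observation history."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(OBS_DIM, 64), nn.ReLU(), nn.Linear(64, N_ACTIONS))

    def forward(self, h_i):
        return torch.distributions.Categorical(logits=self.net(h_i))

class CentralizedCritic(nn.Module):
    """Q(central_input, joint_action), usable only during centralized training.
    central_input is the joint history (history-based) or the true state (state-based)."""
    def __init__(self, input_dim):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(input_dim + N_AGENTS, 128), nn.ReLU(), nn.Linear(128, 1))

    def forward(self, central_input, joint_action):
        return self.net(torch.cat([central_input, joint_action.float()], dim=-1))

actors = [Actor() for _ in range(N_AGENTS)]
history_critic = CentralizedCritic(N_AGENTS * OBS_DIM)  # Q(h, a): conditions on joint history
state_critic = CentralizedCritic(STATE_DIM)              # Q(s, a): conditions on true state

# One synthetic policy-gradient step using the state-based critic as the weight.
obs = torch.randn(32, N_AGENTS, OBS_DIM)   # per-agent history encodings (random stand-ins)
state = torch.randn(32, STATE_DIM)         # true state, available only during training
dists = [actor(obs[:, i]) for i, actor in enumerate(actors)]
joint_action = torch.stack([d.sample() for d in dists], dim=-1)
logp = sum(d.log_prob(joint_action[:, i]) for i, d in enumerate(dists))
q = state_critic(state, joint_action).squeeze(-1).detach()  # or history_critic(obs.flatten(1), joint_action)
(-(logp * q).mean()).backward()
```

The only difference between the two critic variants is the input they condition on; the execution-time policies never see the state.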

State-based critics have become a common empirical choice, albeit one with limited theoretical justification or analysis.

In this paper, we show that state-based critics can introduce bias in the policy gradient estimates, potentially undermining the asymptotic guarantees of the algorithm.
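To make the bias claim concrete, consider the two single-sample policy-gradient estimators below, written in standard (assumed) Dec-POMDP notation: joint history h, local histories h_i, state s, joint action a, with Q^\pi(s,a) denoting the value the state-based critic estimates.

```latex
\[
  g_{\mathrm{hist}} = \sum_{i} \nabla_{\theta} \log \pi_{i}(a_{i} \mid h_{i}) \, Q^{\pi}(h, a),
  \qquad
  g_{\mathrm{state}} = \sum_{i} \nabla_{\theta} \log \pi_{i}(a_{i} \mid h_{i}) \, Q^{\pi}(s, a),
\]
\[
  \text{and} \quad
  \mathbb{E}\left[ g_{\mathrm{state}} \right] = \mathbb{E}\left[ g_{\mathrm{hist}} \right]
  \quad \text{whenever} \quad
  \mathbb{E}_{s \sim \Pr(\cdot \mid h)}\left[ Q^{\pi}(s, a) \right] = Q^{\pi}(h, a)
  \;\; \text{for all } (h, a).
\]
```

The state-based estimator is thus unbiased when averaging the state-based value over the states consistent with the history recovers the history-based value; the abstract's first claim is that this need not hold.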

We also show that, even if the state-based critics do not introduce any bias, they can still result in a larger gradient variance, contrary to the common intuition.
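One way larger variance can arise even without bias (a sketch consistent with the abstract's claim, not necessarily the paper's exact argument): assume the condition above holds, write \nabla_\theta \log \pi(a \mid h) := \sum_i \nabla_\theta \log \pi_i(a_i \mid h_i), and measure variance as the trace of the covariance. The law of total variance then gives

```latex
\[
  \operatorname{Var}\left[ g_{\mathrm{state}} \right]
  = \operatorname{Var}\left[ \mathbb{E}\left[ g_{\mathrm{state}} \mid h, a \right] \right]
  + \mathbb{E}\left[ \operatorname{Var}\left[ g_{\mathrm{state}} \mid h, a \right] \right]
  = \operatorname{Var}\left[ g_{\mathrm{hist}} \right]
  + \mathbb{E}\left[ \left\| \nabla_{\theta} \log \pi(a \mid h) \right\|^{2}
      \operatorname{Var}_{s \sim \Pr(\cdot \mid h)}\left[ Q^{\pi}(s, a) \right] \right]
  \ge \operatorname{Var}\left[ g_{\mathrm{hist}} \right],
\]
```

so under these assumptions the state-based estimator's variance exceeds the history-based one's by exactly the critic's residual uncertainty about the state given the history.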

Finally, we demonstrate these theoretical results in practice by comparing different forms of centralized critics on a wide range of common benchmarks, and detail how various environmental properties relate to the effectiveness of each type of critic.


Sessions where this paper appears

  • Poster Session 4 (Red 3)

  • Poster Session 8 (Red 3)