Research Article

EAQR: A Multiagent Q-Learning Algorithm for Coordination of Multiple Agents

Table 6

Minimal cumulative reward for the DSN problem (evaluation episodes = 5000).

 = 10,000 = 50,000 = 100,000

EAQR39.2641.9742
WoLF-PHC37.6538.7138.82
EMA Q-learning32.9232.0832.53
Single-agent RL25.1229.3832.21