Abstract
Recently, there has been a surge of interest in decentralized learning approaches to tackle complex collaborative tasks in multi-agent systems. One of the most promising approaches is multi-agent reinforcement learning (MARL). Yet, as the number of agents becomes larger, the sample complexity in MARL increases exponentially, making scalability a fundamental issue. Networked MARL algorithms can address this issue by leveraging a communication network for information exchange between the agents. For homogeneous network MARL, previous research established a regret upper-bound √MH4SAT. Recent approaches rely on global knowledge about the structure of the communication network, which poses a serious limitation when it is not known or changes depending on the task. In this paper, we overcome this limitation by proposing a novel networked MARL algorithm with an upperconfidence bound (UCB) exploration strategy, called provably efficient local-information networked (PrELIN) MARL, that does not require any global information and relies only on the local interactions between the agents. Furthermore, we derive the regret and sample complexity for our algorithm and show that the regret bound remains sublinear.
| Original language | English |
|---|---|
| Title of host publication | 2025 IEEE 64th Conference on Decision and Control (CDC) |
| Publisher | IEEE |
| Pages | 6077-6082 |
| Number of pages | 6 |
| ISBN (Electronic) | 9798331526276 |
| ISBN (Print) | 9798331526283 |
| DOIs | |
| Publication status | Published - 12 Jan 2026 |
| Event | 2025 IEEE 64th Conference on Decision and Control - Rio de Janeiro, Brazil Duration: 10 Dec 2025 → 12 Dec 2025 https://cdc2025.ieeecss.org/ (Conference homepage) |
Publication series
| Name | Proceedings of the IEEE Conference on Decision & Control |
|---|---|
| Publisher | IEEE |
| ISSN (Print) | 0743-1546 |
| ISSN (Electronic) | 2576-2370 |
Conference
| Conference | 2025 IEEE 64th Conference on Decision and Control |
|---|---|
| Abbreviated title | CDC2025 |
| Country/Territory | Brazil |
| City | Rio de Janeiro |
| Period | 10/12/25 → 12/12/25 |
| Internet address |
|
Keywords
- MARL
- statistical ML
- Distributed learning