Explicit Dynamic Coordination Reinforcement Learning Based on Utility

Huaiwei Si; Guozhen Tan; Yifu Yuan; Yanfei peng; Jianping Li

Explicit Dynamic Coordination Reinforcement Learning Based on Utility

Vol. 16, No. 3, March 31, 2022

10.3837/tiis.2022.03.003, Download Paper (Free):

Abstract

Multi-agent systems often need to achieve the goal of learning more effectively for a task through coordination. Although the introduction of deep learning has addressed the state space problems, multi-agent learning remains infeasible because of the joint action spaces. Large-scale joint action spaces can be sparse according to implicit or explicit coordination structure, which can ensure reasonable coordination action through the coordination structure. In general, the multi-agent system is dynamic, which makes the relations among agents and the coordination structure are dynamic. Therefore, the explicit coordination structure can better represent the coordinative relationship among agents and achieve better coordination between agents. Inspired by the maximization of social group utility, we dynamically construct a factor graph as an explicit coordination structure to express the coordinative relationship according to the utility among agents and estimate the joint action values based on the local utility transfer among factor graphs. We present the application of such techniques in the scenario of multiple intelligent vehicle systems, where state space and action space are a problem and have too many interactions among agents. The results on the multiple intelligent vehicle systems demonstrate the efficiency and effectiveness of our proposed methods.

Statistics

Show / Hide Statistics

Cite this article

[IEEE Style]

H. Si, G. Tan, Y. Yuan, Y. peng, J. Li, "Explicit Dynamic Coordination Reinforcement Learning Based on Utility," KSII Transactions on Internet and Information Systems, vol. 16, no. 3, pp. 792-812, 2022. DOI: 10.3837/tiis.2022.03.003.

[ACM Style]

Huaiwei Si, Guozhen Tan, Yifu Yuan, Yanfei peng, and Jianping Li. 2022. Explicit Dynamic Coordination Reinforcement Learning Based on Utility. KSII Transactions on Internet and Information Systems, 16, 3, (2022), 792-812. DOI: 10.3837/tiis.2022.03.003.

[BibTeX Style]

@article{tiis:25517, title="Explicit Dynamic Coordination Reinforcement Learning Based on Utility", author="Huaiwei Si and Guozhen Tan and Yifu Yuan and Yanfei peng and Jianping Li and ", journal="KSII Transactions on Internet and Information Systems", DOI={10.3837/tiis.2022.03.003}, volume={16}, number={3}, year="2022", month={March}, pages={792-812}}

Explicit Dynamic Coordination Reinforcement Learning Based on Utility

Abstract

Statistics

Cite this article

[IEEE Style]

[ACM Style]

[BibTeX Style]

Unified Search
(in title, author, abstract, and keywords)

Category Search

Explicit Dynamic Coordination Reinforcement Learning Based on Utility

Abstract

Statistics

Cite this article

[IEEE Style]

[ACM Style]

[BibTeX Style]

Unified Search (in title, author, abstract, and keywords)

Category Search

Unified Search
(in title, author, abstract, and keywords)