• KSII Transactions on Internet and Information Systems
    Monthly Online Journal (eISSN: 1976-7277)

PGA: An Efficient Adaptive Traffic Signal Timing Optimization Scheme Using Actor-Critic Reinforcement Learning Algorithm


Abstract

Advanced traffic signal timing method plays very important role in reducing road congestion and air pollution. Reinforcement learning is considered as superior approach to build traffic light timing scheme by many recent studies. It fulfills real adaptive control by the means of taking real-time traffic information as state, and adjusting traffic light scheme as action. However, existing works behave inefficient in complex intersections and they are lack of feasibility because most of them adopt traffic light scheme whose phase sequence is flexible. To address these issues, a novel adaptive traffic signal timing scheme is proposed. It's based on actor-critic reinforcement learning algorithm, and advanced techniques proximal policy optimization and generalized advantage estimation are integrated. In particular, a new kind of reward function and a simplified form of state representation are carefully defined, and they facilitate to improve the learning efficiency and reduce the computational complexity, respectively. Meanwhile, a fixed phase sequence signal scheme is derived, and constraint on the variations of successive phase durations is introduced, which enhances its feasibility and robustness in field applications. The proposed scheme is verified through field-data-based experiments in both medium and high traffic density scenarios. Simulation results exhibit remarkable improvement in traffic performance as well as the learning efficiency comparing with the existing reinforcement learning-based methods such as 3DQN and DDQN.


Statistics

Show / Hide Statistics

Statistics (Cumulative Counts from December 1st, 2015)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article

[IEEE Style]
S. Shen, G. Shen, Y. Shen, D. Liu, X. Yang, X. Kong, "PGA: An Efficient Adaptive Traffic Signal Timing Optimization Scheme Using Actor-Critic Reinforcement Learning Algorithm," KSII Transactions on Internet and Information Systems, vol. 14, no. 11, pp. 4268-4289, 2020. DOI: 10.3837/tiis.2020.11.002.

[ACM Style]
Si Shen, Guojiang Shen, Yang Shen, Duanyang Liu, Xi Yang, and Xiangjie Kong. 2020. PGA: An Efficient Adaptive Traffic Signal Timing Optimization Scheme Using Actor-Critic Reinforcement Learning Algorithm. KSII Transactions on Internet and Information Systems, 14, 11, (2020), 4268-4289. DOI: 10.3837/tiis.2020.11.002.

[BibTeX Style]
@article{tiis:24025, title="PGA: An Efficient Adaptive Traffic Signal Timing Optimization Scheme Using Actor-Critic Reinforcement Learning Algorithm", author="Si Shen and Guojiang Shen and Yang Shen and Duanyang Liu and Xi Yang and Xiangjie Kong and ", journal="KSII Transactions on Internet and Information Systems", DOI={10.3837/tiis.2020.11.002}, volume={14}, number={11}, year="2020", month={November}, pages={4268-4289}}