Table of Contents Author Guidelines Submit a Manuscript
Mathematical Problems in Engineering
Volume 2013 (2013), Article ID 962869, 10 pages
http://dx.doi.org/10.1155/2013/962869
Research Article

The Study of Reinforcement Learning for Traffic Self-Adaptive Control under Multiagent Markov Game Environment

1School of Civil Engineering and Transportation, South China University of Technology, Guangzhou 510640, China
2School of Electronic Engineering and Automation, Jiangxi University of Science and Technology, Ganzhou, Jiangxi 341000, China
3School of Port and Shipping Management, Guangzhou Marine Institute, Guangzhou 510725, China

Received 25 February 2013; Revised 12 August 2013; Accepted 26 August 2013

Academic Editor: Orwa Jaber Housheya

Copyright © 2013 Lun-Hui Xu et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. C. Diakaki, M. Papageorgiou, and K. Aboudolas, “A multivariable regulator approach to traffic-responsive network—wide signal control,” Control Engineering Practice, vol. 10, no. 2, pp. 183–195, 2002. View at Publisher · View at Google Scholar · View at Scopus
  2. P. G. Balaji, D. Srinivasan, and C.-K. Tham, “Coordination in distributed multi-agent system using type-2 fuzzy decision systems,” in Proceedings of the IEEE International Conference on Fuzzy Systems (FUZZ '08), pp. 2291–2298, Piscataway, NJ, USA, June 2008. View at Publisher · View at Google Scholar · View at Scopus
  3. D. Srinivasan, M. C. Choy, and R. L. Cheu, “Neural networks for real-time traffic signal control,” IEEE Transactions on Intelligent Transportation Systems, vol. 7, no. 3, pp. 261–272, 2006. View at Publisher · View at Google Scholar · View at Scopus
  4. J. J. Sánchez, M. Galán, and E. Rubio, “Genetic algorithms and cellular automata: a new architecture for traffic light cycles optimization,” in Proceedings of the Congress on Evolutionary Computation (CEC '04), pp. 1668–1674, June 2004. View at Scopus
  5. R. Sutton and A. Barto, Reinforcement Learning: An Introduction, MIT Press, Boston, Mass, USA, 1998.
  6. M. Wiering, “Multi-agent reinforcement learning for traffic light control,” in Proceedings of the 17th International Conference on Machine Learning and Applications, pp. 1151–1158, 2000.
  7. B. Abdulhai, R. Pringle, and G. J. Karakoulas, “Reinforcement learning for true adaptive traffic signal control,” Journal of Transportation Engineering, vol. 129, no. 3, pp. 278–285, 2003. View at Publisher · View at Google Scholar · View at Scopus
  8. M. D. Pendrith, “Distributed reinforcement learning for a traffic engineering application,” in Proceedings of the 4th International Conference on Autonomous Agents, pp. 404–411, ACM Press, Rockville, Md, USA, June 2000. View at Scopus
  9. B. Bakker, M. Steingrover, R. Schouten, E. Nijhuis, and L. Kester, “Cooperative multi-agent reinforcement learning of traffic lights,” in Proceedings of the Workshop on Cooperative Multi-Agent Learning, pp. 24–36, 2005.
  10. A. Salkham, R. Cunningham, A. Garg, and V. Cahill, “A collaborative reinforcement learning approach to urban traffic control optimization,” in Proceedings of the IEEE/WIC/ACM International Conference on Intelligent Agent Technology (IAT '08), pp. 560–566, Sydney, Australia, December 2008. View at Publisher · View at Google Scholar · View at Scopus
  11. I. Arel, C. Liu, T. Urbanik, and A. G. Kohls, “Reinforcement learning-based multi-agent system for network traffic signal control,” IET Intelligent Transport Systems, vol. 4, no. 2, pp. 128–135, 2010. View at Publisher · View at Google Scholar · View at Scopus
  12. J. C. Medina and R. F. Benekohal, “Q-learning and approximate dynamic programming for traffic control—a case study for an oversaturated network,” in Proceedings of the Boar Annual Meeting, 2012.
  13. M. Abdoos, N. Mozayani, and A. L. C. Bazzan, “Traffic light control in non-stationary environments based on multi agent Q-learning,” in Proceedings of the 14th IEEE International Intelligent Transportation Systems Conference (ITSC '11), pp. 1580–1585, Washington, DC, USA, October 2011. View at Publisher · View at Google Scholar · View at Scopus
  14. P. G. Balaji, X. German, and D. Srinivasan, “Urban traffic signal control using reinforcement learning agents,” IET Intelligent Transport Systems, vol. 4, no. 3, pp. 177–188, 2010. View at Publisher · View at Google Scholar · View at Scopus
  15. L. Kuyer, S. Whiteson, B. Bakker, and N. Vlassis, “Multiagent reinforcement learning for urban traffic control using coordination graph,” in Proceedings of the 19th European Conference on Machine Learning, 2008.
  16. S. E. Tantawy and B. Abdulhai, “Multi-agent reinforcement learning for integrated network of adaptive traffic signal controllers (MARLIN-ATSC),” in Proceedings of the 15th International IEEE Annual Conference on Intelligent Transportation Systems (ITSC '12), pp. 319–326, Anchorage, Alaska, USA, 2012.
  17. P. Koonce, “Traffic signal timing manual,” US Department of Transportation FHWA-HOP-08-024, Federal Highway Administration, 2008. View at Google Scholar
  18. M. L. Littman, “Markov games as a framework for muitiagent reinforcement learning,” in Proceedings of the 11th International Conference on Machine learning, pp. 157–153, Morgan Kaufmann, Boston, Mass, USA, 1994.
  19. I. Alvarez, A. Poznyak, and A. Malo, “Urban traffic control problem a game theory approach,” in Proceedings of the 47th IEEE Conference on Decision and Control (CDC '08), pp. 2168–2172, Cancún, Mexico, December 2008. View at Publisher · View at Google Scholar · View at Scopus
  20. J. Hu and M. P. Wellman, “Nash Q-learning for general-sum stochastic games,” Journal of Machine Learning Research (JMLR), vol. 4, no. 6, pp. 1039–1069, 2003. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet
  21. C. J. C. H. Watkins and P. Dayan, “Q-learning,” Machine Learning, vol. 8, no. 3-4, pp. 279–292, 1992. View at Publisher · View at Google Scholar · View at Scopus
  22. C. Szepesvári and M. L. Littman, “A unified analysis of value-function-based reinforcement-learning algorithms,” Neural Computation, vol. 11, no. 8, pp. 2017–2060, 1999. View at Google Scholar · View at Scopus
  23. J. Filar and K. Vrieze, Competitive Markov Decision Processes, Springer, New York, NY, USA, 1997. View at MathSciNet
  24. W. Wang, Urban Traffic Planning, Southeast University Press, Nanjing, China, 1999.
  25. R. Wunderlich, C. Liu, I. Elhanany, and T. Urbanik II, “A novel signal-scheduling algorithm with quality-of-service provisioning for an isolated intersection,” IEEE Transactions on Intelligent Transportation Systems, vol. 9, no. 3, pp. 536–547, 2008. View at Publisher · View at Google Scholar · View at Scopus