Table of Contents Author Guidelines Submit a Manuscript
Mathematical Problems in Engineering
Volume 2015, Article ID 597956, 10 pages
http://dx.doi.org/10.1155/2015/597956
Research Article

Genetic Scheduling and Reinforcement Learning in Multirobot Systems for Intelligent Warehouses

1Department of Control and Systems Engineering, School of Management and Engineering, Nanjing University, Nanjing 210093, China
2Research Center for Novel Technology of Intelligent Equipments, Nanjing University, Nanjing 210093, China

Received 8 August 2015; Accepted 15 December 2015

Academic Editor: Luciano Mescia

Copyright © 2015 Jiajia Dou et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. R. de Koster, T. Le-Duc, and K. J. Roodbergen, “Design and control of warehouse order picking: a literature review,” European Journal of Operational Research, vol. 182, no. 2, pp. 481–501, 2007. View at Publisher · View at Google Scholar · View at Scopus
  2. P. R. Wurman, R. D'Andrea, and M. Mountz, “Coordinating hundreds of cooperative, autonomous vehicles in warehouses,” AI Magazine, vol. 29, no. 1, pp. 9–19, 2008. View at Google Scholar · View at Scopus
  3. J. Gu, M. Goetschalckx, and L. F. McGinnis, “Research on warehouse operation: a comprehensive review,” European Journal of Operational Research, vol. 177, no. 1, pp. 1–21, 2007. View at Publisher · View at Google Scholar · View at Scopus
  4. T.-K. Wang, Q. Dang, and P.-Y. Pan, “Path planning approach in unknown environment,” International Journal of Automation and Computing, vol. 7, no. 3, pp. 310–316, 2010. View at Publisher · View at Google Scholar · View at Scopus
  5. B. P. Gerkey and M. J. Matarić, “A formal analysis and taxonomy of task allocation in multi-robot systems,” The International Journal of Robotics Research, vol. 23, no. 9, pp. 939–954, 2004. View at Publisher · View at Google Scholar · View at Scopus
  6. N. R. Jennings, Coordination Techniques for Distributed Artificial Intelligence, Wiley-Interscience, 1996.
  7. T. Sandholm, “Algorithm for optimal winner determination in combinatorial auctions,” Artificial Intelligence, vol. 135, no. 1-2, pp. 1–54, 2002. View at Publisher · View at Google Scholar · View at Zentralblatt MATH · View at MathSciNet · View at Scopus
  8. K.-S. Hwang, S.-W. Tan, and C.-C. Chen, “Cooperative strategy based on adaptive Q-learning for robot soccer systems,” IEEE Transactions on Fuzzy Systems, vol. 12, no. 4, pp. 569–576, 2004. View at Publisher · View at Google Scholar · View at Scopus
  9. N. D. Richards, M. Sharma, and D. G. Ward, “A hybrid A/automaton approach to on-line path planning with obstacle avoidance,” in Proceedings of the 1st AIAA Intelligent Systems Technical Conference, pp. 141–157, Chicago, Ill, USA, September 2004.
  10. M. Elango, S. Nachiappan, and M. K. Tiwari, “Balancing task allocation in multi-robot systems using K-means clustering and auction based mechanisms,” Expert Systems with Applications, vol. 38, no. 6, pp. 6486–6491, 2011. View at Publisher · View at Google Scholar · View at Scopus
  11. L. Zhou, Y. Shi, J. Wang, and P. Yang, “A balanced heuristic mechanism for multirobot task allocation of intelligent warehouses,” Mathematical Problems in Engineering, vol. 2014, Article ID 380480, 10 pages, 2014. View at Publisher · View at Google Scholar · View at Scopus
  12. R. D'Andrea and P. Wurman, “Future challenges of coordinating hundreds of autonomous vehicles in distribution facilities,” in Proceedings of the IEEE International Conference on Technologies for Practical Robot Applications (TePRA '08), pp. 80–83, Woburn, Mass, USA, November 2008. View at Publisher · View at Google Scholar · View at Scopus
  13. J. Enright and R. Wurman, “Optimization and coordinated autonomy in mobile fulfillment systems,” in Proceedings of the Workshops at the 25th AAAI Conference on Artificial Intelligence, pp. 33–38, San Francisco, Calif, USA, August 2011.
  14. P. Larrañaga, C. M. H. Kuijpers, R. H. Murga, I. Inza, and S. Dizdarevic, “Genetic algorithms for the travelling salesman problem: a review of representations and operators,” Artificial Intelligence Review, vol. 13, no. 2, pp. 129–170, 1999. View at Publisher · View at Google Scholar · View at Scopus
  15. M. F. Tasgetiren, P. N. Suganthan, Q.-K. Pan, and Y.-C. Liang, “A genetic algorithm for the generalized traveling salesman problem,” in Proceedings of the IEEE Congress on Evolutionary Computation (CEC '07), pp. 2382–2389, Singapore, September 2007. View at Publisher · View at Google Scholar · View at Scopus
  16. J. Li, Q. Sun, M. C. Zhou, and X. Dai, “A new multiple traveling salesman problem and its genetic algorithm-based solution,” in Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics (SMC '13), pp. 627–632, IEEE, Manchester, UK, October 2013. View at Publisher · View at Google Scholar · View at Scopus
  17. A. Király and J. Abonyi, “Optimization of multiple traveling salesmen problem by a novel representation based genetic algorithm,” in Intelligent Computational Optimization in Engineering, vol. 366 of Studies in Computational Intelligence, pp. 241–269, Springer, Berlin, Germany, 2011. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  18. C. Chen, D. Dong, H. X. Li, and T. J. Tarn, “Hybrid MDP based integrated hierarchical q-learning,” Science in China Series F—Information Sciences, vol. 54, no. 11, pp. 2279–2294, 2011. View at Google Scholar
  19. A. Konar, I. G. Chakraborty, S. J. Singh, L. C. Jain, and A. K. Nagar, “A deterministic improved Q-learning for path planning of a mobile robot,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 43, no. 5, pp. 1141–1153, 2013. View at Publisher · View at Google Scholar · View at Scopus
  20. C. Chen, H.-X. Li, and D. Dong, “Hybrid control for robot navigation—a hierarchical Q-learning algorithm,” IEEE Robotics and Automation Magazine, vol. 15, no. 2, pp. 37–47, 2008. View at Publisher · View at Google Scholar · View at Scopus
  21. C. Chen, D. Dong, H.-X. Li, J. Chu, and T.-J. Tarn, “Fidelity-based probabilistic Q-learning for control of quantum systems,” IEEE Transactions on Neural Networks and Learning Systems, vol. 25, no. 5, pp. 920–933, 2014. View at Publisher · View at Google Scholar · View at Scopus
  22. D. Dong, C. Chen, J. Chu, and T.-J. Tarn, “Robust quantum-inspired reinforcement learning for robot navigation,” IEEE/ASME Transactions on Mechatronics, vol. 17, no. 1, pp. 86–97, 2012. View at Publisher · View at Google Scholar · View at Scopus
  23. R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, MIT Press, Cambridge, Mass, USA, 1998.
  24. T. Kollar and N. Roy, “Using reinforcement learning to improve exploration trajectories for error minimization,” in Proceedings of the IEEE International Conference on Robotics and Automation (ICRA '06), pp. 3338–3343, Orlando, Fla, USA, May 2006. View at Publisher · View at Google Scholar · View at Scopus
  25. H. H. Viet, P. H. Kyaw, and T. C. Chung, “Simulation-based evaluations of reinforcement learning algorithms for autonomous mobile robot path planning,” in IT Convergence and Services, vol. 107 of Lecture Notes in Electrical Engineering, pp. 467–476, Springer, Amsterdam, The Netherlands, 2011. View at Publisher · View at Google Scholar
  26. M. A. K. Jaradat, M. Al-Rousan, and L. Quadan, “Reinforcement based mobile robot navigation in dynamic environment,” Robotics and Computer-Integrated Manufacturing, vol. 27, no. 1, pp. 135–149, 2011. View at Publisher · View at Google Scholar · View at Scopus
  27. M. Tan, “Multi-agent reinforcement learning: independent vs. cooperative agents,” in Proceedings of the 10th International Conference on Machine Learning (ICML '93), pp. 330–337, Amherst, Mass, USA, June 1993.
  28. L. Buşoniu, R. Babuška, and B. De Schutter, “A comprehensive survey of multiagent reinforcement learning,” IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews, vol. 38, no. 2, pp. 156–172, 2008. View at Publisher · View at Google Scholar · View at Scopus
  29. Y. Hu, Y. Gao, and B. An, “Multiagent reinforcement learning with unshared value functions,” IEEE Transactions on Cybernetics, vol. 45, no. 4, pp. 647–662, 2015. View at Publisher · View at Google Scholar · View at Scopus