Table of Contents Author Guidelines Submit a Manuscript
Computational Intelligence and Neuroscience
Volume 2016 (2016), Article ID 7158507, 14 pages
http://dx.doi.org/10.1155/2016/7158507
Research Article

A Self-Organizing Incremental Spatiotemporal Associative Memory Networks Model for Problems with Hidden State

Department of Computer Science and Software, Tianjin Polytechnic University, Tianjin 300387, China

Received 31 May 2016; Revised 23 July 2016; Accepted 27 July 2016

Academic Editor: Manuel Graña

Copyright © 2016 Zuo-wei Wang. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. A. K. McCallum, Reinforcement learning with selective perception and hidden state [Ph.D. thesis], University of Rochester, Rochester, NY, USA, 1996.
  2. M. Hutter, “Feature reinforcement learning: part I. unstructured MDPs,” Journal of Artificial General Intelligence, vol. 1, no. 1, pp. 3–24, 2009. View at Publisher · View at Google Scholar
  3. M. Daswani, P. Sunehag, and M. Hutter, “Feature reinforcement learning: state of the art,” in Proceedings of the Workshops at the 28th AAAI Conference on Artificial Intelligence, 2014.
  4. P. Nguyen, P. Sunehag, and M. Hutter, “Context tree maximizing reinforcement learning,” in Proceedings of the 26th AAAI Conference on Artificial Intelligence, pp. 1075–1082, Toronto, Canada, July 2012.
  5. J. Veness, K. S. Ng, M. Hutter, W. Uther, and D. Silver, “A Monte-Carlo AIXI approximation,” Journal of Artificial Intelligence Research, vol. 40, no. 1, pp. 95–142, 2011. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  6. M. P. Holmes and C. L. Isbell Jr., “Looping suffix tree-based inference of partially observable hidden state,” in Proceedings of the 23rd International Conference on Machine Learning (ICML '06), pp. 409–416, ACM, Pittsburgh, Pa, USA, 2006.
  7. M. Daswani, P. Sunehag, and M. Hutter, “Feature reinforcement learning using looping suffix trees,” in Proceedings of the 10th European Workshop on Reinforcement Learning (EWRL '12), pp. 11–24, Edinburgh, Scotland, 2012.
  8. M. Daswani, P. Sunehag, and M. Hutter, “Q-learning for history-based reinforcement learning,” in Proceedings of the Conference on Machine Learning, pp. 213–228, 2013.
  9. S. Timmer and M. Riedmiller, Reinforcement learning with history lists [Ph.D. thesis], University of Osnabrück, Osnabrück, Germany, 2009.
  10. E. Talvitie, “Learning partially observable models using temporally abstract decision trees,” in Advances in Neural Information Processing Systems, pp. 818–826, 2012. View at Google Scholar
  11. M. Mahmud, “Constructing states for reinforcement learning,” in Proceedings of the 27th International Conference on Machine Learning (ICML '10), pp. 727–734, June 2010. View at Scopus
  12. S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural Computation, vol. 9, no. 8, pp. 1735–1780, 1997. View at Publisher · View at Google Scholar · View at Scopus
  13. V. Mnih, K. Kavukcuoglu, D. Silver et al., “Human-level control through deep reinforcement learning,” Nature, vol. 518, no. 7540, pp. 529–533, 2015. View at Publisher · View at Google Scholar · View at Scopus
  14. M. Hausknecht and P. Stone, “Deep recurrent Q-learning for partially observable MDPs,” http://arxiv.org/abs/1507.06527
  15. K. Narasimhan, T. Kulkarni, and R. Barzilay, “Language understanding for text-based games using deep reinforcement learning,” https://arxiv.org/abs/1506.08941
  16. J. Oh, X. Guo, H. Lee, R. Lewis, and S. Singh, “Action-conditional video prediction using deep networks in atari games,” http://arxiv.org/abs/1507.08750
  17. X. Li, L. Li, J. Gao et al., “Recurrent reinforcement learning: a hybrid approach,” http://arxiv.org/abs/1509.03044
  18. D. Wang and B. Yuwono, “Incremental learning of complex temporal patterns,” IEEE Transactions on Neural Networks, vol. 7, no. 6, pp. 1465–1481, 1996. View at Publisher · View at Google Scholar · View at Scopus
  19. A. Sudo, A. Sato, and O. Hasegawa, “Associative memory for online learning in noisy environments using self-organizing incremental neural network,” IEEE Transactions on Neural Networks, vol. 20, no. 6, pp. 964–972, 2009. View at Publisher · View at Google Scholar · View at Scopus
  20. M. U. Keysermann and P. A. Vargas, “Towards autonomous robots via an incremental clustering and associative learning architecture,” Cognitive Computation, vol. 7, no. 4, pp. 414–433, 2014. View at Publisher · View at Google Scholar · View at Scopus
  21. A. F. R. Araújo and G. D. A. Barreto, “Context in temporal sequence processing: a self-organizing approach and its application to robotics,” IEEE Transactions on Neural Networks, vol. 13, no. 1, pp. 45–57, 2002. View at Publisher · View at Google Scholar · View at Scopus
  22. A. L. Freire, G. A. Barreto, M. Veloso, and A. T. Varela, “Short-term memory mechanisms in neural network learning of robot navigation tasks: A Case Study,” in Proceedings of the 6th Latin American Robotics Symposium (LARS '09), pp. 1–6, October 2009. View at Publisher · View at Google Scholar · View at Scopus
  23. S. Tangruamsub, A. Kawewong, M. Tsuboyama, and O. Hasegawa, “Self-organizing incremental associative memory-based robot navigation,” IEICE Transactions on Information and Systems, vol. 95, no. 10, pp. 2415–2425, 2012. View at Publisher · View at Google Scholar · View at Scopus
  24. V. A. Nguyen, J. A. Starzyk, W.-B. Goh, and D. Jachyra, “Neural network structure for spatio-temporal long-term memory,” IEEE Transactions on Neural Networks and Learning Systems, vol. 23, no. 6, pp. 971–983, 2012. View at Publisher · View at Google Scholar · View at Scopus
  25. F. Shen, H. Yu, W. Kasai, and O. Hasegawa, “An associative memory system for incremental learning and temporal sequence,” in Proceedings of the International Joint Conference on Neural Networks (IJCNN '10), pp. 1–8, IEEE, Barcelona, Spain, July 2010. View at Publisher · View at Google Scholar · View at Scopus
  26. F. Shen, Q. Ouyang, W. Kasai, and O. Hasegawa, “A general associative memory based on self-organizing incremental neural network,” Neurocomputing, vol. 104, pp. 57–71, 2013. View at Publisher · View at Google Scholar · View at Scopus
  27. B. Khouzam, “Neural networks as cellular computing models for temporal sequence processing,” Suplec, 2014