Table of Contents Author Guidelines Submit a Manuscript
Computational Intelligence and Neuroscience
Volume 2017, Article ID 4694860, 14 pages
https://doi.org/10.1155/2017/4694860
Research Article

Deep Recurrent Neural Network-Based Autoencoders for Acoustic Novelty Detection

1Machine Intelligence & Signal Processing Group, Technische Universität München, Munich, Germany
2audEERING GmbH, Gilching, Germany
3Chair of Complex & Intelligent Systems, University of Passau, Passau, Germany
4A3LAB, Department of Information Engineering, Università Politecnica delle Marche, Ancona, Italy
5Department of Computing, Imperial College London, London, UK

Correspondence should be addressed to Erik Marchi; ed.mut@ihcram.kire

Received 12 July 2016; Accepted 25 September 2016; Published 15 January 2017

Academic Editor: Stefan Haufe

Copyright © 2017 Erik Marchi et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. L. Tarassenko, P. Hayton, N. Cerneaz, and M. Brady, “Novelty detection for the identification of masses in mammograms,” in Proceedings of the 4th International Conference on Artificial Neural Networks, pp. 442–447, June 1995. View at Scopus
  2. J. Quinn and C. Williams, “Known unknowns: novelty detection in condition monitoring,” in Pattern Recognition and Image Analysis, J. Mart, J. Bened, A. Mendona, and J. Serrat, Eds., vol. 4477 of Lecture Notes in Computer Science, pp. 1–6, Springer, Berlin, Germany, 2007. View at Google Scholar
  3. L. Clifton, D. A. Clifton, P. J. Watkinson, and L. Tarassenko, “Identification of patient deterioration in vital-sign data using one-class support vector machines,” in Proceedings of the Federated Conference on Computer Science and Information Systems (FedCSIS '11), pp. 125–131, September 2011. View at Scopus
  4. Y.-H. Liu, Y.-C. Liu, and Y.-J. Chen, “Fast support vector data descriptions for novelty detection,” IEEE Transactions on Neural Networks, vol. 21, no. 8, pp. 1296–1313, 2010. View at Publisher · View at Google Scholar · View at Scopus
  5. C. Surace and K. Worden, “Novelty detection in a changing environment: a negative selection approach,” Mechanical Systems and Signal Processing, vol. 24, no. 4, pp. 1114–1128, 2010. View at Publisher · View at Google Scholar · View at Scopus
  6. J. A. Quinn, C. K. I. Williams, and N. McIntosh, “Factorial switching linear dynamical systems applied to physiological condition monitoring,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 31, no. 9, pp. 1537–1551, 2009. View at Publisher · View at Google Scholar · View at Scopus
  7. A. Patcha and J.-M. Park, “An overview of anomaly detection techniques: existing solutions and latest technological trends,” Computer Networks, vol. 51, no. 12, pp. 3448–3470, 2007. View at Publisher · View at Google Scholar · View at Scopus
  8. M. Markou and S. Singh, “A neural network-based novelty detector for image sequence analysis,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 28, no. 10, pp. 1664–1677, 2006. View at Publisher · View at Google Scholar · View at Scopus
  9. M. Markou and S. Singh, “Novelty detection: a review—part 1: statistical approaches,” Signal Processing, vol. 83, no. 12, pp. 2481–2497, 2003. View at Publisher · View at Google Scholar · View at Scopus
  10. M. Markou and S. Singh, “Novelty detection: a review—part 2: neural network based approaches,” Signal Processing, vol. 83, no. 12, pp. 2499–2521, 2003. View at Publisher · View at Google Scholar · View at Scopus
  11. E. R. de Faria, I. R. Goncalves, J. a. Gama, and A. C. Carvalho, “Evaluation of multiclass novelty detection algorithms for data streams,” IEEE Transactions on Knowledge and Data Engineering, vol. 27, no. 11, pp. 2961–2973, 2015. View at Publisher · View at Google Scholar
  12. C. Satheesh Chandran, S. Kamal, A. Mujeeb, and M. H. Supriya, “Novel class detection of underwater targets using self-organizing neural networks,” in Proceedings of the IEEE Underwater Technology (UT '15), Chennai, India, February 2015. View at Publisher · View at Google Scholar
  13. K. Worden, G. Manson, and D. Allman, “Experimental validation of a structural health monitoring methodology: part I. Novelty detection on a laboratory structure,” Journal of Sound and Vibration, vol. 259, no. 2, pp. 323–343, 2003. View at Publisher · View at Google Scholar · View at Scopus
  14. J. Foote, “Automatic audio segmentation using a measure of audio novelty,” in Proceedings of the IEEE Internatinal Conference on Multimedia and Expo (ICME '00), vol. 1, pp. 452–455, August 2000. View at Scopus
  15. B. Schölkopf, R. C. Williamson, A. J. Smola, J. Shawe-Taylor, and J. C. Platt, “Support vector method for novelty detection,” in Proceedings of the Neural Information Processing Systems (NIPS '99), vol. 12, pp. 582–588, Denver, Colo, USA, 1999.
  16. J. Ma and S. Perkins, “Time-series novelty detection using one-class support vector machines,” in Proceedings of the IEEE International Joint Conference on Neural Networks (IJCNN '03), vol. 3, pp. 1741–1745, Portland, Ore, USA, July 2003. View at Publisher · View at Google Scholar
  17. J. Ma and S. Perkins, “Online novelty detection on temporal sequences,” in Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '03), pp. 613–618, ACM, Washington, DC, USA, August 2003. View at Publisher · View at Google Scholar · View at Scopus
  18. P. Hayton, B. Schölkopf, L. Tarassenko, and P. Anuzis, “Support vector novelty detection applied to jet engine vibration spectra,” in Proceedings of the 14th Annual Neural Information Processing Systems Conference (NIPS '00), pp. 946–952, December 2000. View at Scopus
  19. L. Tarassenko, A. Nairac, N. Townsend, and P. Cowley, “Novelty detection in jet engines,” in Proceedings of the IEE Colloquium on Condition Monitoring: Machinery, External Structures and Health, pp. 4/1–4/5, Birmingham, UK, April 1999. View at Publisher · View at Google Scholar
  20. L. Clifton, D. A. Clifton, Y. Zhang, P. Watkinson, L. Tarassenko, and H. Yin, “Probabilistic novelty detection with support vector machines,” IEEE Transactions on Reliability, vol. 63, no. 2, pp. 455–467, 2014. View at Publisher · View at Google Scholar · View at Scopus
  21. D. R. Hardoon and L. M. Manevitz, “One-class machine learning approach for fmri analysis,” in Proceedings of the Postgraduate Research Conference in Electronics, Photonics, Communications and Networks, and Computer Science, Lancaster, UK, April 2005.
  22. M. Davy, F. Desobry, A. Gretton, and C. Doncarli, “An online support vector machine for abnormal events detection,” Signal Processing, vol. 86, no. 8, pp. 2009–2025, 2006. View at Publisher · View at Google Scholar · View at Scopus
  23. M. A. F. Pimentel, D. A. Clifton, L. Clifton, and L. Tarassenko, “A review of novelty detection,” Signal Processing, vol. 99, pp. 215–249, 2014. View at Publisher · View at Google Scholar · View at Scopus
  24. N. Japkowicz, C. Myers, M. Gluck et al., “A novelty detection approach to classification,” in Proceedings of the 14th international joint conference on Artificial intelligence (IJCAI '95), vol. 1, pp. 518–523, Montreal, Canada, 1995.
  25. B. B. Thompson, R. J. Marks, J. J. Choi, M. A. El-Sharkawi, M.-Y. Huang, and C. Bunje, “Implicit learning in autoencoder novelty assessment,” in Proceedings of the 2002 International Joint Conference on Neural Networks (IJCNN '02), vol. 3, pp. 2878–2883, IEEE, Honolulu, Hawaii, USA, 2002.
  26. S. Hawkins, H. He, G. Williams, and R. Baxter, “Outlier detection using replicator neural networks,” in Data Warehousing and Knowledge Discovery, pp. 170–180, Springer, Berlin, Germany, 2002. View at Google Scholar
  27. N. Japkowicz, “Supervised versus unsupervised binary-learning by feedforward neural networks,” Machine Learning, vol. 42, no. 1-2, pp. 97–122, 2001. View at Publisher · View at Google Scholar · View at Scopus
  28. L. Manevitz and M. Yousef, “One-class document classification via Neural Networks,” Neurocomputing, vol. 70, no. 7–9, pp. 1466–1481, 2007. View at Publisher · View at Google Scholar · View at Scopus
  29. G. Williams, R. Baxter, H. He, S. Hawkins, and L. Gu, “A comparative study of RNN for outlier detection in data mining,” in Proceedings of the 2nd IEEE International Conference on Data Mining (ICDM '02), pp. 709–712, IEEE Computer Society, December 2002. View at Scopus
  30. H. Sohn, K. Worden, and C. R. Farrar, “Statistical damage classification under changing environmental and operational conditions,” Journal of Intelligent Material Systems and Structures, vol. 13, no. 9, pp. 561–574, 2002. View at Publisher · View at Google Scholar · View at Scopus
  31. S. Ntalampiras, I. Potamitis, and N. Fakotakis, “Probabilistic novelty detection for acoustic surveillance under real-world conditions,” IEEE Transactions on Multimedia, vol. 13, no. 4, pp. 713–719, 2011. View at Publisher · View at Google Scholar · View at Scopus
  32. E. Principi, S. Squartini, R. Bonfigli, G. Ferroni, and F. Piazza, “An integrated system for voice command recognition and emergency detection based on audio signals,” Expert Systems with Applications, vol. 42, no. 13, pp. 5668–5683, 2015. View at Publisher · View at Google Scholar · View at Scopus
  33. A. Ito, A. Aiba, M. Ito, and S. Makino, “Evaluation of abnormal sound detection using multi-stage GMM in various environments,” in Proceedings of the 12th Annual Conference of the International Speech Communication Association (INTERSPEECH '11), pp. 301–304, Florence, Italy, August 2011. View at Scopus
  34. S. Ntalampiras, I. Potamitis, and N. Fakotakis, “On acoustic surveillance of hazardous situations,” in Proceedings of the 34th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '09), pp. 165–168, April 2009. View at Publisher · View at Google Scholar
  35. E. Marchi, F. Vesperini, F. Eyben, S. Squartini, and B. Schuller, “A novel approach for automatic acoustic novelty detection using a denoising autoencoder with bidirectional LSTM neural networks,” in Proceedings of the 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '15), p. 5, IEEE, Brisbane, Australia, April 2015.
  36. E. Marchi, F. Vesperini, F. Weninger, F. Eyben, S. Squartini, and B. Schuller, “Non-linear prediction with LSTM recurrent neural networks for acoustic novelty detection,” in Proceedings of the 2015 International Joint Conference on Neural Networks (IJCNN '15), p. 5, IEEE, Killarney, Ireland, July 2015.
  37. F. A. Gers, N. N. Schraudolph, and J. Schmidhuber, “Learning precise timing with LSTM recurrent networks,” The Journal of Machine Learning Research, vol. 3, no. 1, pp. 115–143, 2003. View at Publisher · View at Google Scholar · View at MathSciNet · View at Scopus
  38. A. Graves, “Generating sequences with recurrent neural networks,” https://arxiv.org/abs/1308.0850.
  39. D. Eck and J. Schmidhuber, “Finding temporal structure in music: blues improvisation with lstm recurrent networks,” in Proceedings of the 12th IEEE Workshop on Neural Networks for Signal Processing, pp. 747–756, IEEE, Martigny, Switzerland, September 2002. View at Publisher · View at Google Scholar
  40. J. A. Bilmes, “A gentle tutorial of the EM algorithm and its application to parameter estimation for gaussian mixture and hidden Markov models,” Tech. Rep. 97-021, International Computer Science Institute, Berkeley, Calif, USA, 1998. View at Google Scholar
  41. L. R. Rabiner, “Tutorial on hidden Markov models and selected applications in speech recognition,” Proceedings of the IEEE, vol. 77, no. 2, pp. 257–286, 1989. View at Publisher · View at Google Scholar · View at Scopus
  42. B. Schölkopf, R. C. Williamson, A. J. Smola, J. Shawe-Taylor, and J. C. Platt, “Support vector method for novelty detection,” in Advances in Neural Information Processing Systems 12, S. Solla, T. Leen, and K. Müller, Eds., pp. 582–588, MIT Press, 2000. View at Google Scholar
  43. D. E. Rumelhart, G. E. Hinton, and R. J. Williams, “Learning representations by back-propagating errors,” Nature, vol. 323, no. 6088, pp. 533–536, 1986. View at Publisher · View at Google Scholar · View at Scopus
  44. S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural Computation, vol. 9, no. 8, pp. 1735–1780, 1997. View at Publisher · View at Google Scholar · View at Scopus
  45. M. Schuster and K. K. Paliwal, “Bidirectional recurrent neural networks,” IEEE Transactions on Signal Processing, vol. 45, no. 11, pp. 2673–2681, 1997. View at Publisher · View at Google Scholar · View at Scopus
  46. A. Graves and J. Schmidhuber, “Framewise phoneme classification with bidirectional LSTM and other neural network architectures,” Neural Networks, vol. 18, no. 5-6, pp. 602–610, 2005. View at Publisher · View at Google Scholar · View at Scopus
  47. G. Hinton, L. Deng, D. Yu et al., “Deep neural networks for acoustic modeling in speech recognition: the shared views of four research groups,” IEEE Signal Processing Magazine, vol. 29, no. 6, pp. 82–97, 2012. View at Publisher · View at Google Scholar · View at Scopus
  48. I. Goodfellow, H. Lee, Q. V. Le, A. Saxe, and A. Y. Ng, “Measuring invariances in deep networks,” in Advances in Neural Information Processing Systems, Y. Bengio, D. Schuurmans, J. Lafferty, C. Williams, and A. Culotta, Eds., vol. 22, pp. 646–654, Curran & Associates Inc, Red Hook, NY, USA, 2009. View at Google Scholar
  49. Y. Bengio, P. Lamblin, D. Popovici, and H. Larochelle, “Greedy layer-wise training of deep networks,” in Advances in Neural Information Processing Systems 19, B. Schölkopf, J. Platt, and T. Hoffman, Eds., pp. 153–160, MIT Press, 2007. View at Google Scholar
  50. P. Vincent, H. Larochelle, I. Lajoie, Y. Bengio, and P.-A. Manzagol, “Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion,” Journal of Machine Learning Research, vol. 11, pp. 3371–3408, 2010. View at Google Scholar · View at MathSciNet
  51. E. Marchi, G. Ferroni, F. Eyben, L. Gabrielli, S. Squartini, and B. Schuller, “Multi-resolution linear prediction based features for audio onset detection with bidirectional LSTM neural networks,” in Proceedings of the 39th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '14), pp. 2183–2187, IEEE, Florence, Italy, May 2014. View at Publisher · View at Google Scholar
  52. F. Eyben, F. Weninger, F. Gross, and B. Schuller, “Recent developments in openSMILE, the munich open-source multimedia feature extractor,” in Proceedings of the 21st ACM International Conference on Multimedia (MM '13), pp. 835–838, ACM, Barcelona, Spain, October 2013. View at Publisher · View at Google Scholar · View at Scopus
  53. J. Barker, E. Vincent, N. Ma, H. Christensen, and P. Green, “The pascal chime speech separation and recognition challenge,” Computer Speech & Language, vol. 27, no. 3, pp. 621–633, 2013. View at Publisher · View at Google Scholar · View at Scopus
  54. F. Weninger, J. Bergmann, and B. Schuller, “Introducing CURRENNT: the Munich open-source CUDA RecurREnt neural network toolkit,” Journal of Machine Learning Research, vol. 16, pp. 547–551, 2015. View at Google Scholar · View at MathSciNet
  55. C.-C. Chang and C.-J. Lin, “LIBSVM: a library for support vector machines,” ACM Transactions on Intelligent Systems and Technology, vol. 2, no. 3, article 27, 2011. View at Publisher · View at Google Scholar · View at Scopus
  56. R. Collobert, K. Kavukcuoglu, and C. Farabet, “Torch7: a matlab-like environment for machine learning,” BigLearn, NIPS Workshop, no. EPFL-CONF-192376, 2011.
  57. M. D. Smucker, J. Allan, and B. Carterette, “A comparison of statistical significance tests for information retrieval evaluation,” in Proceedings of the 16th ACM Conference on Conference on Information and Knowledge Management (CIKM '07), pp. 623–632, ACM, Lisbon, Portugal, November 2007.
  58. E. Marchi, G. Ferroni, F. Eyben, S. Squartini, and B. Schuller, “Audio onset detection: a wavelet packet based approach with recurrent neural networks,” in Proceedings of the 2014 International Joint Conference on Neural Networks (IJCNN '14), pp. 3585–3591, Beijing, China, July 2014. View at Publisher · View at Google Scholar · View at Scopus
  59. N. Friedman, K. Murphy, and S. Russell, “Learning the structure of dynamic probabilistic networks,” in Proceedings of the 14th Conference on Uncertainty in Artificial Intelligence (UAI '98), pp. 139–147, Morgan Kaufmann, Madison, Wisc, USA, July 1998.
  60. S. Lawrence, C. L. Giles, A. C. Tsoi, and A. D. Back, “Face recognition: a convolutional neural-network approach,” IEEE Transactions on Neural Networks, vol. 8, no. 1, pp. 98–113, 1997. View at Publisher · View at Google Scholar · View at Scopus