Table of Contents Author Guidelines Submit a Manuscript
Journal of Electrical and Computer Engineering
Volume 2016 (2016), Article ID 8363507, 19 pages
http://dx.doi.org/10.1155/2016/8363507
Research Article

Score-Informed Source Separation for Multichannel Orchestral Recordings

Music Technology Group, Universitat Pompeu Fabra, Barcelona, Spain

Received 24 June 2016; Accepted 3 November 2016

Academic Editor: Alexander Petrovsky

Copyright © 2016 Marius Miron et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. E. Gómez, M. Grachten, A. Hanjalic et al., “PHENICX: performances as highly enriched aNd Interactive concert experiences,” in Proceedings of the SMAC Stockholm Music Acoustics Conference 2013 and SMC Sound and Music Computing Conference, 2013.
  2. S. Ewert and M. Müller, “Using score-informed constraints for NMF-based source separation,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '12), pp. 129–132, Kyoto, Japan, March 2012. View at Publisher · View at Google Scholar · View at Scopus
  3. J. Fritsch and M. D. Plumbley, “Score informed audio source separation using constrained nonnegative matrix factorization and score synthesis,” in Proceedings of the 38th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '13), pp. 888–891, Vancouver, Canada, May 2013. View at Publisher · View at Google Scholar · View at Scopus
  4. Z. Duan and B. Pardo, “Soundprism: an online system for score-informed source separation of music audio,” IEEE Journal on Selected Topics in Signal Processing, vol. 5, no. 6, pp. 1205–1215, 2011. View at Publisher · View at Google Scholar · View at Scopus
  5. R. Hennequin, B. David, and R. Badeau, “Score informed audio source separation using a parametric model of non-negative spectrogram,” in Proceedings of the 36th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '11), pp. 45–48, May 2011. View at Publisher · View at Google Scholar · View at Scopus
  6. J. J. Carabias-Orti, M. Cobos, P. Vera-Candeas, and F. J. Rodriguez-Serrano, “Nonnegative signal factorization with learnt instrument models for sound source separation in close-microphone recordings,” EURASIP Journal on Advances in Signal Processing, vol. 2013, article 184, 2013. View at Publisher · View at Google Scholar
  7. T. Pratzlich, R. M. Bittner, A. Liutkus, and M. Muller, “Kernel Additive Modeling for interference reduction in multi-channel music recordings,” in Proceedings of the 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '15), pp. 584–588, Brisbane, Australia, April 2014. View at Publisher · View at Google Scholar · View at Scopus
  8. S. Ewert, B. Pardo, M. Mueller, and M. D. Plumbley, “Score-informed source separation for musical audio recordings: an overview,” IEEE Signal Processing Magazine, vol. 31, no. 3, pp. 116–124, 2014. View at Publisher · View at Google Scholar · View at Scopus
  9. F. J. Rodriguez-Serrano, Z. Duan, P. Vera-Candeas, B. Pardo, and J. J. Carabias-Orti, “Online score-informed source separation with adaptive instrument models,” Journal of New Music Research, vol. 44, no. 2, pp. 83–96, 2015. View at Publisher · View at Google Scholar · View at Scopus
  10. F. J. Rodriguez-Serrano, S. Ewert, P. Vera-Candeas, and M. Sandler, “A score-informed shift-invariant extension of complex matrix factorization for improving the separation of overlapped partials in music recordings,” in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '16), pp. 61–65, Shanghai, China, 2016.
  11. M. Miron, J. J. Carabias, and J. Janer, “Improving score-informed source separation for classical music through note refinement,” in Proceedings of the 16th International Society for Music Information Retrieval (ISMIR '15), Málaga, Spain, October 2015.
  12. A. Arzt, H. Frostel, T. Gadermaier, M. Gasser, M. Grachten, and G. Widmer, “Artificial intelligence in the concertgebouw,” in Proceedings of the 24th International Joint Conference on Artificial Intelligence, pp. 165–176, Buenos Aires, Argentina, 2015.
  13. J. J. Carabias-Orti, F. J. Rodriguez-Serrano, P. Vera-Candeas, N. Ruiz-Reyes, and F. J. Cañadas-Quesada, “An audio to score alignment framework using spectral factorization and dynamic time warping,” in Proceedings of the 16th International Society for Music Information Retrieval (ISMIR '15), Malaga, Spain, 2015.
  14. Ö. Izmirli and R. Dannenberg, “Understanding features and distance functions for music sequence alignment,” in Proceedings of the 11th International Society for Music Information Retrieval Conference (ISMIR '10), pp. 411–416, Utrecht, The Netherlands, 2010.
  15. M. Goto, “Development of the RWC music database,” in Proceedings of the 18th International Congress on Acoustics (ICA '04), pp. 553–556, Kyoto, Japan, April 2004.
  16. S. Ewert, M. Müller, and P. Grosche, “High resolution audio synchronization using chroma onset features,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '09), pp. 1869–1872, IEEE, Taipei, Taiwan, April 2009. View at Publisher · View at Google Scholar · View at Scopus
  17. J. J. Burred, From sparse models to timbre learning: new methods for musical source separation [Ph.D. thesis], 2008.
  18. F. J. Rodriguez-Serrano, J. J. Carabias-Orti, P. Vera-Candeas, T. Virtanen, and N. Ruiz-Reyes, “Multiple instrument mixtures source separation evaluation using instrument-dependent NMF models,” in Proceedings of the 10th international conference on Latent Variable Analysis and Signal Separation (LVA/ICA '12), pp. 380–387, Tel Aviv, Israel, March 2012.
  19. A. Klapuri, T. Virtanen, and T. Heittola, “Sound source separation in monaural music signals using excitation-filter model and EM algorithm,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '10), pp. 5510–5513, March 2010. View at Publisher · View at Google Scholar · View at Scopus
  20. J.-L. Durrieu, A. Ozerov, C. Févotte, G. Richard, and B. David, “Main instrument separation from stereophonic audio signals using a source/filter model,” in Proceedings of the 17th European Signal Processing Conference (EUSIPCO '09), pp. 15–19, Glasgow, UK, August 2009. View at Scopus
  21. J. J. Bosch, K. Kondo, R. Marxer, and J. Janer, “Score-informed and timbre independent lead instrument separation in real-world scenarios,” in Proceedings of the 20th European Signal Processing Conference (EUSIPCO '12), pp. 2417–2421, August 2012. View at Scopus
  22. P. S. Huang, M. Kim, M. Hasegawa-Johnson, and P. Smaragdis, “Singing-voice separation from monaural recordings using deep recurrent neural networks,” in Proceedings of the International Society for Music Information Retrieval Conference (ISMIR '14), Taipei, Taiwan, October 2014.
  23. E. K. Kokkinis and J. Mourjopoulos, “Unmixing acoustic sources in real reverberant environments for close-microphone applications,” Journal of the Audio Engineering Society, vol. 58, no. 11, pp. 907–922, 2010. View at Google Scholar · View at Scopus
  24. S. Ewert and M. Müller, “Score-informed voice separation for piano recordings,” in Proceedings of the 12th International Society for Music Information Retrieval Conference (ISMIR '11), pp. 245–250, Miami, Fla, USA, October 2011. View at Scopus
  25. B. Niedermayer, Accurate audio-to-score alignment-data acquisition in the context of computational musicology [Ph.D. thesis], Johannes Kepler University Linz, Linz, Austria, 2012.
  26. S. Wang, S. Ewert, and S. Dixon, “Compensating for asynchronies between musical voices in score-performance alignment,” in Proceedings of the 40th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '15), pp. 589–593, April 2014. View at Publisher · View at Google Scholar · View at Scopus
  27. M. Miron, J. Carabias, and J. Janer, “Audio-to-score alignment at the note level for orchestral recordings,” in Proceedings of the 15th International Society for Music Information Retrieval Conference (ISMIR '14), Taipei, Taiwan, 2014.
  28. D. Fitzgerald, M. Cranitch, and E. Coyle, “Non-negative tensor factorisation for sound source separation. acoustics, speech and signal processing,” in Proceedings of the 2006 IEEE International Conference on Acoustics, Speech, and Signal (ICASSP '06), Toulouse, France, 2006.
  29. C. Févotte and A. Ozerov, “Notes on nonnegative tensor factorization of the spectrogram for audio source separation: statistical insights and towards self-clustering of the spatial cues,” in Exploring Music Contents: 7th International Symposium, CMMR 2010, Málaga, Spain, June 21–24, 2010. Revised Papers, vol. 6684 of Lecture Notes in Computer Science, pp. 102–115, Springer, Berlin, Germany, 2011. View at Publisher · View at Google Scholar
  30. J. Pätynen, V. Pulkki, and T. Lokki, “Anechoic recording system for symphony orchestra,” Acta Acustica United with Acustica, vol. 94, no. 6, pp. 856–865, 2008. View at Publisher · View at Google Scholar · View at Scopus
  31. D. Campbell, K. Palomäki, and G. Brown, “A Matlab simulation of shoebox room acoustics for use in research and teaching,” Computing and Information Systems, vol. 9, no. 3, p. 48, 2005. View at Google Scholar
  32. O. Mayor, Q. Llimona, M. Marchini, P. Papiotis, and E. Maestre, “RepoVizz: a framework for remote storage, browsing, annotation, and exchange of multi-modal data,” in Proceedings of the 21st ACM International Conference on Multimedia (MM '13), pp. 415–416, Barcelona, Spain, October 2013. View at Publisher · View at Google Scholar · View at Scopus
  33. D. D. Lee and H. S. Seung, “Learning the parts of objects by non-negative matrix factorization,” Nature, vol. 401, no. 6755, pp. 788–791, 1999. View at Publisher · View at Google Scholar · View at Scopus
  34. C. Févotte, N. Bertin, and J.-L. Durrieu, “Nonnegative matrix factorization with the Itakura-Saito divergence: with application to music analysis,” Neural Computation, vol. 21, no. 3, pp. 793–830, 2009. View at Publisher · View at Google Scholar · View at Scopus
  35. M. Nixon, Feature Extraction and Image Processing, Elsevier Science, 2002.
  36. R. Parry and I. Essa, “Estimating the spatial position of spectral components in audio,” in Independent Component Analysis and Blind Signal Separation: 6th International Conference, ICA 2006, Charleston, SC, USA, March 5–8, 2006. Proceedings, J. Rosca, D. Erdogmus, J. C. Príncipe, and S. Haykin, Eds., vol. 3889 of Lecture Notes in Computer Science, pp. 666–673, Springer, Berlin, Germany, 2006. View at Publisher · View at Google Scholar
  37. P. Papiotis, M. Marchini, A. Perez-Carrillo, and E. Maestre, “Measuring ensemble interdependence in a string quartet through analysis of multidimensional performance data,” Frontiers in Psychology, vol. 5, article 963, 2014. View at Publisher · View at Google Scholar · View at Scopus
  38. M. Mauch and S. Dixon, “PYIN: a fundamental frequency estimator using probabilistic threshold distributions,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '14), pp. 659–663, Florence, Italy, May 2014. View at Publisher · View at Google Scholar · View at Scopus
  39. F. J. Cañadas-Quesada, P. Vera-Candeas, D. Martínez-Muñoz, N. Ruiz-Reyes, J. J. Carabias-Orti, and P. Cabanas-Molero, “Constrained non-negative matrix factorization for score-informed piano music restoration,” Digital Signal Processing, vol. 50, pp. 240–257, 2016. View at Publisher · View at Google Scholar · View at Scopus
  40. A. Cont, D. Schwarz, N. Schnell, and C. Raphael, “Evaluation of real-time audio-to-score alignment,” in Proceedings of the 8th International Conference on Music Information Retrieval (ISMIR '07), pp. 315–316, Vienna, Austria, September 2007. View at Scopus
  41. E. Vincent, R. Gribonval, and C. Févotte, “Performance measurement in blind audio source separation,” IEEE Transactions on Audio, Speech and Language Processing, vol. 14, no. 4, pp. 1462–1469, 2006. View at Publisher · View at Google Scholar · View at Scopus
  42. V. Emiya, E. Vincent, N. Harlander, and V. Hohmann, “Subjective and objective quality assessment of audio source separation,” IEEE Transactions on Audio, Speech and Language Processing, vol. 19, no. 7, pp. 2046–2057, 2011. View at Publisher · View at Google Scholar · View at Scopus
  43. D. R. Begault and E. M. Wenzel, “Headphone localization of speech,” Human Factors, vol. 35, no. 2, pp. 361–376, 1993. View at Google Scholar · View at Scopus
  44. G. S. Kendall, “A 3-D sound primer: directional hearing and stereo reproduction,” Computer Music Journal, vol. 19, no. 4, pp. 23–46, 1995. View at Google Scholar · View at Scopus
  45. A. Martí Guerola, Multichannel audio processing for speaker localization, separation and enhancement, Universitat Politècnica de València, 2013. View at Publisher · View at Google Scholar