Laboratory for Recognition and Organization of Speech and Audio, Department of Electrical Engineering, Columbia University, New York 10027, NY, USA
Copyright © 2007 Hindawi Publishing Corporation. This is an open access article distributed under the
Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Abstract
We present a discriminative model for polyphonic piano transcription. Support vector machines trained on spectral features are
used to classify frame-level note instances. The classifier outputs are temporally constrained via hidden Markov models, and the proposed system
is used to transcribe both synthesized and real piano recordings. A frame-level transcription accuracy of 68% was achieved on a newly
generated test set, and direct comparisons to previous approaches are provided.