Evaluating Simultaneous Recognition and Encoding for Optical Music Recognition

Abstract

Most Optical Music Recognition workflows include several steps to retrieve the content from music score images. These steps typically comprise preprocessing, recognition, notation reconstruction and encoding. Currently, state-of-the-art models allow performing graphic recognition in an almost end-to-end fashion, performing the steps from preprocessing to recognition simultaneously. However, this graphic recognition has to be further processed to obtain a standard digital music representation. In this paper, we study the simultaneous recognition and encoding for a state-of-the-art OMR approach, based on neural networks, which receives a single staff-region image as input and directly obtains a sequence of characters that encodes the content in a standard music format. Our results confirm that performing OMR this way is feasible and brings additional benefits such as directly obtaining a version of the score readily available to be further processed or edited by standard tools.

Keywords

This publication has 17 references indexed in Scilit:

Human-Guided Recognition of Music Score Images
Published by Association for Computing Machinery (ACM) ,2017
An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016
The Tasso in Music Project
Early Music, 2015
A Robust Method for Musical Note Recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2015
Review: The Josquin Research Project by Jesse Rodin and Craig Sapp
Journal of the American Musicological Society, 2015
Towards a Standard Testbed for Optical Music Recognition: Definitions, Metrics, and Page Images
Journal of New Music Research, 2015
Classification of optical music symbols based on combined neural network
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2014
Robust and Adaptive OMR System Including Fuzzy Modeling, Fusion of Musical Rules, and Possible Error Detection
EURASIP Journal on Advances in Signal Processing, 2006
Connectionist temporal classification
Published by Association for Computing Machinery (ACM) ,2006
Music Information Processing Using the Humdrum Toolkit: Concepts, Examples, and Lessons
Computer Music Journal, 2002

Cited by 1 article