Research Article

Recognizing Musical Notation Using Convolutional Neural Networks

November 30, 2020
TR EN

Recognizing Musical Notation Using Convolutional Neural Networks

Abstract

Musical scores are the essential of music theory and its development. Musical notation was developed by Greeks around 521 BCE, considering that music was developed a long time ago will will find a gap between new musical technology and old scrpits of music theory since they were written in. However, having music scores in written form has rised various kinds of problems for music information retrieval (MIR). Music notation recognition is a type of optical character recognition (OCR) applications, which allow us to recognize musical scores and convert it to a format that can be editied or played on computer such as musicXML (for page layout). In this paper, we introduce a Convolutional Neural Networks (CNN) based framework for musical notation recognition in images. We use a popular pre-trained CNN network, namely ResNet-101 to extract global features of notation and rest images. Then, a Support Vector Machine (SVM) is employed for training and classification purpose. ResNet-101 is one of the state-of-art pre-trained network for image recognition, ResNet-101 trained with more than a million images. Multiclass SVM classifiers using a fast-linear solver is also very powerful classifier. We also evaluated the proposed approach on a dataset that was derived from Attwenger, P RecordLabel and OMR-dataset, and then labeled manually by music theory. As a result, we can separate notes and rests from each other with an average accuracy of 99.02%. We can also classify five different note types. This is the first time that Resnet-101 and a SVM is combined together to perform musical notation recognition, and results are very promising.

Keywords

References

  1. Attwenger, P. (2015). RecordLabel, http://homepage. univie.ac.at/a1200595/recordlabel/
  2. Bainbridge, D., & Bell, T. (2001). The challenge of optical music recognition. Comput. Humanit, 35, 95–121, doi:10.1023/A:1002485918032.
  3. Calvo-Zaragoza, J., & Rizo, D. (2018). End-to-End Neural Optical Music Recognition of Monophonic Scores, Appl. Sci, 8, 606, doi:10.3390/app8040606.
  4. Casey, M., & Veltkamp, R., & Goto, M., & Leman, M., & Rhodes, C., & Slaney, M. (2008). Content-Based Music Information Retrieval: Current Directions and Future Challenges. In Proc. of IEEE, 668–696, doi:10.1109/JPROC.2008.916370.
  5. Cho, K., & van Merrienboer, B., & Gulcehre, C., & Bahdanau, D., & Bougares, F., & Schwenk, H., & Bengio, Y. (2014). Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation, arXiv 2014, arXiv:1406.1078
  6. Dai, J., & Li, Y., & He, K., & Sun, J. (2016). R-FCN: Object Detection via Region-based Fully Convolutional Networks, arXiv 2016, arXiv:1605.06409.
  7. Girshick, R. (2015). Fast R-CNN. arXiv 2015, arXiv:1504.08083.
  8. Good, M., & Actor, G. (2003). Using MusicXML for file interchange. International Conference on WEB Delivering of Music, 15–17, doi:10.1109/WDM.2003.1233890.

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Authors

Ahmad Othman
0000-0001-8156-8965
Kuzey Kıbrıs Türk Cumhuriyeti

Cem Direkoğlu *
0000-0001-7709-4082
Kuzey Kıbrıs Türk Cumhuriyeti

Publication Date

November 30, 2020

Submission Date

November 8, 2020

Acceptance Date

November 8, 2020

Published in Issue

Year 2020

APA
Othman, A., & Direkoğlu, C. (2020). Recognizing Musical Notation Using Convolutional Neural Networks. Avrupa Bilim Ve Teknoloji Dergisi, 283-290. https://doi.org/10.31590/ejosat.823266