A Robust Music Note Recognition System Using Convolutional Neural Network

Main Article Content

Mr. Singaraiah
Dr. Rakesh Mutukuru

Abstract

The task of automatically recognizing musical instruments poses significant challenges within the domain of music information retrieval. Learning to play the piano, on the other hand, demands expert instruction and substantial practice. Due to the hectic nature of modern life, many individuals find it difficult to commit to systematic training. Additionally, the scarcity of qualified piano teachers and the high costs associated with lessons further discourage potential students. If a computer could recognize and assess a learner’s piano performance in real time, it would enable learners to identify and correct their mistakes promptly. Although there are existing music recognition technologies, most suffer from several limitations. Currently, music processing systems that incorporate models for chord progressions achieve high accuracy in tasks such as music structure analysis, multi pitch analysis, and automatic composition or accompaniment. pitch patterns are treated as observations derived from the hidden states within the chord progression model. Convolutional Neural Networks (CNN) have been successfully applied to chord recognition. The CNN approch will give high accuracy, precision and F1-Score.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

Article Details

How to Cite
Singaraiah, M. ., & Mutukuru, D. R. . (2020). A Robust Music Note Recognition System Using Convolutional Neural Network. Turkish Journal of Computer and Mathematics Education (TURCOMAT), 11(1), 1717–1724. https://doi.org/10.61841/turcomat.v11i1.14693
Section
Research Articles

References

R. Su, L. Wang and X. Liu, "Multimodal learning using 3D audio-visual data for audio-visual speech recognition,"

International Conference on Asian Language Processing (IALP), Singapore, 2017, pp. 40-43, doi:

1109/IALP.2017.8300541.

J. Calvo-Zaragoza, A. -J. Gallego and A. Pertusa, "Recognition of Handwritten Music Symbols with Convolutional

Neural Codes," 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Kyoto,

Japan, 2017, pp. 691-696, doi: 10.1109/ICDAR.2017.118.

K. -Y. Choi, B. Coüasnon, Y. Ricquebourg and R. Zanibbi, "Bootstrapping Samples of Accidentals in Dense Piano

Scores for CNN-Based Detection," 2017 14th IAPR International Conference on Document Analysis and Recognition

(ICDAR), Kyoto, Japan, 2017, pp. 19-20, doi: 10.1109/ICDAR.2017.257.

Y. Liu and Y. Chen, "Recognition of facial expression based on CNN-CBP features," 2017 29th Chinese Control

And Decision Conference (CCDC), Chongqing, China, 2017, pp. 2139-2145, doi: 10.1109/CCDC.2017.7978869.

S. Deebika, K. A. Indira and Jesline, "A Machine Learning Based Music Player by Detecting Emotions," 2019

Fifth International Conference on Science Technology Engineering and Mathematics (ICONSTEM), Chennai, India,

, pp. 196-200, doi: 10.1109/ICONSTEM.2019. 8918890.

Apurva A. Mehta and Malay S. Bhatt. Optical music notes recognition for printed piano music score sheet. In

International Conference on Computer Communication and Informatics, Coimbatore, India, 2015.

Kia Ng, Alex McLean, and Alan Marsden. Big data optical music recognition with multi-images and multi

recognisers. In EVA London 2014 on Electronic Visualisation and the Arts, pages 215–218. BCS, 2014.

David Bainbridge and Tim Bell. A music notation construction engine for optical music recognition. Software:

Practice and Experience, 33(2):173–200, 2003.

Jorge Calvo-Zaragoza and Jose Oncina. Recognition of pen-based music notation: The HOMUS dataset. In 22nd

International Conference on Pattern Recognition, pages 3038–3043. Institute of Electrical & Electronics Engineers

(IEEE), 2014.

Jorge Calvo-Zaragoza and David Rizo. Camera-primus: Neural end-to-end optical music recognition on realistic

monophonic scores. In 19th International Society for Music Information Retrieval Conference, pages 248–255, Paris,

France, 2018.

Jorge Calvo-Zaragoza and David Rizo. End-to-end neural optical music recognition of monophonic scores.

Applied Sciences, 8(4), 2018.

Jorge Calvo-Zaragoza, Alejandro Toselli, and Enrique Vidal. Handwritten music recognition for mensural

notation: Formulation, data and baseline results. In 14th International Conference on Document Analysis and

Recognition, pages 1081– 1086, Kyoto, Japan, 2017.

I. Fujinaga, A. Hankinson and J. E. Cumming, "Introduction to SIMSSA (single interface for music score

searching and analysis)", Proceedings of the 1st International Workshop on Digital Libraries for Musicology

DLfM@JCDL 2014, pp. 1-3, September 12, 2014.

A. Rebelo, G. Capela and J. S. Cardoso, "Optical recognition of music symbols: A comparative study",

International Journal on Document Analysis and Recognition, vol. 13, no. 1, pp. 19-31, Mar. 2010.

A. Rebelo, I. Fujinaga, F. Paszkiewicz, A. Marcal, C. Guedes and J. Cardoso, "Optical music recognition: stateof-the-art and open issues", International Journal of Multimedia Information Retrieval, vol. 1, no. 3, pp. 173-190,

A. Sharif Razavian, H. Azizpour, J. Sullivan and S. Carlsson, "CNN features off-the-shelf: An astounding

baseline for recognition", The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops,

June 2014.

J. Calvo-Zaragoza and J. Oncina, "Recognition of pen-based music notation: The HOMUS dataset", 22nd

International Conference on Pattern Recognition ICPR 2014, pp. 3038-3043, August 24–28, 2014.

R. M. Pinheiro Pereira, C. E. Matos, G. Braz, J. a. D. De Almeida and A. C. De Paiva, "A deep approach for

handwritten musical symbols recognition", Proceedings of the 22nd Brazilian Symposium on Multimedia and the Web

ser. Webmedia 16, pp. 191-194, 2016.

S. Lee, S. J. Son, J. Oh and N. Kwak, "Handwritten music symbol classification using deep convolutional neural

networks", Proceedings of the 3rd International Conference on Information Science and Security, 2016.

L. Bottou, "Large-scale machine learning with stochastic gradient descent" in Proceedings of COMPSTAT2010,

Springer, pp. 177-186, 2010.