A Robust Music Note Recognition System Using Convolutional Neural Network
Main Article Content
Abstract
The task of automatically recognizing musical instruments poses significant challenges within the domain of music information retrieval. Learning to play the piano, on the other hand, demands expert instruction and substantial practice. Due to the hectic nature of modern life, many individuals find it difficult to commit to systematic training. Additionally, the scarcity of qualified piano teachers and the high costs associated with lessons further discourage potential students. If a computer could recognize and assess a learner’s piano performance in real time, it would enable learners to identify and correct their mistakes promptly. Although there are existing music recognition technologies, most suffer from several limitations. Currently, music processing systems that incorporate models for chord progressions achieve high accuracy in tasks such as music structure analysis, multi pitch analysis, and automatic composition or accompaniment. pitch patterns are treated as observations derived from the hidden states within the chord progression model. Convolutional Neural Networks (CNN) have been successfully applied to chord recognition. The CNN approch will give high accuracy, precision and F1-Score.
Downloads
Metrics
Article Details
This work is licensed under a Creative Commons Attribution 4.0 International License.
You are free to:
- Share — copy and redistribute the material in any medium or format for any purpose, even commercially.
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
- The licensor cannot revoke these freedoms as long as you follow the license terms.
Under the following terms:
- Attribution — You must give appropriate credit , provide a link to the license, and indicate if changes were made . You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.
Notices:
You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable exception or limitation .
No warranties are given. The license may not give you all of the permissions necessary for your intended use. For example, other rights such as publicity, privacy, or moral rights may limit how you use the material.
References
R. Su, L. Wang and X. Liu, "Multimodal learning using 3D audio-visual data for audio-visual speech recognition,"
International Conference on Asian Language Processing (IALP), Singapore, 2017, pp. 40-43, doi:
1109/IALP.2017.8300541.
J. Calvo-Zaragoza, A. -J. Gallego and A. Pertusa, "Recognition of Handwritten Music Symbols with Convolutional
Neural Codes," 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), Kyoto,
Japan, 2017, pp. 691-696, doi: 10.1109/ICDAR.2017.118.
K. -Y. Choi, B. Coüasnon, Y. Ricquebourg and R. Zanibbi, "Bootstrapping Samples of Accidentals in Dense Piano
Scores for CNN-Based Detection," 2017 14th IAPR International Conference on Document Analysis and Recognition
(ICDAR), Kyoto, Japan, 2017, pp. 19-20, doi: 10.1109/ICDAR.2017.257.
Y. Liu and Y. Chen, "Recognition of facial expression based on CNN-CBP features," 2017 29th Chinese Control
And Decision Conference (CCDC), Chongqing, China, 2017, pp. 2139-2145, doi: 10.1109/CCDC.2017.7978869.
S. Deebika, K. A. Indira and Jesline, "A Machine Learning Based Music Player by Detecting Emotions," 2019
Fifth International Conference on Science Technology Engineering and Mathematics (ICONSTEM), Chennai, India,
, pp. 196-200, doi: 10.1109/ICONSTEM.2019. 8918890.
Apurva A. Mehta and Malay S. Bhatt. Optical music notes recognition for printed piano music score sheet. In
International Conference on Computer Communication and Informatics, Coimbatore, India, 2015.
Kia Ng, Alex McLean, and Alan Marsden. Big data optical music recognition with multi-images and multi
recognisers. In EVA London 2014 on Electronic Visualisation and the Arts, pages 215–218. BCS, 2014.
David Bainbridge and Tim Bell. A music notation construction engine for optical music recognition. Software:
Practice and Experience, 33(2):173–200, 2003.
Jorge Calvo-Zaragoza and Jose Oncina. Recognition of pen-based music notation: The HOMUS dataset. In 22nd
International Conference on Pattern Recognition, pages 3038–3043. Institute of Electrical & Electronics Engineers
(IEEE), 2014.
Jorge Calvo-Zaragoza and David Rizo. Camera-primus: Neural end-to-end optical music recognition on realistic
monophonic scores. In 19th International Society for Music Information Retrieval Conference, pages 248–255, Paris,
France, 2018.
Jorge Calvo-Zaragoza and David Rizo. End-to-end neural optical music recognition of monophonic scores.
Applied Sciences, 8(4), 2018.
Jorge Calvo-Zaragoza, Alejandro Toselli, and Enrique Vidal. Handwritten music recognition for mensural
notation: Formulation, data and baseline results. In 14th International Conference on Document Analysis and
Recognition, pages 1081– 1086, Kyoto, Japan, 2017.
I. Fujinaga, A. Hankinson and J. E. Cumming, "Introduction to SIMSSA (single interface for music score
searching and analysis)", Proceedings of the 1st International Workshop on Digital Libraries for Musicology
DLfM@JCDL 2014, pp. 1-3, September 12, 2014.
A. Rebelo, G. Capela and J. S. Cardoso, "Optical recognition of music symbols: A comparative study",
International Journal on Document Analysis and Recognition, vol. 13, no. 1, pp. 19-31, Mar. 2010.
A. Rebelo, I. Fujinaga, F. Paszkiewicz, A. Marcal, C. Guedes and J. Cardoso, "Optical music recognition: stateof-the-art and open issues", International Journal of Multimedia Information Retrieval, vol. 1, no. 3, pp. 173-190,
A. Sharif Razavian, H. Azizpour, J. Sullivan and S. Carlsson, "CNN features off-the-shelf: An astounding
baseline for recognition", The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops,
June 2014.
J. Calvo-Zaragoza and J. Oncina, "Recognition of pen-based music notation: The HOMUS dataset", 22nd
International Conference on Pattern Recognition ICPR 2014, pp. 3038-3043, August 24–28, 2014.
R. M. Pinheiro Pereira, C. E. Matos, G. Braz, J. a. D. De Almeida and A. C. De Paiva, "A deep approach for
handwritten musical symbols recognition", Proceedings of the 22nd Brazilian Symposium on Multimedia and the Web
ser. Webmedia 16, pp. 191-194, 2016.
S. Lee, S. J. Son, J. Oh and N. Kwak, "Handwritten music symbol classification using deep convolutional neural
networks", Proceedings of the 3rd International Conference on Information Science and Security, 2016.
L. Bottou, "Large-scale machine learning with stochastic gradient descent" in Proceedings of COMPSTAT2010,
Springer, pp. 177-186, 2010.