Polish dance music classification based on Mel spectrogram decomposition
More details
Hide details
1
Lublin University of Technology, Faculty of Electrical Engineering and Computer Science, Department of Computer Science, Nadbystrzycka 38D, 20-618 Lublin, Poland
These authors had equal contribution to this work
Publication date: 2024-11-03
Corresponding author
Kinga Chwaleba
Lublin University of Technology, Faculty of Electrical Engineering and Computer Science, Department of Computer Science, Nadbystrzycka 38D, 20-618 Lublin, Poland
Adv. Sci. Technol. Res. J. 2025; 19(2)
KEYWORDS
TOPICS
ABSTRACT
Folk dances and music are essential aspects of intangible cultural heritage identifying the history and traditions of nations. Due to dynamic changes in the social structure, many national aspects are not cultivated and therefore forgotten. There is a need to develop methods to preserve these valuable aspects of culture. There are five Polish national dances: the Polonez, the Oberek, the Mazur, the Krakowiak, and the Kujawiak that reflect key elements of Polish intangible cultural heritage. They can be observed both in the way of performing dances as well as in music. There are many preserved audio and video files that differ depending on the multiple features such as composers or versions. The primary objective of this study was to apply machine learning approaches in order to distinguish the above-mentioned music of Polish traditional dances. The audio recordings dataset consisting of 137 dances in mp3 format was created. Each recording was divided into ten-second files reflecting the characteristic elements of each dance. The transformation of sound to the Mel scale improves human auditory perception. Thus, from every recording the Mel-spectrograms were generated. For the purpose of this study the most applied classification tools were compared such as VGG16, ResNet50, DenseNet121, and MobileNetV2. To compare the performance of the selected models, the following measures were applied: accuracy, precision, recall, and F1 score. ResNet50 achieved the best testing accuracy (over 90%), while DenseNet121 had the best testing loss (0.38).