A CNN-driven model with adaptive feature fusion for Polish national dance music recognition

Kinga Chwaleba; Weronika Wach

Issue X International Conference of...


Kinga Chwaleba ¹

Weronika Wach ¹


Lublin University of Technology, Faculty of Electrical Engineering and Computer Science, Department of Computer Science, Nadbystrzycka 38D, 20-618 Lublin, Poland

These authors contributed equally to this work

Publication date: 2025-08-29

Corresponding author

Kinga Chwaleba

Lublin University of Technology, Faculty of Electrical Engineering and Computer Science, Department of Computer Science, Nadbystrzycka 38D, 20-618 Lublin, Poland

Adv. Sci. Technol. Res. J. 2025;

KEYWORDS

machine learning

feature fusion

convolutional neural networks

SHapley Additive exPlanations

Polish national dance music identification

TOPICS

Computer Engineering

ABSTRACT

Mel spectrograms are widely applied in music identification and often yield good results when combined with well-known pre-trained classifiers such as VGG16, DenseNet121, or ResNet50. However, performance may still be improved by employing fusion techniques and by building a dataset with more samples, both of which generally lead to better results. We therefore introduce a novel approach that combines these methods with the aforementioned pre-trained classifiers. The core innovation of our study is feature fusion of Mel spectrograms, spectrograms, scalograms, and Mel-Frequency Cepstral Coefficient (MFCC) plots, generated from audio recordings in a newly created dataset of Polish national dance music. An adaptive model is proposed as a mechanism that adjusts the features most relevant to Polish national dance music identification. Furthermore, SHapley Additive exPlanations (SHAP) make it possible to visualize which parts of the input feature maps are crucial to the fusion model's decisions. The most common classification metrics, namely accuracy, precision, recall, and F1-score, were used to compare the obtained results with the state of the art. The proposed method yields highly satisfactory results, exceeding 94% accuracy. This study not only sets a new benchmark for Polish national dance recognition but also underscores the broader potential of multi-representation fusion as a general blueprint for next-generation audio classification systems.
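The abstract does not specify how the adaptive fusion is implemented, so the following is only a minimal sketch of one common way such a mechanism can be built: each representation is assumed to have already been encoded by a CNN branch into a fixed-length feature vector, and a small gating layer assigns softmax weights to the four vectors before they are summed. The function name `adaptive_fusion`, the gate parameters `gate_w` and `gate_b`, the feature dimension, and the random inputs are all hypothetical and are not taken from the paper.

```python
import numpy as np

# Names of the four input representations mentioned in the abstract.
REPRESENTATIONS = ["mel_spectrogram", "spectrogram", "scalogram", "mfcc"]


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - np.max(x, axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / np.sum(exp, axis=axis, keepdims=True)


def adaptive_fusion(features, gate_w, gate_b=0.0):
    """Fuse per-representation feature vectors with input-dependent weights.

    ASSUMPTION: this gating scheme is illustrative only; the paper's actual
    adaptive mechanism is not described in the abstract.

    features: dict mapping representation name -> array of shape (batch, dim)
    gate_w:   array of shape (dim, 1), a shared scoring layer (hypothetical)
    gate_b:   scalar bias of the scoring layer (hypothetical)
    Returns (fused, weights) with shapes (batch, dim) and (batch, n_repr).
    """
    names = list(features)
    # Stack branch outputs: (batch, n_repr, dim)
    stacked = np.stack([features[name] for name in names], axis=1)
    # One relevance score per representation and sample: (batch, n_repr, 1)
    scores = stacked @ gate_w + gate_b
    # Normalise scores so the weights sum to 1 across representations.
    weights = softmax(scores, axis=1)
    # Weighted sum over the representation axis: (batch, dim)
    fused = np.sum(weights * stacked, axis=1)
    return fused, weights[..., 0]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    batch, dim = 2, 8  # placeholder sizes, not from the paper
    # Random stand-ins for CNN branch outputs (e.g. pooled VGG16 features).
    features = {name: rng.normal(size=(batch, dim)) for name in REPRESENTATIONS}
    gate_w = rng.normal(size=(dim, 1))
    fused, weights = adaptive_fusion(features, gate_w)
    print(fused.shape)           # (2, 8)
    print(weights.sum(axis=1))   # each row sums to 1
```

In a trained system the gate parameters would be learned jointly with the classifier, so the weights indicate how much each representation contributes for a given recording. That is a property of this sketch, not a claim about the authors' model.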
