An Approach to Instrumental Song Classification Utilizing Spectrogram and Convolutional Neural Networks

Hướng dẫn

Tìm kiếm nâng cao

Tên bài báo

Tìm

Tác giả

Năm xuất bản

Tóm tắt

Lĩnh vực

Phân loại

Số tạp chí

Bản tin định kỳ

Báo cáo thường niên

Tạp chí khoa học ĐHCT

Tạp chí tiếng anh ĐHCT

Tạp chí trong nước

Tạp chí quốc tế

Kỷ yếu HN trong nước

Kỷ yếu HN quốc tế

Book chapter

An Approach to Instrumental Song Classification Utilizing Spectrogram and Convolutional Neural Networks

543 (2024) Trang: 221–233

Tác giả: Anh Tuan Le, Nguyễn Thanh Hải, Nguyễn Thị Thanh Hiền, Nguyễn Hữu Hòa

Tạp chí: Studies in Systems, Decision and Control

Liên kết: https://doi.org/10.1007/978-3-031-63929-6_20

Tóm tắt

Searching for a song is a necessity, where the copyright of the song is a significant concern. This study proposes a method to classify and identify songs based on specific features that the model learns from music data. Python and CNN programming languages are used to build the model. In the first process, support libraries are used to extract audio data from the computer in WAV format. The dataset A includes 100 songs without lyrics, while the dataset B includes 100 audio files with the same song name but played in different types of musical instruments. We randomly cut the original audio files into clips less than 10 s long because users often use a specific code to find the entire track. The original audio files are split into clips of different lengths in the training set, including 1, 3, 5, 10, 20, 30, 60, and 90 s. Next, the Short-Time Fourier Transform was used to convert the audio data to the frequency domain. Finally, a shallow Convolutional Neural Network (CNN) and a Fully Connected layer (FC) were used to perform song classification tasks. We found that data augmentation by dividing the entire song into small pieces based on length significantly improved classification performance compared to those not using this technique. This research positively contributes to the advancement of e-commerce music systems, where listeners can enjoy music conveniently and memorably.

Vietnamese | English

Tạp chí khoa học Trường Đại học Cần Thơ
Khu II, Đại học Cần Thơ, Đường 3/2, Phường Ninh Kiều, Thành phố Cần Thơ, Việt Nam
Điện thoại: (0292) 3 872 157; Email: tapchidhct@ctu.edu.vn

Chương trình chạy tốt nhất trên trình duyệt IE 9+ & FF 16+, độ phân giải màn hình 1024x768 trở lên

Vui lòng chờ...