Music Genre Detection

This project classifies music by genre using the GTZAN dataset. Several classical machine learning models and a neural network are trained and compared on accuracy.

Dataset

I used the GTZAN Music Genre Classification Dataset for this project. It contains 30-second audio clips, each labeled with one of 10 genres.

Project Overview

- Data Cleaning and Preprocessing
- Data Visualization
- Model Training and Evaluation
  - Logistic Regression
  - K-Nearest Neighbors (KNN)
  - Decision Tree
  - Random Forest
  - CatBoost Classifier
  - XGBoost Classifier
- Neural Network Implementation
- Model Comparison
- Best Model Selection and Prediction

Steps

1. Data Cleaning and Preprocessing

The first step was to load the dataset and clean it: handling missing values, encoding the genre labels, and normalizing the features.
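The cleaning steps above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the column names (`f1`, `f2`, `f3`, `label`), the 70/30 split, and the choice to drop rows with missing values are all assumptions, and the small random table only stands in for the real feature data.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import LabelEncoder, StandardScaler


def preprocess(df, label_col="label", test_size=0.3, seed=42):
    """Drop incomplete rows, encode genre labels, and standardize features."""
    # Handle missing values by removing incomplete rows (one simple option).
    df = df.dropna().reset_index(drop=True)

    # Map genre names to integer class ids.
    encoder = LabelEncoder()
    y = encoder.fit_transform(df[label_col])

    # Keep only the numeric feature columns.
    X = df.drop(columns=[label_col]).select_dtypes(include="number")

    # Split before scaling so test-set statistics do not leak into training.
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=test_size, random_state=seed, stratify=y
    )

    # Fit the scaler on the training rows only, then apply it to both splits.
    scaler = StandardScaler()
    X_train = scaler.fit_transform(X_train)
    X_test = scaler.transform(X_test)
    return X_train, X_test, y_train, y_test, encoder


# Hypothetical stand-in for the real feature table.
rng = np.random.default_rng(0)
demo = pd.DataFrame(rng.normal(size=(40, 3)), columns=["f1", "f2", "f3"])
demo["label"] = ["blues", "rock"] * 20
demo.loc[0, "f1"] = np.nan  # simulate one missing value

X_train, X_test, y_train, y_test, encoder = preprocess(demo)
```

Fitting the scaler on the training split only is a design choice made here to avoid leaking test information; the source does not say how normalization was applied.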

2. Data Visualization

I plotted the waveforms of each genre to see how the audio signals differ and to get a feel for the characteristics of the data.
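A waveform plot of this kind might look like the sketch below. It assumes each signal is a 1-D NumPy array of samples; the helper name `plot_waveforms` and the two synthetic signals are hypothetical placeholders. In the project the arrays would come from the dataset's audio files, for example via `librosa.load`.

```python
import matplotlib
matplotlib.use("Agg")  # headless backend so the sketch runs without a display
import matplotlib.pyplot as plt
import numpy as np


def plot_waveforms(signals, sample_rate):
    """Draw one amplitude-versus-time panel per entry in `signals`."""
    fig, axes = plt.subplots(
        len(signals), 1, figsize=(8, 2 * len(signals)), sharex=True, squeeze=False
    )
    for ax, (name, samples) in zip(axes[:, 0], signals.items()):
        t = np.arange(len(samples)) / sample_rate  # sample index -> seconds
        ax.plot(t, samples, linewidth=0.5)
        ax.set_title(name)
        ax.set_ylabel("Amplitude")
    axes[-1, 0].set_xlabel("Time (s)")
    fig.tight_layout()
    return fig


# Synthetic one-second signals standing in for real audio clips.
sample_rate = 22050
t = np.linspace(0, 1, sample_rate, endpoint=False)
signals = {
    "genre_a": np.sin(2 * np.pi * 220 * t),
    "genre_b": np.random.default_rng(0).uniform(-1, 1, size=sample_rate),
}
fig = plot_waveforms(signals, sample_rate)
```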

3. Model Training and Evaluation

Several machine learning models were trained and evaluated, with accuracy as the metric. The results were:

- Logistic Regression: 52.33%
- K-Nearest Neighbors (KNN): 70.67%
- Decision Tree: 62.00%

These accuracies were also plotted in a comparison graph.
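The train-and-score loop for these three models could be sketched like this. The data is synthetic (`make_classification`), the hyperparameters are scikit-learn defaults apart from `max_iter`, and the helper `evaluate_models` is a hypothetical name, so the scores it produces are unrelated to the figures reported above.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier


def evaluate_models(models, X_train, X_test, y_train, y_test):
    """Fit each model and return its accuracy on the held-out split."""
    scores = {}
    for name, model in models.items():
        model.fit(X_train, y_train)
        scores[name] = accuracy_score(y_test, model.predict(X_test))
    return scores


# Synthetic multi-class data standing in for the extracted audio features.
X, y = make_classification(
    n_samples=600, n_features=20, n_informative=10, n_classes=4, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y
)

baseline_models = {
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "KNN": KNeighborsClassifier(n_neighbors=5),
    "Decision Tree": DecisionTreeClassifier(random_state=0),
}
baseline_scores = evaluate_models(baseline_models, X_train, X_test, y_train, y_test)
```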

Advanced Models

Tree-based ensemble models were then trained to improve accuracy:

- Random Forest Classifier: 78.00%
- CatBoost Classifier: 83.33%
- XGBoost Classifier: 77.33%

A comparison graph of these advanced models was also created.
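As a rough sketch of how the three ensemble models can share one training loop: all of them expose `fit` and `predict`. The iteration counts, seeds, and synthetic data below are assumptions rather than the notebook's settings, and CatBoost and XGBoost are imported optionally so the example still runs when they are not installed.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split


def build_ensemble_models(seed=0):
    """Return the ensemble classifiers that are available in this environment."""
    models = {"Random Forest": RandomForestClassifier(n_estimators=100, random_state=seed)}
    try:
        from catboost import CatBoostClassifier
        # allow_writing_files=False keeps CatBoost from creating a catboost_info folder.
        models["CatBoost"] = CatBoostClassifier(
            iterations=100, random_seed=seed, verbose=0, allow_writing_files=False
        )
    except ImportError:
        pass  # CatBoost is optional for this sketch
    try:
        from xgboost import XGBClassifier
        models["XGBoost"] = XGBClassifier(n_estimators=100, random_state=seed)
    except ImportError:
        pass  # XGBoost is optional for this sketch
    return models


# Synthetic multi-class data standing in for the extracted audio features.
X, y = make_classification(
    n_samples=400, n_features=20, n_informative=10, n_classes=4, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y
)

ensemble_scores = {}
for name, model in build_ensemble_models().items():
    model.fit(X_train, y_train)
    # CatBoost returns multi-class predictions as a column vector; flatten for scoring.
    predictions = np.asarray(model.predict(X_test)).ravel()
    ensemble_scores[name] = accuracy_score(y_test, predictions)
```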

4. Neural Network Implementation

A neural network was trained for 100 epochs, achieving a test accuracy of 75%. Accuracy and error plots were generated to visualize the training process.
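The source does not describe the network architecture, so the following is only one plausible shape for a dense Keras classifier. The layer sizes, dropout rate, optimizer, and random data are all assumptions. It trains for 3 epochs to keep the sketch fast, whereas the project trained for 100.

```python
import numpy as np
import tensorflow as tf


def build_classifier(n_features, n_classes):
    """Small fully connected network ending in a softmax over the genres."""
    model = tf.keras.Sequential([
        tf.keras.Input(shape=(n_features,)),
        tf.keras.layers.Dense(128, activation="relu"),
        tf.keras.layers.Dropout(0.3),
        tf.keras.layers.Dense(64, activation="relu"),
        tf.keras.layers.Dense(n_classes, activation="softmax"),
    ])
    # Integer class ids pair with the sparse categorical cross-entropy loss.
    model.compile(
        optimizer="adam",
        loss="sparse_categorical_crossentropy",
        metrics=["accuracy"],
    )
    return model


# Random features and labels standing in for the real training data.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 20)).astype("float32")
y = rng.integers(0, 10, size=200)

model = build_classifier(n_features=20, n_classes=10)
history = model.fit(X, y, validation_split=0.2, epochs=3, batch_size=32, verbose=0)

# history.history holds per-epoch "accuracy", "loss", "val_accuracy", and "val_loss",
# which are the series needed for accuracy and error plots.
```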

5. Model Comparison

All models were compared on accuracy. The CatBoost Classifier performed best, with an accuracy of 83.33%.
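The comparison can be reproduced from the accuracies reported in this README alone. The sketch below picks the best model with `max` and draws a bar chart; the chart style and the helper name `plot_accuracy_comparison` are assumptions, not the notebook's actual plotting code.

```python
import matplotlib
matplotlib.use("Agg")  # headless backend so the sketch runs without a display
import matplotlib.pyplot as plt

# Accuracies (in percent) as reported in the sections above.
reported_accuracy = {
    "Logistic Regression": 52.33,
    "KNN": 70.67,
    "Decision Tree": 62.00,
    "Random Forest": 78.00,
    "CatBoost": 83.33,
    "XGBoost": 77.33,
    "Neural Network": 75.00,
}

# The best model is the key with the highest reported accuracy.
best_model = max(reported_accuracy, key=reported_accuracy.get)


def plot_accuracy_comparison(scores):
    """Horizontal bar chart of model accuracies, sorted from lowest to highest."""
    ordered = sorted(scores.items(), key=lambda item: item[1])
    names = [name for name, _ in ordered]
    values = [value for _, value in ordered]
    fig, ax = plt.subplots(figsize=(8, 4))
    ax.barh(names, values)
    ax.set_xlabel("Accuracy (%)")
    ax.set_xlim(0, 100)
    fig.tight_layout()
    return fig


fig = plot_accuracy_comparison(reported_accuracy)
```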

Results

The CatBoost Classifier achieved the highest accuracy (83.33%), followed by Random Forest (78.00%) and XGBoost (77.33%). The neural network reached a test accuracy of 75%.

Files

main.ipynb: Contains the code for data cleaning, preprocessing, visualization, model training, evaluation, and predictions.

Libraries used

Pandas
NumPy
Scikit-learn
Matplotlib
Seaborn
CatBoost
XGBoost
TensorFlow
librosa

Conclusion

This project applies a range of machine learning and neural network techniques to music genre classification. The CatBoost Classifier was the best-performing model, with an accuracy of 83.33%.

Acknowledgements

The dataset used in this project was obtained from Kaggle.
