End-to-end learning for a music audio tagging task

This is a PyTorch implementation of "End-to-end learning for music audio" by Dieleman and Schrauwen (2014).

My aim was mostly to learn and play with the model and to visualise the filters, so the implementation is not yet complete. For example, the code does not average predictions over a track to report AUC scores; I plan to add this later. It also uses all the tagging classes rather than only the top 50.
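As a rough illustration of the missing evaluation steps, here is a minimal standard-library sketch. It is not part of this repository: the function names, the input layout (one track id and one list of tag scores per clip), and the toy data are all assumptions made for the example. It averages clip-level scores per track, computes a per-tag ROC AUC by pairwise comparison, and picks the most frequent tags.

```python
from collections import defaultdict


def average_over_tracks(clip_track_ids, clip_scores):
    """Average clip-level tag scores into one score vector per track.

    Assumed layout: clip_scores[i] is the list of tag scores for clip i,
    and clip_track_ids[i] is the (hypothetical) id of the track it came from.
    """
    grouped = defaultdict(list)
    for track_id, scores in zip(clip_track_ids, clip_scores):
        grouped[track_id].append(scores)
    return {
        track_id: [sum(column) / len(column) for column in zip(*rows)]
        for track_id, rows in grouped.items()
    }


def roc_auc(labels, scores):
    """ROC AUC for a single tag via pairwise comparison.

    Equals the probability that a positive example scores higher than a
    negative one, with ties counted as one half.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        return None  # AUC is undefined when only one class is present
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in positives
        for n in negatives
    )
    return wins / (len(positives) * len(negatives))


def top_k_tags(tag_counts, k=50):
    """Return the k most frequent tag names, most frequent first."""
    return sorted(tag_counts, key=tag_counts.get, reverse=True)[:k]


# Toy data (made up for illustration): five clips from three tracks, two tags.
clip_track_ids = ["a", "a", "b", "b", "c"]
clip_scores = [[0.9, 0.2], [0.7, 0.4], [0.1, 0.8], [0.3, 0.6], [0.5, 0.5]]
track_labels = {"a": [1, 0], "b": [0, 1], "c": [1, 0]}

track_scores = average_over_tracks(clip_track_ids, clip_scores)
track_ids = sorted(track_scores)
for tag_index in range(2):
    labels = [track_labels[t][tag_index] for t in track_ids]
    scores = [track_scores[t][tag_index] for t in track_ids]
    print(f"tag {tag_index}: AUC = {roc_auc(labels, scores)}")
```

The pairwise AUC is quadratic in the number of examples, which is fine for a sketch; for a real evaluation a library routine such as scikit-learn's roc_auc_score would likely be the more practical choice.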

Usage

Download and prepare the data by running get_data.sh. The remaining code is in visualise.ipynb. It should produce figures such as the one in spectra.svg.

Data

MagnaTagATune dataset

References

Dieleman, S. and Schrauwen, B., 2014, May. End-to-end learning for music audio. In 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 6964-6968). IEEE.
