Top Related Projects
C++ library for audio and music analysis, description and synthesis, including Python bindings
Quick Overview
The "awesome-deep-learning-music" repository is a curated list of scientific research papers about deep learning techniques applied to music. It covers various aspects of music information retrieval, generation, and analysis using deep learning methods. The repository serves as a comprehensive resource for researchers and practitioners in the field of AI and music.
Pros
- Extensive collection of research papers covering diverse topics in music and deep learning
- Well-organized structure with papers categorized by specific areas of focus
- Regularly updated with new publications and contributions from the community
- Includes links to code implementations when available, enhancing reproducibility
Cons
- Lacks detailed explanations or summaries of the listed papers
- May be overwhelming for beginners due to the large number of papers
- Some links to papers or code implementations may become outdated over time
- Does not provide a standardized way to compare or evaluate the different approaches
Note: As this is not a code library but a curated list of research papers, there are no code examples or getting started instructions to provide.
Competitor Comparisons
C++ library for audio and music analysis, description and synthesis, including Python bindings
Pros of essentia
- Comprehensive C++ library for audio analysis and music information retrieval
- Extensive set of audio features and algorithms implemented
- Well-documented and actively maintained by the Music Technology Group
Cons of essentia
- Steeper learning curve due to C++ implementation
- Focused on audio analysis rather than deep learning specifically
- Requires compilation and setup, which may be challenging for beginners
Code comparison
essentia:
#include <essentia/algorithmfactory.h>
#include <essentia/essentiamath.h>
AlgorithmFactory& factory = AlgorithmFactory::instance();
Algorithm* mfcc = factory.create("MFCC");
awesome-deep-learning-music:
# No direct code comparison available, as it's a curated list of resources
# Rather than a code library
Additional notes
awesome-deep-learning-music is a curated list of resources for deep learning in music, while essentia is a comprehensive audio analysis library. The former provides links to various tools, papers, and projects, while the latter offers a concrete implementation of audio processing algorithms.
essentia is more suitable for developers looking to implement audio analysis features directly in their applications, while awesome-deep-learning-music serves as a valuable reference for researchers and developers exploring deep learning applications in music.
Convert designs to code with AI
Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.
Try Visual CopilotREADME
â ï¸ This repo is unmaintained. While the info are still relevant, contributions to keep it up to date are welcome! A good starting point are the articles referenced here: https://github.com/ybayle/awesome-deep-learning-music/issues/5
Deep Learning for Music (DL4M)
By Yann Bayle (Website, GitHub) from LaBRI (Website, Twitter), Univ. Bordeaux (Website, Twitter), CNRS (Website, Twitter) and SCRIME (Website).
TL;DR Non-exhaustive list of scientific articles on deep learning for music: summary (Article title, pdf link and code), details (table - more info), details (bib - all info)
The role of this curated list is to gather scientific articles, thesis and reports that use deep learning approaches applied to music. The list is currently under construction but feel free to contribute to the missing fields and to add other resources! To do so, please refer to the How To Contribute section. The resources provided here come from my review of the state-of-the-art for my PhD Thesis for which an article is being written. There are already surveys on deep learning for music generation, speech separation and speaker identification. However, these surveys do not cover music information retrieval tasks that are included in this repository.
Table of contents
- DL4M summary
- DL4M details
- Code without articles
- Statistics and visualisations
- Advices for reviewers of dl4m articles
- How To Contribute
- FAQ
- Acronyms used
- Sources
- Contributors
- Other useful related lists
- Cited by
- License
DL4M summary
DL4M details
A human-readable table summarized version if displayed in the file dl4m.tsv. All details for each article are stored in the corresponding bib entry in dl4m.bib. Each entry has the regular bib field:
author
year
title
journal
orbooktitle
Each entry in dl4m.bib also displays additional information:
link
- HTML link to the PDF filecode
- Link to the source code if availablearchi
- Neural network architecturelayer
- Number of layerstask
- The proposed tasks studied in the articledataset
- The names of the dataset useddataaugmentation
- The type of data augmentation technique usedtime
- The computation timehardware
- The hardware usednote
- Additional notes and informationrepro
- Indication to what extent the experiments are reproducible
Code without articles
- Audio Classifier in Keras using Convolutional Neural Network
- Deep learning driven jazz generation using Keras & Theano
- End-to-end learning for music audio tagging at scale
- Music Genre classification on GTZAN dataset using CNNs
- Pitch Estimation of Choir Music using Deep Learning Strategies: from Solo to Unison Recordings
- Music Genre Classification with LSTMs
- CNN based Music Emotion Classification using TensorFlow
- Separating singing voice from music based on deep neural networks in Tensorflow
- Music tag classification model using CRNN
- Finding the genre of a song with Deep Learning
- Composing music using neural nets
- Performance-RNN-PyTorch
Statistics and visualisations
- 167 papers referenced. See the details in dl4m.bib. There are more papers from 2017 than any other years combined. Number of articles per year:
- If you are applying DL to music, there are 364 other researchers in your field.
- 34 tasks investigated. See the list of tasks. Tasks pie chart:
- 55 datasets used. See the list of datasets. Datasets pie chart:
- 30 architectures used. See the list of architectures. Architectures pie chart:
- 9 frameworks used. See the list of frameworks. Frameworks pie chart:
- Only 47 articles (28%) provide their source code. Repeatability is the key to good science, so check out the list of useful resources on reproducibility for MIR and ML.
Advices for reviewers of dl4m articles
Please refer to the advice_review.md file.
How To Contribute
Contributions are welcome! Please refer to the CONTRIBUTING.md file.
FAQ
How are the articles sorted?
The articles are first sorted by decreasing year (to keep up with the latest news) and then alphabetically by the main author's family name.
Why are preprint from arXiv included in the list?
I want to have exhaustive research and the latest news on DL4M. However, one should take care of the information provided in the articles currently in review. If possible you should wait for the final accepted and peer-reviewed version before citing an arXiv paper. I regularly update the arXiv links to the corresponding published papers when available.
How much can I trust the results published in an article?
The list provided here does not guarantee the quality of the articles. You should either try to reproduce the experiments described or submit a request to ReScience. Use one article's conclusion at your own risks.
Acronyms used
A list of useful acronyms used in deep learning and music is stored in acronyms.md.
Sources
The list of conferences, journals and aggregators used to gather the proposed materials is stored in sources.md.
Contributors
- Yann Bayle (GitHub) - Instigator and principal maintainer
- Vincent Lostanlen (GitHub)
- Keunwoo Choi (GitHub)
- Bob L. Sturm (GitHub)
- Stefan Balke (GitHub)
- Jordi Pons (GitHub)
- Mirza Zulfan (GitHub) for the logo
- Devin Walters
- https://github.com/LegendJ
Other useful related lists and resources
Audio
- DL4MIR tutorial with keras - Tutorial for Deep Learning on Music Information Retrieval by Thomas Lidy
- Video talk from Ron Weiss - Ron Weiss (Google) Talk on Training neural network acoustic models on waveforms
- Slides on DL4M - A personal (re)view of the state-of-the-art by Jordi Pons
- DL4MIR tutorial - Python tutorials for learning to solve MIR tasks with DL
- Awesome Python Scientific Audio - Python resources for Audio and Machine Learning
- ISMIR resources - Community maintained list
- ISMIR Google group - Daily dose of general MIR
- Awesome Python - Audio section of Python resources
- Awesome Web Audio - WebAudio packages and resources
- Awesome Music - Music softwares
- Awesome Music Production - Music creation
- The Asimov Institute - 6 deep learning tools for music generation
- DLM Google group - Deep Learning in Music group
- MIR community on Slack - Link to subscribe to the MIR community's Slack
- Unclassified list of MIR-related links - Cory McKay's list of various links on DL, MIR, ...
- MIRDL - Unmaintained list of DL articles for MIR from Jordi Pons
- WWW 2018 Challenge - Learning to Recognize Musical Genre on the FMA dataset
- Music generation with DL - List of resources on music generation with deep learning
- Auditory Scene Analysis - Book about the perceptual organization of sound by Albert Bregman, the "father of Auditory Scene Analysis".
- Demonstrations of Auditory Scene Analysis - Audio demonstrations, which illustrate examples of auditory perceptual organization.
Music datasets
- AudioContentAnalysis nearly exhaustive list of music-related datasets
- Teaching MIR
- Wikipedia's list of datasets for machine learning research
- Datasets for deep learning
- Awesome public datasets
- Awesome music listening
Deep learning
- DLPaper2Code: Auto-generation of Code from Deep Learning Research Papers -
- Model Convertors - Convertors for DL frameworks and backend
- Deep architecture genealogy - Genealogy of DL architectures
- Deep Learning as an Engineer - Slides from Jan Schlüter
- Awesome Deep Learning - General deep learning resources
- Awesome Deep Learning Resources - Papers regarding deep learning and deep reinforcement learning
- Awesome RNNs - RNNs code, theory and applications
- Cheatsheets AI - Cheat Sheets for Keras, neural networks, scikit-learn,...
- DL PaperNotes - Summaries and notes on general deep learning research papers
- General lists
- Echo State Network
- DL in NLP - Best practices for using neural networks by Sebastian Ruder
- CNN overview - Stanford Course
- Dilated Recurrent Neural Networks - How to improve RNNs?
- Encoder-Decoder in RNNs - How Does Attention Work in Encoder-Decoder Recurrent Neural Networks
- On the use of DL - Misc fun around DL
- ML from scratch - Python implementations of ML models and algorithms from scratch from Data Mining to DL
- Comparison of DL frameworks - Presentation describing the different existing frameworks for DL
- ELU > ReLU - Article describing the differences between ELU and ReLU
- Reinforcement Learning: An Introduction - Book about reinforcement learning
- Estimating Optimal Learning Rate - Blog post on the learning rate optimisation
- GitHub repo for sklearn add-on for imbalanced learning - ML in uneven datasets
- Video on DL from Nando de Freitas, Scott Reed and Oriol Vinyals - Deep Learning: Practice and Trends (NIPS 2017 Tutorial, parts I & II)
- Article "Are GANs Created Equal? A Large-Scale Study" - Actually comparing DL algorithms
- Battle of the Deep Learning frameworks - DL frameworks comparison and evolution
- Black-box optimization - There are other optimization algorithms than just gradient descent
Cited by
If you use the information contained in this repository, please let us know! This repository is cited by:
- Alexander Schindler
- Meinard Müller, Christof Weiss, Stefan Balke
- WWW 2018 Challenge: Learning to Recognize Musical Genre
- Awesome Deep Learning
- AINewsFeed
License
You are free to copy, modify, and distribute Deep Learning for Music (DL4M) with attribution under the terms of the MIT license. See the LICENSE file for details. This project use another projects and you may refer to them for appropriate license information :
- Readme checklist - To build an universal Readme.
- Pylint - To clean the python code.
- Numpy - To manage python structure.
- Matplotlib - To plot nice figures.
- Bibtexparser - To deal with the bib entries.
Top Related Projects
C++ library for audio and music analysis, description and synthesis, including Python bindings
Convert designs to code with AI
Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.
Try Visual Copilot