deep-learning-papers
Papers about deep learning ordered by task, date. Current state-of-the-art papers are labelled.
Top Related Projects
Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!
The most cited deep learning papers
A curated list of awesome Deep Learning tutorials, projects and communities.
A curated list of deep learning resources for computer vision
TensorFlow - A curated list of dedicated resources http://tensorflow.org
Quick Overview
The sbrugman/deep-learning-papers repository is a curated collection of deep learning papers organized by topic. It serves as a comprehensive resource for researchers and practitioners in the field of deep learning, providing easy access to influential papers across various subdomains.
Pros
- Well-organized structure with papers categorized by specific topics
- Regularly updated with new and relevant papers
- Includes links to paper PDFs and additional resources when available
- Covers a wide range of deep learning topics, from foundational concepts to specialized applications
Cons
- Lacks detailed summaries or explanations of the papers
- May not include all relevant papers in each category
- Subjective selection process for included papers
- Limited community contribution and discussion features
Note: As this is not a code library, the code example and getting started sections have been omitted.
Competitor Comparisons
Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!
Pros of Deep-Learning-Papers-Reading-Roadmap
- Provides a structured learning path for deep learning topics
- Includes brief descriptions and explanations for each paper
- Organizes papers by subject area and difficulty level
Cons of Deep-Learning-Papers-Reading-Roadmap
- Less frequently updated compared to deep-learning-papers
- Focuses primarily on foundational papers, potentially missing newer research
- Limited to specific deep learning topics, not as comprehensive
Code Comparison
While both repositories primarily focus on curating and organizing research papers, they don't contain significant code samples. However, deep-learning-papers includes some code snippets for paper implementations:
deep-learning-papers:
import torch
import torch.nn as nn
class SimpleNet(nn.Module):
def __init__(self):
super(SimpleNet, self).__init__()
self.fc = nn.Linear(784, 10)
Deep-Learning-Papers-Reading-Roadmap does not include code samples, as it's primarily a curated list of papers with descriptions and categorizations.
The most cited deep learning papers
Pros of awesome-deep-learning-papers
- More comprehensive collection with over 400 papers
- Better organization with categorization by research areas
- Includes a "Top 100" list for quick reference
Cons of awesome-deep-learning-papers
- Last updated in 2018, potentially outdated
- Lacks direct links to paper implementations or code
Code comparison
Not applicable for these repositories, as they primarily consist of curated lists of papers without significant code content.
Summary
awesome-deep-learning-papers offers a more extensive and well-organized collection of deep learning papers, including a helpful "Top 100" list. However, it hasn't been updated since 2018, which may limit its relevance for recent research. deep-learning-papers, while potentially more up-to-date, has a smaller collection and less structured organization.
Both repositories serve as valuable resources for researchers and practitioners in the field of deep learning, but users should consider the trade-offs between comprehensiveness and recency when choosing between them.
A curated list of awesome Deep Learning tutorials, projects and communities.
Pros of awesome-deep-learning
- More comprehensive coverage of deep learning topics, including books, courses, and videos
- Better organized structure with clear categories and subcategories
- Regularly updated with new resources and contributions from the community
Cons of awesome-deep-learning
- Less focus on specific research papers compared to deep-learning-papers
- May be overwhelming for beginners due to the large amount of information
- Lacks detailed summaries or explanations for each resource
Code comparison
While both repositories are primarily curated lists of resources, they don't contain significant code samples. However, here's an example of how they structure their content:
deep-learning-papers:
## Image Classification
- [Paper Title](link) (Conference Year)
- Summary: Brief description of the paper
awesome-deep-learning:
## Papers
* [Title](link) - Author, Publication, Year
The deep-learning-papers repository provides more detailed information about each paper, including a summary, while awesome-deep-learning offers a broader range of resources beyond just papers.
A curated list of deep learning resources for computer vision
Pros of awesome-deep-vision
- More comprehensive coverage of computer vision topics
- Better organization with clear categorization of papers
- Includes additional resources like datasets and software
Cons of awesome-deep-vision
- Less frequently updated compared to deep-learning-papers
- Focuses primarily on computer vision, limiting its scope
- May be overwhelming for beginners due to the large number of papers
Code comparison
Not applicable, as both repositories are curated lists of papers and resources without significant code content.
Summary
awesome-deep-vision offers a more extensive and well-organized collection of computer vision papers and resources, making it ideal for researchers and practitioners in this specific field. However, it may be less suitable for those seeking a broader overview of deep learning or more recent updates.
deep-learning-papers, on the other hand, covers a wider range of deep learning topics and is updated more frequently. It might be more appropriate for those looking for a general understanding of deep learning or staying current with the latest developments across various subfields.
Both repositories serve as valuable resources for the AI community, with their strengths catering to different needs and preferences.
TensorFlow - A curated list of dedicated resources http://tensorflow.org
Pros of awesome-tensorflow
- Focused specifically on TensorFlow resources and tools
- Includes a wider variety of content types (tutorials, videos, books, etc.)
- More frequently updated with new resources
Cons of awesome-tensorflow
- Less structured organization compared to deep-learning-papers
- May be overwhelming for beginners due to the large number of resources
- Lacks detailed descriptions or summaries for each listed item
Code comparison
Not applicable for these repositories, as they are curated lists of resources rather than code repositories.
Summary
awesome-tensorflow is a comprehensive collection of TensorFlow-related resources, offering a wide range of content types for developers at various skill levels. It's frequently updated but may be overwhelming for beginners due to its extensive list.
deep-learning-papers, on the other hand, focuses specifically on research papers in the broader field of deep learning. It provides a more structured organization, making it easier to navigate for those interested in academic research. However, it may not be as frequently updated and lacks the variety of resource types found in awesome-tensorflow.
Both repositories serve different purposes and cater to different audiences within the machine learning community. awesome-tensorflow is ideal for TensorFlow practitioners looking for practical resources, while deep-learning-papers is better suited for researchers and those interested in the theoretical aspects of deep learning.
Convert designs to code with AI
Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.
Try Visual CopilotREADME
Deep Learning Papers by task
Papers about deep learning ordered by task, date. For each paper there is a permanent link, which is either to Arxiv.org or to a copy of the original paper in this repository.
Table of Contents
-
1.1. Code Generation
-
2.1. Summarization
2.2. Taskbots
2.3. Classification
2.4. Question Answering
2.5. Sentiment Analysis
2.6. Translation
2.7. Chatbots
2.8. Reasoning
-
3.1. Gaming
3.2. Style Transfer
3.3. Object Tracking
3.4. Visual Question Answering
3.5. Image Segmentation
3.6. Text (in the Wild) Recognition
3.7. Brain Computer Interfacing
3.8. Self-Driving Cars
3.9. Object Recognition
3.10. Logo Recognition
3.11. Super Resolution
3.12. Pose Estimation
3.13. Image Captioning
3.14. Image Compression
3.15. Image Synthesis
3.16. Face Recognition
3.17. Image Composition
3.18. Scene Graph Parsing
3.19. Video Deblurring
3.20. Depth Perception
3.21. 3D Reconstruction
3.22. Vision Representation
-
4.1. Audio Synthesis
-
5.1. Unclassified
5.2. Regularization
5.3. Neural Network Compression
5.4. Optimizers
Code
Code Generation
Title | Date | Paper | Code |
---|---|---|---|
DeepAM: Migrate APIs with Multi-modal Sequence to Sequence Learning | 25 apr 2017 | arxiv | |
A Syntactic Neural Model for General-Purpose Code Generation | 6 apr 2017 | arxiv | |
RobustFill: Neural Program Learning under Noisy I/O | 21 mar 2017 | arxiv | |
DeepFix: Fixing Common C Language Errors by Deep Learning | 12 feb 2017 | paper | |
DeepCoder: Learning to Write Programs | 7 nov 2016 | arxiv | |
Neuro-Symbolic Program Synthesis | 6 nov 2016 | arxiv | |
Deep API Learning | 27 may 2016 | arxiv |
Malware Detection and Security
Title | Date | Paper | Code |
---|---|---|---|
PassGAN: A Deep Learning Approach for Password Guessing | 1 sep 2017 | arxiv | |
Deep Android Malware Detection | 22 mar 2016 | paper | github |
Droid-Sec: Deep Learning in Android Malware Detection | 17 aug 2014 | paper | github |
Text
Summarization
Title | Date | Paper | Code |
---|---|---|---|
A Deep Reinforced Model for Abstractive Summarization | 11 may 2017 | arxiv | |
Get To The Point: Summarization with Pointer-Generator Networks | 14 apr 2017 | arxiv | |
SummaRuNNer: A Recurrent Neural Network based Sequence Model for Extractive Summarization of Documents | 14 nov 2016 | arxiv |
Taskbots
Title | Date | Paper | Code |
---|---|---|---|
Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning | 10 apr 2017 | arxiv | github |
End-to-End Task-Completion Neural Dialogue Systems | 3 mar 2017 | arxiv | github |
Classification
Title | Date | Paper | Code |
---|---|---|---|
A Large Self-Annotated Corpus for Sarcasm | 19 apr 2017 | arxiv | |
ConceptNet at SemEval-2017 Task 2: Extending Word Embeddings with Multilingual Relational Knowledge | 11 apr 2017 | arxiv | |
Bilateral Multi-Perspective Matching for Natural Language Sentences | 13 feb 2017 | arxiv | |
FastText.zip: Compressing text classification models | 12 dec 2016 | arxiv | |
ConceptNet 5.5: An Open Multilingual Graph of General Knowledge | 12 dec 2016 | arxiv | |
A Simple but Tough-to-Beat Baseline for Sentence Embeddings | 4 nov 2016 | paper | github |
Enriching Word Vectors with Subword Information | 15 jul 2016 | arxiv | |
From Word Embeddings To Document Distances | 6 jul 2016 | paper | github |
Bag of Tricks for Efficient Text Classification | 6 jul 2016 | arxiv | |
Character-level Convolutional Networks for Text Classification | 4 sep 2015 | arxiv | |
GloVe: Global Vectors for Word Representation | 25 may 2015 | paper | github |
Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks | 28 feb 2015 | arxiv | |
Distributed Representations of Sentences and Documents | 16 may 2014 | arxiv | |
Efficient Estimation of Word Representations in Vector Space | 16 jan 2013 | arxiv | |
SimHash: Hash-based Similarity Detection | 13 dec 2007 | paper |
Question Answering
Title | Date | Paper | Code |
---|---|---|---|
IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models | 30 may 2017 | arxiv | github |
Sentiment Analysis
Title | Date | Paper | Code |
---|---|---|---|
Rationalizing Neural Predictions | 13 jun 2016 | arxiv | |
Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank | 18 okt 2013 | paper |
Translation
Title | Date | Paper | Code |
---|---|---|---|
Attention Is All You Need | 12 jun 2017 | arxiv | |
Convolutional Sequence to Sequence Learning | 8 may 2017 | arxiv | github |
Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation | 14 nov 2016 | arxiv | |
A Convolutional Encoder Model for Neural Machine Translation | 7 nov 2016 | arxiv | |
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation | 26 sep 2016 | arxiv | |
Neural Machine Translation by Jointly Learning to Align and Translate | 1 sep 2014 | arxiv |
Chatbots
Title | Date | Paper | Code |
---|---|---|---|
A Deep Reinforcement Learning Chatbot | 7 sep 2017 | arxiv | |
A Neural Conversational Model | 19 jun 2015 | arxiv | github |
Reasoning
Title | Date | Paper | Code |
---|---|---|---|
NeuroSAT: Learning a SAT Solver from Single-Bit Supervision | 5 jan 2019 | arxiv | github |
Tracking the World State with Recurrent Entity Networks | 12 dec 2016 | arxiv |
Language Representation
Title | Date | Paper | Code |
---|---|---|---|
Efficient Estimation of Word Representations in Vector Space | 7 sep 2013 | arxiv | |
Distributed Representations of Words and Phrases and their Compositionality | 16 okt 2013 | arxiv | |
ELMO: Deep contextualized word representations | 22 Mar 2018 | arxiv | |
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding | 24 may 2019 | arxiv | github |
XLNet: Generalized Autoregressive Pretraining for Language Understanding | 19 jun 2019 | arxiv | github |
RoBERTa: A Robustly Optimized BERT Pretraining Approach | 26 jul 2019 | arxiv | github |
Visual
Gaming
Title | Date | Paper | Code |
---|---|---|---|
Phase-Functioned Neural Networks for Character Control | 1 may 2017 | paper | |
Equivalence Between Policy Gradients and Soft Q-Learning | 21 apr 2017 | arxiv | |
Beating Atari with Natural Language Guided Reinforcement Learning | 18 apr 2017 | arxiv | |
Learning from Demonstrations for Real World Reinforcement Learning | 12 apr 2017 | arxiv | |
FeUdal Networks for Hierarchical Reinforcement Learning | 3 mar 2017 | arxiv | |
Overcoming catastrophic forgetting in neural networks | 2 dec 2016 | arxiv | |
Playing Doom with SLAM-Augmented Deep Reinforcement Learning | 1 dec 2016 | arxiv | |
Playing FPS Games with Deep Reinforcement Learning | 18 sep 2016 | arxiv | |
DeepChess: End-to-End Deep Neural Network for Automatic Learning in Chess | 16 aug 2016 | paper | |
Generative Adversarial Imitation Learning | 10 jun 2016 | arxiv | |
Dueling Network Architectures for Deep Reinforcement Learning | 20 nov 2015 | arxiv | |
Prioritized Experience Replay | 18 nov 2015 | arxiv | |
Human-level control through deep reinforcement learning | 26 feb 2015 | paper | |
Playing Atari with Deep Reinforcement Learning | 19 dec 2013 | arxiv |
Style Transfer
Title | Date | Paper | Code |
---|---|---|---|
The Contextual Loss for Image Transformation with Non-Aligned Data | 18 jul 2018 | arxiv | github |
Deep Photo Style Transfer | 22 mar 2017 | arxiv | |
Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization | 20 mar 2017 | arxiv | github |
A Learned Representation For Artistic Style | 24 okt 2016 | arxiv | |
Instance Normalization: The Missing Ingredient for Fast Stylization | 27 jul 2016 | arxiv | |
Perceptual Losses for Real-Time Style Transfer and Super-Resolution | 27 mar 2016 | arxiv | github |
A Neural Algorithm of Artistic Style | 26 aug 2015 | arxiv | github |
Object Tracking
Title | Date | Paper | Code |
---|---|---|---|
End-to-end representation learning for Correlation Filter based tracking | 20 apr 2017 | arxiv | github |
Visual Question Answering
Title | Date | Paper | Code |
---|---|---|---|
VQA: Visual Question Answering | 3 may 2015 | arxiv |
Image Segmentation
Title | Date | Paper | Code |
---|---|---|---|
PointRend: Image Segmentation as Rendering | 17 dec 2019 | arxiv | |
Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation | 22 aug 2018 | paper cvpr | github |
Dilated Residual Networks | 22 jul 2017 | paper | |
SfM-Net: Learning of Structure and Motion from Video | 25 apr 2017 | arxiv | |
Semi and Weakly Supervised Semantic Segmentation Using Generative Adversarial Network | 28 mar 2017 | arxiv | |
Mask R-CNN | 20 mar 2017 | arxiv | |
Learning Features by Watching Objects Move | 19 dec 2016 | arxiv | |
RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation | 20 nov 2016 | arxiv | github |
UberNet: Training a `Universal' Convolutional Neural Network for Low-, Mid-, and High-Level Vision using Diverse Datasets and Limited Memory | 7 sep 2016 | arxiv | |
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs | 2 jun 2016 | arxiv | |
Fully Convolutional Networks for Semantic Segmentation | 20 may 2016 | arxiv | github |
Instance-aware Semantic Segmentation via Multi-task Network Cascades | 14 dec 2015 | arxiv | |
Multi-Scale Context Aggregation by Dilated Convolutions | 23 nov 2015 | arxiv | |
SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation | 2 nov 2015 | arxiv | |
U-Net: Convolutional Networks for Biomedical Image Segmentation | 18 may 2015 | arxiv | |
Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs | 22 dec 2014 | arxiv | |
Learning Rich Features from RGB-D Images for Object Detection and Segmentation | 22 jul 2014 | arxiv |
Text (in the Wild) Recognition
Title | Date | Paper | Code |
---|---|---|---|
OCR Error Correction Using Character Correction and Feature-Based Word Classification | 21 apr 2016 | arxiv | |
Recursive Recurrent Nets with Attention Modeling for OCR in the Wild | 9 mar 2016 | arxiv | |
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images | 26 jan 2016 | arxiv | |
Efficient Scene Text Localization and Recognition with Local Character Refinement | 14 apr 2015 | arxiv | |
Reading Text in the Wild with Convolutional Neural Networks | 4 dec 2014 | arxiv | |
Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition | 9 jun 2014 | arxiv |
Brain Computer Interfacing
Title | Date | Paper | Code |
---|---|---|---|
Deep learning with convolutional neural networks for brain mapping and decoding of movement-related information from the human EEG | 15 mar 2017 | arxiv | |
Encoding Voxels with Deep Learning | 2 dec 2015 | paper | |
Deep Neural Networks Reveal a Gradient in the Complexity of Neural Representations across the Ventral Stream | 8 jul 2015 | paper |
Self-Driving Cars
Title | Date | Paper | Code |
---|---|---|---|
Computer Vision for Autonomous Vehicles: Problems, Datasets and State-of-the-Art | 18 apr 2017 | arxiv | |
End to End Learning for Self-Driving Cars | 25 apr 2016 | arxiv |
Object Recognition
Title | Date | Paper | Code |
---|---|---|---|
Cascade R-CNN: High Quality Object Detection and Instance Segmentation | 24 Jun 2019 | arxiv | github |
YOLOv3: An Incremental Improvement | 8 Apr 2018 | arxiv | github, github reimplementation |
Focal Loss for Dense Object Detection | 7 aug 2017 | arxiv | |
Introspective Classifier Learning: Empower Generatively | 25 apr 2017 | arxiv | |
Learning Chained Deep Features and Classifiers for Cascade in Object Detection | 23 feb 2017 | arxiv | |
DSSD : Deconvolutional Single Shot Detector | 23 jan 2017 | arxiv | |
YOLO9000: Better, Faster, Stronger | 25 dec 2016 | arxiv | github |
Feature Pyramid Networks for Object Detection | 9 dec 2016 | arxiv | |
Speed/accuracy trade-offs for modern convolutional object detectors | 30 nov 2016 | arxiv | |
Aggregated Residual Transformations for Deep Neural Networks | 16 nov 2016 | arxiv | |
Aggregated Residual Transformations for Deep Neural Networks | 16 nov 2016 | arxiv | |
Hierarchical Object Detection with Deep Reinforcement Learning | 11 nov 2016 | arxiv | |
Xception: Deep Learning with Depthwise Separable Convolutions | 7 okt 2016 | arxiv | |
Learning to Make Better Mistakes: Semantics-aware Visual Food Recognition | 1 okt 2016 | paper | |
Densely Connected Convolutional Networks | 25 aug 2016 | arxiv | |
Residual Networks of Residual Networks: Multilevel Residual Networks | 9 aug 2016 | arxiv | |
Context Matters: Refining Object Detection in Video with Recurrent Neural Networks | 15 jul 2016 | arxiv | |
R-FCN: Object Detection via Region-based Fully Convolutional Networks | 20 may 2016 | arxiv | |
Training Region-based Object Detectors with Online Hard Example Mining | 12 apr 2016 | arxiv | |
T-CNN: Tubelets with Convolutional Neural Networks for Object Detection from Videos | 9 apr 2016 | arxiv | |
Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning | 23 feb 2016 | arxiv | |
Deep Residual Learning for Image Recognition | 10 dec 2015 | arxiv | |
SSD: Single Shot MultiBox Detector | 8 dec 2015 | arxiv | |
Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs) | 23 nov 2015 | arxiv | |
ParseNet: Looking Wider to See Better | 15 jun 2015 | arxiv | |
You Only Look Once: Unified, Real-Time Object Detection | 8 jun 2015 | arxiv | |
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks | 4 jun 2015 | arxiv | |
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification | 6 feb 2015 | arxiv | |
Deep Image: Scaling up Image Recognition | 13 jan 2015 | arxiv | |
Rich feature hierarchies for accurate object detection and semantic segmentation | 11 nov 2013 | arxiv | |
Selective Search for Object Recognition | 11 mar 2013 | paper | |
ImageNet Classification with Deep Convolutional Neural Networks | 3 dec 2012 | paper |
Logo Recognition
Title | Date | Paper | Code |
---|---|---|---|
Deep Learning Logo Detection with Data Expansion by Synthesising Context | 29 dec 2016 | arxiv | |
Automatic Graphic Logo Detection via Fast Region-based Convolutional Networks | 20 apr 2016 | arxiv | |
LOGO-Net: Large-scale Deep Logo Detection and Brand Recognition with Deep Region-based Convolutional Networks | 8 nov 2015 | arxiv | |
DeepLogo: Hitting Logo Recognition with the Deep Neural Network Hammer | 7 okt 2015 | arxiv |
Super Resolution
Title | Date | Paper | Code |
---|---|---|---|
Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network | 16 sep 2016 | arxiv | |
Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network | 15 sep 2016 | arxiv | |
RAISR: Rapid and Accurate Image Super Resolution | 3 jun 2016 | arxiv | |
Perceptual Losses for Real-Time Style Transfer and Super-Resolution | 27 mar 2016 | arxiv | github |
Image Super-Resolution Using Deep Convolutional Networks | 31 dec 2014 | arxiv |
Pose Estimation
Title | Date | Paper | Code |
---|---|---|---|
Forecasting Human Dynamics from Static Images | 11 apr 2017 | arxiv | |
Fast Single Shot Detection and Pose Estimation | 19 sep 2016 | arxiv | |
Convolutional Pose Machines | 30 jan 2016 | arxiv | |
Flowing ConvNets for Human Pose Estimation in Videos | 9 jun 2015 | arxiv |
Image Captioning
Title | Date | Paper | Code |
---|---|---|---|
Actor-Critic Sequence Training for Image Captioning | 29 jun 2017 | arxiv | |
Detecting and Recognizing Human-Object Interactions | 24 apr 2017 | arxiv | |
Deep Reinforcement Learning-based Image Captioning with Embedding Reward | 12 apr 2017 | arxiv | |
Towards Diverse and Natural Image Descriptions via a Conditional GAN | 17 mar 2017 | arxiv | |
Temporal Tessellation: A Unified Approach for Video Analysis | 21 dec 2016 | arxiv | github |
Self-critical Sequence Training for Image Captioning | 2 dec 2016 | arxiv | |
Generation and Comprehension of Unambiguous Object Descriptions | 7 nov 2015 | arxiv | |
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention | 10 feb 2015 | arxiv | |
Long-term Recurrent Convolutional Networks for Visual Recognition and Description | 17 nov 2014 | arxiv |
Image Compression
Title | Date | Paper | Code |
---|---|---|---|
Full Resolution Image Compression with Recurrent Neural Networks | 18 aug 2016 | arxiv |
Image Synthesis
Title | Date | Paper | Code |
---|---|---|---|
Scene Text Synthesis for Efficient and Effective Deep Network Training | 26 jan 2019 | arxiv | |
A Neural Representation of Sketch Drawings | 11 apr 2017 | arxiv | |
BEGAN: Boundary Equilibrium Generative Adversarial Networks | 31 mar 2017 | arxiv | github |
Improved Training of Wasserstein GANs | 31 mar 2017 | arxiv | |
Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks | 30 mar 2017 | arxiv | github |
Wasserstein GAN | 26 jan 2017 | arxiv | |
RenderGAN: Generating Realistic Labeled Data | 4 nov 2016 | arxiv | |
Conditional Image Generation with PixelCNN Decoders | 16 jun 2016 | arxiv | |
Pixel Recurrent Neural Networks | 25 jan 2016 | arxiv | |
Generative Adversarial Networks | 10 jun 2014 | arxiv |
Face Recognition
Title | Date | Paper | Code |
---|---|---|---|
Accessorize to a Crime: Real and Stealthy Attacks on State-of-the-Art Face Recognition | 24 okt 2016 | paper | |
OpenFace: A general-purpose face recognition library with mobile applications | 1 jun 2016 | paper | |
Emotion Recognition in the Wild via Convolutional Neural Networks and Mapped Binary Patterns | 9 nov 2015 | paper | |
Deep Face Recognition | 7 sep 2015 | paper | |
Compact Convolutional Neural Network Cascade for Face Detection | 6 aug 2015 | arxiv | |
Learning Robust Deep Face Representation | 17 jul 2015 | arxiv | |
Facenet: A unified embedding for face recognition and clustering | 12 jun 2015 | paper | |
Multi-view Face Detection Using Deep Convolutional Neural Networks | 10 feb 2015 | arxiv |
Image Composition
Title | Date | Paper | Code |
---|---|---|---|
Auto-Retoucher(ART) â A Framework for Background Replacement and Foreground Adjustment | 13 jan 2019 | arxiv (brave new task) | github (not able to reproduce results based on code) |
Spatial Fusion GAN for Image Synthesis | 14 Dec 2018 | arxiv (needs revision, interesting approach however) | github (currently, no code available) |
Compositional GAN: Learning Conditional Image Composition | 23 Aug 2018 | arxiv (with respect to spatial orientation) | github (currently, no code available) |
ST-GAN | 5 mar 2018 | arxiv (with respect to spatial orientation) | github |
Deep Painterly Harmonization | 26 Jun 2018 | paper | github |
Deep Image Harmonization | 28 feb 2017 | paper | github (only code for inference) |
Understanding and Improving the Realism of Image Composites | 1 Jul 2012 | paper |
Scene Graph Parsing
Title | Date | Paper | Code |
---|---|---|---|
Neural Motifs: Scene Graph Parsing with Global Context | 29 Mar 2018 | arxiv | github |
Video Deblurring
Title | Date | Paper | Code |
---|---|---|---|
Spatio-Temporal Filter Adaptive Network for Video Deblurring | 28 Apr 2019 | arxiv | github (to appear) |
Depth Perception
Title | Date | Paper | Code |
---|---|---|---|
Learning Depth with Convolutional Spatial Propagation Network | 13 Okt 2018 | arxiv | github |
Stereo Matching by Training a Convolutional Neural Network to Compare Image Patches | 18 May 2016 | arxiv | github |
3D Reconstruction
Title | Date | Paper | Code |
---|---|---|---|
Cerberus: A Multi-headed Derenderer | 28 May 2019 | arxiv |
Vision Representation
Title | Date | Paper | Code |
---|---|---|---|
VisualBERT: A Simple and Performant Baseline for Vision and Language | 9 aug 2019 | arxiv | |
Expected to appear: some paper learning an unsupervised vision representation that beats SOTA on a large number of tasks | <Before the end of 2019> |
Audio
Audio Synthesis
Title | Date | Paper | Code |
---|---|---|---|
Deep Cross-Modal Audio-Visual Generation | 26 apr 2017 | arxiv | |
A Neural Parametric Singing Synthesizer | 12 apr 2017 | arxiv | |
Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders | 5 apr 2017 | arxiv | github |
Tacotron: Towards End-to-End Speech Synthesis | 29 mar 2017 | arxiv | github |
Deep Voice: Real-time Neural Text-to-Speech | 25 feb 2017 | arxiv | |
WaveNet: A Generative Model for Raw Audio | 12 sep 2016 | arxiv | github |
Other
Unclassified
Title | Date | Paper | Code |
---|---|---|---|
A simple neural network module for relational reasoning | 5 jun 2017 | arxiv | |
Deep Complex Networks | 27 may 2017 | arxiv | github |
Learning to Fly by Crashing | 19 apr 2017 | arxiv | |
Who Said What: Modeling Individual Labelers Improves Classification | 26 mar 2017 | arxiv | |
Semi-supervised Knowledge Transfer for Deep Learning from Private Training Data | 18 okt 2016 | arxiv | |
DeepMath - Deep Sequence Models for Premise Selection | 14 jun 2016 | arxiv | |
Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue | 16 mar 2016 | arxiv | |
Long Short-Term Memory | 15 nov 1997 | paper |
Regularization
Title | Date | Paper | Code |
---|---|---|---|
Self-Normalizing Neural Networks | 8 jun 2017 | arxiv | |
Concrete Dropout | 22 may 2017 | arxiv | github |
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning | 6 jun 2015 | arxiv | |
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift | 11 feb 2015 | arxiv |
Neural Network Compression
Title | Date | Paper | Code |
---|---|---|---|
Design of Efficient Convolutional Layers using Single Intra-channel Convolution, Topological Subdivisioning and Spatial "Bottleneck" Structure | 15 aug 2016 | arxiv | |
SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size | 24 feb 2016 | arxiv | |
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding | 1 okt 2015 | arxiv |
Optimizers
Title | Date | Paper | Code |
---|---|---|---|
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour | 8 jun 2017 | arxiv | |
Equilibrated adaptive learning rates for non-convex optimization | 15 feb 2015 | arxiv | |
Adam: A Method for Stochastic Optimization | 22 dec 2014 | arxiv | |
Deep learning with Elastic Averaging SGD | 20 dec 2014 | arxiv | |
ADADELTA: An Adaptive Learning Rate Method | 22 dec 2012 | arxiv | |
Advances in Optimizing Recurrent Networks | 4 dec 2012 | arxiv | |
Efficient Backprop | 1 jul 1998 | paper |
A note on arXiv
arXiv provides the world with access to the newest scientific developments.
Open Access has a myriad of benefits, in particular, it allows science to be more efficient.
Remember to think about the quality of the papers referenced.
In particular, the importance of the peer-review process for science.
If you find an article on arXiv you should check if it has been peer-reviewed and published elsewhere.
The authoritative version of the paper is not the version on arXiv, rather it is the published peer-reviewed version.
The two versions may differ significantly.
For example, this is the case with one of the papers that I once discussed in the Text and Multimedia Mining class at Radboud:
- peer-reviewed version
- arXiv version Compare for yourself.
For the selection of the papers above, I choose open access over completeness. If you find another (open) version of a paper, you are invited to make a pull request.
Top Related Projects
Deep Learning papers reading roadmap for anyone who are eager to learn this amazing tech!
The most cited deep learning papers
A curated list of awesome Deep Learning tutorials, projects and communities.
A curated list of deep learning resources for computer vision
TensorFlow - A curated list of dedicated resources http://tensorflow.org
Convert designs to code with AI
Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.
Try Visual Copilot