state-of-the-art-result-for-machine-learning-problems
This repository provides state-of-the-art (SoTA) results for all machine learning problems. We do our best to keep it up to date. If you find that a problem's SoTA result is out of date or missing, please raise an issue or submit the Google Form (with the following information: research paper name, dataset, metric, source code, and year). We will fix it immediately.
Top Related Projects
A library for efficient similarity search and clustering of dense vectors.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Models and examples built with TensorFlow
Tensors and Dynamic neural networks in Python with strong GPU acceleration
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
TensorFlow code and pre-trained models for BERT
Quick Overview
The RedditSota/state-of-the-art-result-for-machine-learning-problems repository is a curated collection of state-of-the-art (SOTA) results for various machine learning tasks. It serves as a comprehensive reference for researchers and practitioners, providing links to papers, code implementations, and datasets for different problem domains in machine learning.
Pros
- Comprehensive coverage of various machine learning tasks and domains
- Regularly updated with the latest SOTA results
- Includes links to papers, code implementations, and datasets
- Serves as a valuable resource for researchers and practitioners
Cons
- Relies on community contributions, which may lead to inconsistent updates
- Some links may become outdated over time
- May not cover all niche or emerging areas of machine learning
- Lacks detailed explanations or comparisons of different SOTA approaches
Code Examples
This repository is not a code library but a curated collection of links and results, so it ships no example code of its own.
Getting Started
As this is not a code library, there are no installation or setup steps. Users can navigate the repository by browsing the sections organized by machine learning task and domain; to consume the list programmatically, see the sketch below. To contribute or suggest updates, follow the repository's contribution guidelines and submit a pull request with new SOTA results or corrections to existing entries.
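If you do want to query the list from code, a minimal sketch follows. It assumes the README is fetched from GitHub's raw content endpoint and that the default branch is `master`; the search string `SNLI` is just an example dataset name.

```python
import requests

# raw README of the curated list (branch name is an assumption)
RAW_URL = (
    "https://raw.githubusercontent.com/RedditSota/"
    "state-of-the-art-result-for-machine-learning-problems/master/README.md"
)

readme = requests.get(RAW_URL, timeout=10).text

# print every line that mentions a dataset of interest, e.g. SNLI
for line in readme.splitlines():
    if "SNLI" in line:
        print(line)
```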
Competitor Comparisons
A library for efficient similarity search and clustering of dense vectors.
Pros of faiss
- Highly optimized C++ library for efficient similarity search and clustering of dense vectors
- Supports GPU acceleration for faster processing of large-scale datasets
- Provides a Python interface for easy integration with machine learning workflows
Cons of faiss
- Focused solely on similarity search and clustering, not a comprehensive ML resource
- Requires more technical expertise to implement and use effectively
- Limited to dense vector representations, may not be suitable for all ML problems
Code comparison
faiss:
```python
import faiss
import numpy as np

d = 64                            # vector dimension
nb = 100000                       # database size
nq = 10000                        # number of queries
xb = np.random.random((nb, d)).astype('float32')
xq = np.random.random((nq, d)).astype('float32')

index = faiss.IndexFlatL2(d)      # exact (brute-force) L2 index
index.add(xb)                     # add database vectors
D, I = index.search(xq, 4)        # D: distances, I: ids of the 4 nearest neighbours
```
state-of-the-art-result-for-machine-learning-problems:
```markdown
# Image Classification
## CIFAR-10
* [96.53%] [Wide Residual Networks](http://arxiv.org/abs/1605.07146) by Sergey Zagoruyko, Nikos Komodakis
## CIFAR-100
* [82.95%] [Wide Residual Networks](http://arxiv.org/abs/1605.07146) by Sergey Zagoruyko, Nikos Komodakis
```
Summary
faiss is a specialized library for similarity search and clustering, while state-of-the-art-result-for-machine-learning-problems is a curated list of top-performing ML models across various tasks. faiss offers high-performance implementations but requires more technical expertise, whereas state-of-the-art-result-for-machine-learning-problems provides a comprehensive overview of ML advancements but lacks implementation details.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Pros of transformers
- Provides a comprehensive library of pre-trained models and tools for natural language processing tasks
- Offers easy-to-use APIs for fine-tuning and deploying models
- Actively maintained with frequent updates and community support
Cons of transformers
- Focuses primarily on transformer-based models, limiting coverage of other machine learning approaches
- Requires more computational resources due to the complexity of transformer models
- May have a steeper learning curve for beginners compared to a curated list of results
Code comparison
transformers:
```python
from transformers import AutoModelForSequenceClassification, AutoTokenizer

model = AutoModelForSequenceClassification.from_pretrained("bert-base-uncased")
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
```
state-of-the-art-result-for-machine-learning-problems:
```markdown
## Image Classification
* [ImageNet](http://www.image-net.org/)
* [ConvNeXt: A ConvNet for the 2020s](https://arxiv.org/abs/2201.03545) (2022)
```
The transformers repository provides actual code implementations, while state-of-the-art-result-for-machine-learning-problems offers a curated list of research papers and their results for various machine learning tasks.
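As a quick illustration of the easy-to-use APIs noted above, here is a minimal sketch using the `pipeline` helper (the model it downloads is the library's default for the task, not something the curated list prescribes):

```python
from transformers import pipeline

# downloads a default sentiment-analysis model on first use
classifier = pipeline("sentiment-analysis")
print(classifier("This repository is a great reference!"))
# e.g. [{'label': 'POSITIVE', 'score': 0.999...}]
```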
Models and examples built with TensorFlow
Pros of models
- Comprehensive collection of official TensorFlow implementations
- Well-maintained with regular updates and contributions from Google researchers
- Includes pre-trained models and detailed documentation for each implementation
Cons of models
- Focused solely on TensorFlow, limiting its scope for other frameworks
- May not always include the absolute latest state-of-the-art results
- Requires more setup and understanding of TensorFlow to use effectively
Code Comparison
models:
```python
import tensorflow as tf
from official.vision.image_classification import resnet_model

model = resnet_model.resnet50(num_classes=1000)
```
state-of-the-art-result-for-machine-learning-problems:
```markdown
## Image Classification
* [ImageNet Classification](https://github.com/RedditSota/state-of-the-art-result-for-machine-learning-problems#imagenet-classification)
* [CIFAR-10 Classification](https://github.com/RedditSota/state-of-the-art-result-for-machine-learning-problems#cifar-10-classification)
```
Summary
models provides official TensorFlow implementations with detailed documentation and pre-trained models, making it ideal for TensorFlow users. state-of-the-art-result-for-machine-learning-problems offers a broader overview of state-of-the-art results across various frameworks and problems, serving as a comprehensive reference for researchers and practitioners. The choice between the two depends on whether you need specific TensorFlow implementations or a general overview of current best practices in machine learning.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Pros of pytorch
- Comprehensive deep learning framework with extensive functionality
- Large, active community providing support and contributions
- Seamless integration with CUDA for GPU acceleration
Cons of pytorch
- Steeper learning curve for beginners
- Larger codebase and installation size
- May be overkill for simple machine learning tasks
Code comparison
pytorch:
```python
import torch

x = torch.tensor([1, 2, 3])
y = torch.tensor([4, 5, 6])
z = torch.matmul(x, y)  # for 1-D inputs matmul is a dot product: z == 32
```
state-of-the-art-result-for-machine-learning-problems:
```markdown
# Image Classification
## CIFAR-10
* [96.53%] [Wide Residual Networks](http://arxiv.org/abs/1605.07146) (2016)
* [96.43%] [Fractional Max-Pooling](http://arxiv.org/abs/1412.6071) (2015)
```
The state-of-the-art-result-for-machine-learning-problems repository is a curated list of top-performing models for various machine learning tasks, while pytorch is a full-fledged deep learning framework. The former provides a quick reference for researchers and practitioners to stay updated on the latest achievements, while the latter offers tools and libraries for implementing and training models. The code examples highlight this difference, with pytorch showing actual model implementation and state-of-the-art-result-for-machine-learning-problems presenting performance metrics in a markdown format.
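To make the GPU-acceleration point above concrete, a small hedged sketch (it falls back to CPU when no GPU is present):

```python
import torch

# pick the GPU when one is available, otherwise run on CPU
device = "cuda" if torch.cuda.is_available() else "cpu"
x = torch.randn(1024, 1024, device=device)
y = x @ x.T  # the matrix multiply runs on the GPU if device == "cuda"
```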
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Pros of DeepSpeed
- Focuses on optimizing and scaling deep learning training
- Provides a comprehensive suite of optimization techniques
- Actively maintained by Microsoft with frequent updates
Cons of DeepSpeed
- More complex to set up and use compared to a curated list
- Requires more technical knowledge to implement effectively
- May not cover as wide a range of ML problems as a general SOTA list
Code Comparison
DeepSpeed:
```python
import deepspeed

# args, model, and params are assumed to be defined earlier:
# an argparse namespace, a torch.nn.Module, and model.parameters()
model_engine, optimizer, _, _ = deepspeed.initialize(
    args=args, model=model, model_parameters=params
)
```
state-of-the-art-result-for-machine-learning-problems:
```markdown
# Image Classification on ImageNet
* [EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks](https://arxiv.org/abs/1905.11946) (2019)
```
Summary
DeepSpeed is a powerful tool for optimizing deep learning training, offering advanced techniques for scaling and efficiency. However, it requires more technical expertise to implement. The state-of-the-art-result-for-machine-learning-problems repository provides a curated list of SOTA results across various ML tasks, making it easier to reference current benchmarks but lacking the optimization capabilities of DeepSpeed.
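To show what a minimal setup looks like, here is a hedged sketch; the model is a stand-in and the config shows only two common options, not a recommended configuration:

```python
import torch
import deepspeed

# stand-in model; any torch.nn.Module works here
model = torch.nn.Linear(512, 10)

# minimal config: train_batch_size is required; fp16 needs a GPU at runtime
ds_config = {
    "train_batch_size": 8,
    "fp16": {"enabled": True},
}

model_engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config=ds_config,
)
```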
TensorFlow code and pre-trained models for BERT
Pros of BERT
- Focused on a specific, powerful NLP model with pre-trained weights
- Includes detailed implementation and usage examples
- Actively maintained by Google Research team
Cons of BERT
- Limited to BERT model and its variants
- Requires more computational resources to run and fine-tune
- Steeper learning curve for beginners in NLP
Code Comparison
BERT example:
```python
import tensorflow as tf
from bert import modeling

# input_ids is assumed: a [batch_size, seq_length] tensor of WordPiece token ids
input_ids = tf.placeholder(tf.int32, shape=[None, 128])

bert_config = modeling.BertConfig.from_json_file("bert_config.json")
model = modeling.BertModel(config=bert_config, is_training=True, input_ids=input_ids)
```
state-of-the-art-result-for-machine-learning-problems doesn't provide code examples, as it's primarily a curated list of SOTA results and papers.
Summary
BERT is a specialized repository focusing on a specific NLP model, offering implementation details and pre-trained weights. It's well-maintained but requires more resources and expertise to use effectively.
state-of-the-art-result-for-machine-learning-problems is a comprehensive list of SOTA results across various ML domains, providing a broader overview of the field but lacking specific implementations or code examples.
Choose BERT for deep dives into transformer-based NLP models, and state-of-the-art-result-for-machine-learning-problems for staying updated on the latest achievements across multiple ML areas.
README
State-of-the-art result for all Machine Learning Problems
LAST UPDATE: 20th February 2019
NEWS: I am looking for collaborators, especially researchers in NLP, Computer Vision, and Reinforcement Learning. If you are not a researcher but are willing to help, contact me. Email me: yxt.stoaml@gmail.com
This repository provides state-of-the-art (SoTA) results for all machine learning problems. We do our best to keep this repository up to date. If you do find a problem's SoTA result is out of date or missing, please raise this as an issue (with this information: research paper name, dataset, metric, source code and year). We will fix it immediately.
You can also submit via this Google Form if you are new to GitHub.
This is an attempt to build a one-stop reference for state-of-the-art results across all types of machine learning problems. I cannot do this alone; I need help from everyone. Please submit the Google Form or raise an issue if you find a SOTA result for a dataset, and please share this on Twitter, Facebook, and other social media.
This summary is categorized into:
- Supervised Learning
  - NLP
  - Computer Vision
  - Speech
- Semi-supervised Learning
  - Computer Vision
- Unsupervised Learning
  - Computer Vision
  - NLP
- Transfer Learning
- Reinforcement Learning
Supervised Learning
NLP
1. Language Modelling
Research Paper | Datasets | Metric | Source Code | Year |
---|---|---|---|---|
Language Models are Unsupervised Multitask Learners | | | Tensorflow | 2019 |
BREAKING THE SOFTMAX BOTTLENECK: A HIGH-RANK RNN LANGUAGE MODEL | | | Pytorch | 2017 |
DYNAMIC EVALUATION OF NEURAL SEQUENCE MODELS | | | Pytorch | 2017 |
Averaged Stochastic Gradient Descent with Weight Dropped LSTM or QRNN | | | Pytorch | 2017 |
FRATERNAL DROPOUT | | | Pytorch | 2017 |
Factorization tricks for LSTM networks | One Billion Word Benchmark | Perplexity: 23.36 | Tensorflow | 2017 |
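For reference, the perplexity reported above is the exponential of the average per-token cross-entropy. A minimal sketch in PyTorch (the tensor shapes are assumptions for illustration):

```python
import torch
import torch.nn.functional as F

def perplexity(logits: torch.Tensor, targets: torch.Tensor) -> torch.Tensor:
    # logits: [num_tokens, vocab_size], targets: [num_tokens] of token ids
    # cross_entropy averages the per-token negative log-likelihood
    return torch.exp(F.cross_entropy(logits, targets))
```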
2. Machine Translation
Research Paper | Datasets | Metric | Source Code | Year |
---|---|---|---|---|
Understanding Back-Translation at Scale | | | | 2018 |
WEIGHTED TRANSFORMER NETWORK FOR MACHINE TRANSLATION | | | | 2017 |
Attention Is All You Need | | | | 2017 |
NON-AUTOREGRESSIVE NEURAL MACHINE TRANSLATION | | | | 2017 |
Improving Neural Machine Translation with Conditional Sequence Generative Adversarial Nets | | | | 2017 |
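These machine-translation benchmarks are conventionally scored with BLEU. A minimal sketch using the sacrebleu package (a tooling choice of ours, not one the repository prescribes):

```python
import sacrebleu

# one hypothesis per sentence; references is a list of reference streams
hypotheses = ["the cat is on the mat"]
references = [["the cat sat on the mat"]]

bleu = sacrebleu.corpus_bleu(hypotheses, references)
print(bleu.score)  # corpus-level BLEU on a 0-100 scale
```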
3. Text Classification
Research Paper | Datasets | Metric | Source Code | Year |
---|---|---|---|---|
Learning Structured Text Representations | Yelp | Accuracy: 68.6 | | 2017 |
Attentive Convolution | Yelp | Accuracy: 67.36 | | 2017 |
4. Natural Language Inference
Leader board:
Stanford Natural Language Inference (SNLI)
Research Paper | Datasets | Metric | Source Code | Year |
---|---|---|---|---|
NATURAL LANGUAGE INFERENCE OVER INTERACTION SPACE | Stanford Natural Language Inference (SNLI) | Accuracy: 88.9 | Tensorflow | 2017 |
BERT-LARGE (ensemble) | Multi-Genre Natural Language Inference (MNLI) | | | 2018 |
5. Question Answering
Leader Board
Research Paper | Datasets | Metric | Source Code | Year |
---|---|---|---|---|
BERT-LARGE (ensemble) | The Stanford Question Answering Dataset | | | 2018 |
6. Named entity recognition
Research Paper | Datasets | Metric | Source Code | Year |
---|---|---|---|---|
Named Entity Recognition in Twitter using Images and Text | Ritter | | NOT FOUND | 2017 |
7. Abstractive Summarization
Research Paper | Datasets | Metric | Source Code | Year |
---|---|---|---|---|
Cutting-off redundant repeating generations for neural abstractive summarization | | | NOT YET AVAILABLE | 2017 |
Convolutional Sequence to Sequence | | | PyTorch | 2017 |
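Abstractive-summarization results like these are typically reported in ROUGE. A minimal sketch using Google's rouge-score package (an assumption on our part, not the papers' official scorers):

```python
from rouge_score import rouge_scorer

# ROUGE-1 (unigram overlap) and ROUGE-L (longest common subsequence)
scorer = rouge_scorer.RougeScorer(["rouge1", "rougeL"], use_stemmer=True)
scores = scorer.score(
    "the cat sat on the mat",          # reference summary
    "the cat was sitting on the mat",  # generated summary
)
print(scores["rouge1"].fmeasure, scores["rougeL"].fmeasure)
```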
8. Dependency Parsing
Research Paper | Datasets | Metric | Source Code | Year |
---|---|---|---|---|
Globally Normalized Transition-Based Neural Networks | | | | 2016 |
Computer Vision
1. Classification
Research Paper | Datasets | Metric | Source Code | Year |
---|---|---|---|---|
Dynamic Routing Between Capsules | | | | 2017 |
High-Performance Neural Networks for Visual Object Classification | | | | 2011 |
Giant AmoebaNet with GPipe | | | | 2018 |
ShakeDrop regularization | | | | 2017 |
Aggregated Residual Transformations for Deep Neural Networks | | | | 2017 |
Random Erasing Data Augmentation | | | Pytorch | 2017 |
EraseReLU: A Simple Way to Ease the Training of Deep Convolution Neural Networks | | | Pytorch | 2017 |
Dynamic Routing Between Capsules | | | | 2017 |
Learning Transferable Architectures for Scalable Image Recognition | | | | 2017 |
Squeeze-and-Excitation Networks | | | | 2017 |
Aggregated Residual Transformations for Deep Neural Networks | | | | 2016 |
2. Instance Segmentation
Research Paper | Datasets | Metric | Source Code | Year |
---|---|---|---|---|
Mask R-CNN | | | | 2017 |
3. Visual Question Answering
Research Paper | Datasets | Metric | Source Code | Year |
---|---|---|---|---|
Tips and Tricks for Visual Question Answering: Learnings from the 2017 Challenge | | | | 2017 |
4. Person Re-identification
Research Paper | Datasets | Metric | Source Code | Year |
---|---|---|---|---|
Random Erasing Data Augmentation | | | Pytorch | 2017 |
Speech
1. ASR
Research Paper | Datasets | Metric | Source Code | Year |
---|---|---|---|---|
The Microsoft 2017 Conversational Speech Recognition System | | | | 2017 |
The CAPIO 2017 Conversational Speech Recognition System | | | | 2017 |
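The standard metric for these conversational-speech systems is word error rate (WER). A minimal sketch using the jiwer package (a tooling choice of ours, not the repository's):

```python
import jiwer

# WER = (substitutions + deletions + insertions) / reference word count
reference = "the cat sat on the mat"
hypothesis = "the cat sat on mat"
print(jiwer.wer(reference, hypothesis))  # one deletion over six words ≈ 0.167
```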
Semi-supervised Learning
Computer Vision
Research Paper | Datasets | Metric | Source Code | Year |
---|---|---|---|---|
DISTRIBUTIONAL SMOOTHING WITH VIRTUAL ADVERSARIAL TRAINING | | | Theano | 2016 |
Virtual Adversarial Training: a Regularization Method for Supervised and Semi-supervised Learning | | | | 2017 |
Few Shot Object Detection | | | | 2017 |
Unlabeled Samples Generated by GAN Improve the Person Re-identification Baseline in vitro | | | Matconvnet | 2017 |
Unsupervised Learning
Computer Vision
1. Generative Model
Research Paper | Datasets | Metric | Source Code | Year |
---|---|---|---|---|
PROGRESSIVE GROWING OF GANS FOR IMPROVED QUALITY, STABILITY, AND VARIATION | Unsupervised CIFAR 10 | Inception score: 8.80 | Theano | 2017 |
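For context, the Inception score reported above is defined (following Salimans et al., 2016) as

```latex
\mathrm{IS}(G) = \exp\Big( \mathbb{E}_{x \sim p_g}\, D_{\mathrm{KL}}\big( p(y \mid x) \,\|\, p(y) \big) \Big)
```

where p(y | x) is an Inception network's class posterior for a generated sample x and p(y) is the marginal class distribution over generated samples; higher is better.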
NLP
Machine Translation
Research Paper | Datasets | Metric | Source Code | Year |
---|---|---|---|---|
UNSUPERVISED MACHINE TRANSLATION USING MONOLINGUAL CORPORA ONLY | | | | 2017 |
Unsupervised Neural Machine Translation with Weight Sharing | | | | 2018 |
Transfer Learning
Research Paper | Datasets | Metric | Source Code | Year |
---|---|---|---|---|
One Model To Learn Them All | | | | 2017 |
Reinforcement Learning
Research Paper | Datasets | Metric | Source Code | Year |
---|---|---|---|---|
Mastering the game of Go without human knowledge | the game of Go | Elo rating: 5185 | | 2017 |
Email: yxt.stoaml@gmail.com