awesome-object-detection

Awesome Object Detection based on handong1587 github: https://handong1587.github.io/deep_learning/2015/10/09/object-detection.html

7,492

1,937

7,492

View on GitHub

Top Related Projects

detectron2

33,428

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

models

77,618

Models and examples built with TensorFlow

Mask_RCNN

25,251

Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

mmdetection

31,487

OpenMMLab Detection Toolbox and Benchmark

yolov5

55,546

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

darknet

22,101

YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )

Quick Overview

The amusi/awesome-object-detection repository is a curated list of object detection resources, including papers, datasets, and toolboxes. It serves as a comprehensive guide for researchers and practitioners in the field of computer vision, specifically focusing on object detection techniques and advancements.

Pros

Extensive collection of resources covering various aspects of object detection
Well-organized structure, categorizing content by type (papers, datasets, toolboxes)
Regularly updated with new research and tools
Includes both classic and state-of-the-art methods

Cons

May be overwhelming for beginners due to the large amount of information
Lacks detailed explanations or summaries of individual resources
Some links may become outdated over time
Limited coverage of implementation details or code examples

As this is not a code library but rather a curated list of resources, there are no code examples or getting started instructions to provide.

Competitor Comparisons

detectron2

33,428

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Pros of Detectron2

Comprehensive implementation of state-of-the-art object detection algorithms
Highly modular and extensible architecture for easy customization
Extensive documentation and active community support

Cons of Detectron2

Steeper learning curve for beginners due to its complexity
Requires more computational resources for training and inference
Limited to PyTorch framework

Code Comparison

Detectron2 (implementation):

import detectron2
from detectron2.engine import DefaultPredictor
from detectron2.config import get_cfg

cfg = get_cfg()
cfg.merge_from_file("path/to/config.yaml")
predictor = DefaultPredictor(cfg)
outputs = predictor(image)

awesome-object-detection (curated list):

## Papers

### 2019
- [Objects as Points](https://arxiv.org/abs/1904.07850) - ECCV 2020

## Datasets

- [COCO](http://cocodataset.org/)
- [Pascal VOC](http://host.robots.ox.ac.uk/pascal/VOC/)

awesome-object-detection serves as a comprehensive resource for object detection research, while Detectron2 provides a practical implementation framework. The former is ideal for staying updated with the latest developments, while the latter is better suited for hands-on experimentation and deployment of object detection models.

models

77,618

Models and examples built with TensorFlow

Pros of models

Comprehensive collection of official TensorFlow models and examples
Actively maintained by Google's TensorFlow team
Includes implementations for various tasks beyond object detection

Cons of models

Larger and more complex repository structure
Steeper learning curve for beginners
Focused solely on TensorFlow implementations

Code comparison

models:

import tensorflow as tf
from object_detection.utils import label_map_util
from object_detection.utils import visualization_utils as viz_utils

model = tf.saved_model.load('path/to/saved_model')
category_index = label_map_util.create_category_index_from_labelmap('path/to/labelmap.pbtxt')

awesome-object-detection:

# No direct code examples provided
# Repository serves as a curated list of resources and papers

Summary

models is a comprehensive repository maintained by Google, offering official TensorFlow implementations for various machine learning tasks, including object detection. It provides a wide range of models and examples but may be more complex for beginners.

awesome-object-detection is a curated list of object detection resources, papers, and implementations across different frameworks. It serves as a valuable reference for researchers and practitioners but doesn't provide direct code implementations.

While models focuses on TensorFlow-specific implementations, awesome-object-detection offers a broader overview of object detection across multiple frameworks and approaches.

Mask_RCNN

25,251

Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

Pros of Mask_RCNN

Provides a complete implementation of Mask R-CNN for object detection and instance segmentation
Includes pre-trained models and easy-to-use inference code
Offers detailed documentation and examples for training on custom datasets

Cons of Mask_RCNN

Focuses solely on Mask R-CNN, limiting the scope of object detection techniques
May require more computational resources for training and inference
Less frequently updated compared to awesome-object-detection

Code Comparison

Mask_RCNN:

import mrcnn.model as modellib
model = modellib.MaskRCNN(mode="inference", config=config, model_dir=MODEL_DIR)
model.load_weights(COCO_MODEL_PATH, by_name=True)
results = model.detect([image], verbose=1)

awesome-object-detection: No direct code implementation, as it's a curated list of resources.

Summary

Mask_RCNN provides a specific implementation of the Mask R-CNN algorithm, offering pre-trained models and easy-to-use code for object detection and instance segmentation. In contrast, awesome-object-detection is a comprehensive collection of resources, papers, and implementations covering various object detection techniques. While Mask_RCNN is more focused and practical for immediate use, awesome-object-detection offers a broader overview of the field, making it valuable for research and exploration of different approaches.

mmdetection

31,487

OpenMMLab Detection Toolbox and Benchmark

Pros of mmdetection

Comprehensive implementation of various object detection algorithms
Modular design allowing easy customization and extension
Extensive documentation and active community support

Cons of mmdetection

Steeper learning curve due to its complexity
Requires more computational resources for training and inference

Code comparison

mmdetection:

from mmdet.apis import init_detector, inference_detector

config_file = 'configs/faster_rcnn/faster_rcnn_r50_fpn_1x_coco.py'
checkpoint_file = 'checkpoints/faster_rcnn_r50_fpn_1x_coco_20200130-047c8118.pth'
model = init_detector(config_file, checkpoint_file, device='cuda:0')
result = inference_detector(model, 'test.jpg')

awesome-object-detection: (Note: This repository is a curated list, not a code implementation)

No direct code comparison available as awesome-object-detection
is a curated list of resources rather than a code implementation.

Summary

mmdetection is a comprehensive object detection toolbox with extensive implementations and customization options, while awesome-object-detection serves as a curated list of object detection resources. mmdetection offers hands-on implementation but requires more technical expertise, whereas awesome-object-detection provides a broader overview of available techniques and papers in the field.

yolov5

55,546

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Pros of YOLOv5

Provides a complete, ready-to-use object detection implementation
Offers high performance and real-time detection capabilities
Includes extensive documentation and examples for easy integration

Cons of YOLOv5

Focuses solely on YOLO-based object detection
May have a steeper learning curve for beginners
Limited to a single object detection approach

Code Comparison

YOLOv5:

import torch

# Load YOLOv5 model
model = torch.hub.load('ultralytics/yolov5', 'yolov5s')

# Perform inference
results = model('image.jpg')

awesome-object-detection:

# No direct code implementation, as it's a curated list of resources
# Example of a linked paper:
- [You Only Look Once: Unified, Real-Time Object Detection](https://arxiv.org/abs/1506.02640)

Summary

YOLOv5 is a complete implementation of the YOLO object detection algorithm, offering high performance and real-time capabilities. It provides ready-to-use code and extensive documentation. However, it focuses solely on YOLO-based detection and may be more complex for beginners.

awesome-object-detection is a curated list of object detection resources, papers, and implementations. It covers a wide range of approaches but doesn't provide direct code implementation. It's an excellent starting point for research and exploration of various object detection methods.

darknet

22,101

YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )

Pros of darknet

Provides a complete, ready-to-use implementation of YOLO and other object detection algorithms
Offers GPU acceleration for faster training and inference
Includes pre-trained models and supports various input formats

Cons of darknet

Limited to C and CUDA implementations, less accessible for Python users
Focuses primarily on YOLO-based architectures, less diverse than awesome-object-detection
Requires more setup and configuration compared to a curated list of resources

Code comparison

darknet:

#include "darknet.h"

int main(int argc, char **argv)
{
    // ... YOLO initialization and detection code
}

awesome-object-detection:

## Papers

### 2014
- **[R-CNN]** Rich feature hierarchies for accurate object detection and semantic segmentation | Ross Girshick, Jeff Donahue, Trevor Darrell, Jitendra Malik | **[CVPR' 14]** |[`[pdf]`](https://arxiv.org/pdf/1311.2524.pdf) [`[official code - caffe]`](https://github.com/rbgirshick/rcnn) 

// ... more paper listings and resources

Summary

darknet is a complete implementation of object detection algorithms, particularly YOLO, offering ready-to-use code and GPU acceleration. awesome-object-detection, on the other hand, is a comprehensive curated list of object detection resources, papers, and implementations across various frameworks. While darknet provides a more hands-on approach for YOLO-based detection, awesome-object-detection offers a broader overview of the field and diverse implementations.

Convert designs to code with AI

Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.

Try Visual Copilot

README

object-detection

[TOC]

This is a list of awesome articles about object detection. If you want to read the paper according to time, you can refer to Date.

R-CNN
Fast R-CNN
Faster R-CNN
Mask R-CNN
Light-Head R-CNN
Cascade R-CNN
SPP-Net
YOLO
YOLOv2
YOLOv3
YOLT
SSD
DSSD
FSSD
ESSD
MDSSD
Pelee
Fire SSD
R-FCN
FPN
DSOD
RetinaNet
MegDet
RefineNet
DetNet
SSOD
CornerNet
M2Det
3D Object Detection
ZSDï¼Zero-Shot Object Detectionï¼
OSDï¼One-Shot object Detectionï¼
Weakly Supervised Object Detection
Softer-NMS
2018
2019
Other

Based on handong1587's github: https://handong1587.github.io/deep_learning/2015/10/09/object-detection.html

Survey

Imbalance Problems in Object Detection: A Review

intro: under review at TPAMI
arXiv: https://arxiv.org/abs/1909.00169

Recent Advances in Deep Learning for Object Detection

intro: From 2013 (OverFeat) to 2019 (DetNAS)
arXiv: https://arxiv.org/abs/1908.03673

A Survey of Deep Learning-based Object Detection

introï¼From Fast R-CNN to NAS-FPN
arXivï¼https://arxiv.org/abs/1907.09408

Object Detection in 20 Years: A Survey

introï¼This work has been submitted to the IEEE TPAMI for possible publication
arXivï¼https://arxiv.org/abs/1905.05055

ãRecent Advances in Object Detection in the Age of Deep Convolutional Neural Networksã

intro: awesome
arXiv: https://arxiv.org/abs/1809.03193

ãDeep Learning for Generic Object Detection: A Surveyã

intro: Submitted to IJCV 2018
arXiv: https://arxiv.org/abs/1809.02165

Papers&Codes

R-CNN

Rich feature hierarchies for accurate object detection and semantic segmentation

intro: R-CNN
arxiv: http://arxiv.org/abs/1311.2524
supp: http://people.eecs.berkeley.edu/~rbg/papers/r-cnn-cvpr-supp.pdf
slides: http://www.image-net.org/challenges/LSVRC/2013/slides/r-cnn-ilsvrc2013-workshop.pdf
slides: http://www.cs.berkeley.edu/~rbg/slides/rcnn-cvpr14-slides.pdf
github: https://github.com/rbgirshick/rcnn
notes: http://zhangliliang.com/2014/07/23/paper-note-rcnn/
caffe-pr("Make R-CNN the Caffe detection example"): https://github.com/BVLC/caffe/pull/482

Fast R-CNN

Fast R-CNN

arxiv: http://arxiv.org/abs/1504.08083
slides: http://tutorial.caffe.berkeleyvision.org/caffe-cvpr15-detection.pdf
github: https://github.com/rbgirshick/fast-rcnn
github(COCO-branch): https://github.com/rbgirshick/fast-rcnn/tree/coco
webcam demo: https://github.com/rbgirshick/fast-rcnn/pull/29
notes: http://zhangliliang.com/2015/05/17/paper-note-fast-rcnn/
notes: http://blog.csdn.net/linj_m/article/details/48930179
github("Fast R-CNN in MXNet"): https://github.com/precedenceguo/mx-rcnn
github: https://github.com/mahyarnajibi/fast-rcnn-torch
github: https://github.com/apple2373/chainer-simple-fast-rnn
github: https://github.com/zplizzi/tensorflow-fast-rcnn

A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection

intro: CVPR 2017
arxiv: https://arxiv.org/abs/1704.03414
paper: http://abhinavsh.info/papers/pdfs/adversarial_object_detection.pdf
github(Caffe): https://github.com/xiaolonw/adversarial-frcnn

Faster R-CNN

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

intro: NIPS 2015
arxiv: http://arxiv.org/abs/1506.01497
gitxiv: http://www.gitxiv.com/posts/8pfpcvefDYn2gSgXk/faster-r-cnn-towards-real-time-object-detection-with-region
slides: http://web.cs.hacettepe.edu.tr/~aykut/classes/spring2016/bil722/slides/w05-FasterR-CNN.pdf
github(official, Matlab): https://github.com/ShaoqingRen/faster_rcnn
github(Caffe): https://github.com/rbgirshick/py-faster-rcnn
github(MXNet): https://github.com/msracver/Deformable-ConvNets/tree/master/faster_rcnn
github(PyTorch--recommend): https://github.com//jwyang/faster-rcnn.pytorch
github: https://github.com/mitmul/chainer-faster-rcnn
github(Torch):: https://github.com/andreaskoepf/faster-rcnn.torch
github(Torch):: https://github.com/ruotianluo/Faster-RCNN-Densecap-torch
github(TensorFlow): https://github.com/smallcorgi/Faster-RCNN_TF
github(TensorFlow): https://github.com/CharlesShang/TFFRCNN
github(C++ demo): https://github.com/YihangLou/FasterRCNN-Encapsulation-Cplusplus
github(Keras): https://github.com/yhenon/keras-frcnn
github: https://github.com/Eniac-Xie/faster-rcnn-resnet
github(C++): https://github.com/D-X-Y/caffe-faster-rcnn/tree/dev

R-CNN minus R

intro: BMVC 2015
arxiv: http://arxiv.org/abs/1506.06981

Faster R-CNN in MXNet with distributed implementation and data parallelization

github: https://github.com/dmlc/mxnet/tree/master/example/rcnn

Contextual Priming and Feedback for Faster R-CNN

intro: ECCV 2016. Carnegie Mellon University
paper: http://abhinavsh.info/context_priming_feedback.pdf
poster: http://www.eccv2016.org/files/posters/P-1A-20.pdf

An Implementation of Faster RCNN with Study for Region Sampling

intro: Technical Report, 3 pages. CMU
arxiv: https://arxiv.org/abs/1702.02138
github: https://github.com/endernewton/tf-faster-rcnn
github: https://github.com/ruotianluo/pytorch-faster-rcnn

Interpretable R-CNN

intro: North Carolina State University & Alibaba
keywords: AND-OR Graph (AOG)
arxiv: https://arxiv.org/abs/1711.05226

Domain Adaptive Faster R-CNN for Object Detection in the Wild

intro: CVPR 2018. ETH Zurich & ESAT/PSI
arxiv: https://arxiv.org/abs/1803.03243

Mask R-CNN

arxiv: http://arxiv.org/abs/1703.06870
github(Keras): https://github.com/matterport/Mask_RCNN
github(Caffe2): https://github.com/facebookresearch/Detectron
github(Pytorch): https://github.com/wannabeOG/Mask-RCNN
github(MXNet): https://github.com/TuSimple/mx-maskrcnn
github(Chainer): https://github.com/DeNA/Chainer_Mask_R-CNN

Light-Head R-CNN

Light-Head R-CNN: In Defense of Two-Stage Object Detector

intro: Tsinghua University & Megvii Inc
arxiv: https://arxiv.org/abs/1711.07264
github(offical): https://github.com/zengarden/light_head_rcnn
github: https://github.com/terrychenism/Deformable-ConvNets/blob/master/rfcn/symbols/resnet_v1_101_rfcn_light.py#L784

Cascade R-CNN

Cascade R-CNN: Delving into High Quality Object Detection

arxiv: https://arxiv.org/abs/1712.00726
github: https://github.com/zhaoweicai/cascade-rcnn

SPP-Net

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

intro: ECCV 2014 / TPAMI 2015
arxiv: http://arxiv.org/abs/1406.4729
github: https://github.com/ShaoqingRen/SPP_net
notes: http://zhangliliang.com/2014/09/13/paper-note-sppnet/

DeepID-Net: Deformable Deep Convolutional Neural Networks for Object Detection

intro: PAMI 2016
intro: an extension of R-CNN. box pre-training, cascade on region proposals, deformation layers and context representations
project page: http://www.ee.cuhk.edu.hk/%CB%9Cwlouyang/projects/imagenetDeepId/index.html
arxiv: http://arxiv.org/abs/1412.5661

Object Detectors Emerge in Deep Scene CNNs

intro: ICLR 2015
arxiv: http://arxiv.org/abs/1412.6856
paper: https://www.robots.ox.ac.uk/~vgg/rg/papers/zhou_iclr15.pdf
paper: https://people.csail.mit.edu/khosla/papers/iclr2015_zhou.pdf
slides: http://places.csail.mit.edu/slide_iclr2015.pdf

segDeepM: Exploiting Segmentation and Context in Deep Neural Networks for Object Detection

intro: CVPR 2015
project(code+data): https://www.cs.toronto.edu/~yukun/segdeepm.html
arxiv: https://arxiv.org/abs/1502.04275
github: https://github.com/YknZhu/segDeepM

Object Detection Networks on Convolutional Feature Maps

intro: TPAMI 2015
keywords: NoC
arxiv: http://arxiv.org/abs/1504.06066

Improving Object Detection with Deep Convolutional Networks via Bayesian Optimization and Structured Prediction

arxiv: http://arxiv.org/abs/1504.03293
slides: http://www.ytzhang.net/files/publications/2015-cvpr-det-slides.pdf
github: https://github.com/YutingZhang/fgs-obj

DeepBox: Learning Objectness with Convolutional Networks

keywords: DeepBox
arxiv: http://arxiv.org/abs/1505.02146
github: https://github.com/weichengkuo/DeepBox

YOLO

You Only Look Once: Unified, Real-Time Object Detection

arxiv: http://arxiv.org/abs/1506.02640
code: https://pjreddie.com/darknet/yolov1/
github: https://github.com/pjreddie/darknet
blog: https://pjreddie.com/darknet/yolov1/
slides: https://docs.google.com/presentation/d/1aeRvtKG21KHdD5lg6Hgyhx5rPq_ZOsGjG5rJ1HP7BbA/pub?start=false&loop=false&delayms=3000&slide=id.p
reddit: https://www.reddit.com/r/MachineLearning/comments/3a3m0o/realtime_object_detection_with_yolo/
github: https://github.com/gliese581gg/YOLO_tensorflow
github: https://github.com/xingwangsfu/caffe-yolo
github: https://github.com/frankzhangrui/Darknet-Yolo
github: https://github.com/BriSkyHekun/py-darknet-yolo
github: https://github.com/tommy-qichang/yolo.torch
github: https://github.com/frischzenger/yolo-windows
github: https://github.com/AlexeyAB/yolo-windows
github: https://github.com/nilboy/tensorflow-yolo

darkflow - translate darknet to tensorflow. Load trained weights, retrain/fine-tune them using tensorflow, export constant graph def to C++

Start Training YOLO with Our Own Data

intro: train with customized data and class numbers/labels. Linux / Windows version for darknet.
blog: http://guanghan.info/blog/en/my-works/train-yolo/
github: https://github.com/Guanghan/darknet

YOLO: Core ML versus MPSNNGraph

intro: Tiny YOLO for iOS implemented using CoreML but also using the new MPS graph API.
blog: http://machinethink.net/blog/yolo-coreml-versus-mps-graph/
github: https://github.com/hollance/YOLO-CoreML-MPSNNGraph

TensorFlow YOLO object detection on Android

intro: Real-time object detection on Android using the YOLO network with TensorFlow
github: https://github.com/natanielruiz/android-yolo

Computer Vision in iOS â Object Detection

YOLOv2

YOLO9000: Better, Faster, Stronger

arxiv: https://arxiv.org/abs/1612.08242
code: http://pjreddie.com/yolo9000/ https://pjreddie.com/darknet/yolov2/
github(Chainer): https://github.com/leetenki/YOLOv2
github(Keras): https://github.com/allanzelener/YAD2K
github(PyTorch): https://github.com/longcw/yolo2-pytorch
github(Tensorflow): https://github.com/hizhangp/yolo_tensorflow
github(Windows): https://github.com/AlexeyAB/darknet
github: https://github.com/choasUp/caffe-yolo9000
github: https://github.com/philipperemy/yolo-9000
github(TensorFlow): https://github.com/KOD-Chen/YOLOv2-Tensorflow
github(Keras): https://github.com/yhcc/yolo2
github(Keras): https://github.com/experiencor/keras-yolo2
github(TensorFlow): https://github.com/WojciechMormul/yolo2

darknet_scripts

intro: Auxilary scripts to work with (YOLO) darknet deep learning famework. AKA -> How to generate YOLO anchors?
github: https://github.com/Jumabek/darknet_scripts

Yolo_mark: GUI for marking bounded boxes of objects in images for training Yolo v2

github: https://github.com/AlexeyAB/Yolo_mark

LightNet: Bringing pjreddie's DarkNet out of the shadows

https://github.com//explosion/lightnet

YOLO v2 Bounding Box Tool

intro: Bounding box labeler tool to generate the training data in the format YOLO v2 requires.
github: https://github.com/Cartucho/yolo-boundingbox-labeler-GUI

Loss Rank Mining: A General Hard Example Mining Method for Real-time Detectors

intro: LRM is the first hard example mining strategy which could fit YOLOv2 perfectly and make it better applied in series of real scenarios where both real-time rates and accurate detection are strongly demanded.
arxiv: https://arxiv.org/abs/1804.04606

Object detection at 200 Frames Per Second

intro: faster than Tiny-Yolo-v2
arxiv: https://arxiv.org/abs/1805.06361

Event-based Convolutional Networks for Object Detection in Neuromorphic Cameras

intro: YOLE--Object Detection in Neuromorphic Cameras
arxiv:https://arxiv.org/abs/1805.07931

OmniDetector: With Neural Networks to Bounding Boxes

intro: a person detector on n fish-eye images of indoor scenesï¼NIPS 2018ï¼
arxiv:https://arxiv.org/abs/1805.08503
datasets:https://gitlab.com/omnidetector/omnidetector

YOLOv3

YOLOv3: An Incremental Improvement

arxiv:https://arxiv.org/abs/1804.02767
paper:https://pjreddie.com/media/files/papers/YOLOv3.pdf
code: https://pjreddie.com/darknet/yolo/
github(Official):https://github.com/pjreddie/darknet
github:https://github.com/mystic123/tensorflow-yolo-v3
github:https://github.com/experiencor/keras-yolo3
github:https://github.com/qqwweee/keras-yolo3
github:https://github.com/marvis/pytorch-yolo3
github:https://github.com/ayooshkathuria/pytorch-yolo-v3
github:https://github.com/ayooshkathuria/YOLO_v3_tutorial_from_scratch
github:https://github.com/eriklindernoren/PyTorch-YOLOv3
github:https://github.com/ultralytics/yolov3
github:https://github.com/BobLiu20/YOLOv3_PyTorch
github:https://github.com/andy-yun/pytorch-0.4-yolov3
github:https://github.com/DeNA/PyTorch_YOLOv3

YOLT

You Only Look Twice: Rapid Multi-Scale Object Detection In Satellite Imagery

intro: Small Object Detection
arxiv:https://arxiv.org/abs/1805.09512
github:https://github.com/avanetten/yolt

SSD

SSD: Single Shot MultiBox Detector

intro: ECCV 2016 Oral
arxiv: http://arxiv.org/abs/1512.02325
paper: http://www.cs.unc.edu/~wliu/papers/ssd.pdf
slides: http://www.cs.unc.edu/%7Ewliu/papers/ssd_eccv2016_slide.pdf
github(Official): https://github.com/weiliu89/caffe/tree/ssd
video: http://weibo.com/p/2304447a2326da963254c963c97fb05dd3a973
github: https://github.com/zhreshold/mxnet-ssd
github: https://github.com/zhreshold/mxnet-ssd.cpp
github: https://github.com/rykov8/ssd_keras
github: https://github.com/balancap/SSD-Tensorflow
github: https://github.com/amdegroot/ssd.pytorch
github(Caffe): https://github.com/chuanqi305/MobileNet-SSD

What's the diffience in performance between this new code you pushed and the previous code? #327

https://github.com/weiliu89/caffe/issues/327

DSSD

DSSD : Deconvolutional Single Shot Detector

intro: UNC Chapel Hill & Amazon Inc
arxiv: https://arxiv.org/abs/1701.06659
github: https://github.com/chengyangfu/caffe/tree/dssd
github: https://github.com/MTCloudVision/mxnet-dssd
demo: http://120.52.72.53/www.cs.unc.edu/c3pr90ntc0td/~cyfu/dssd_lalaland.mp4

Enhancement of SSD by concatenating feature maps for object detection

intro: rainbow SSD (R-SSD)
arxiv: https://arxiv.org/abs/1705.09587

Context-aware Single-Shot Detector

keywords: CSSD, DiCSSD, DeCSSD, effective receptive fields (ERFs), theoretical receptive fields (TRFs)
arxiv: https://arxiv.org/abs/1707.08682

Feature-Fused SSD: Fast Detection for Small Objects

https://arxiv.org/abs/1709.05054

FSSD

FSSD: Feature Fusion Single Shot Multibox Detector

https://arxiv.org/abs/1712.00960

Weaving Multi-scale Context for Single Shot Detector

intro: WeaveNet
keywords: fuse multi-scale information
arxiv: https://arxiv.org/abs/1712.03149

ESSD

Extend the shallow part of Single Shot MultiBox Detector via Convolutional Neural Network

https://arxiv.org/abs/1801.05918

Tiny SSD: A Tiny Single-shot Detection Deep Convolutional Neural Network for Real-time Embedded Object Detection

https://arxiv.org/abs/1802.06488

MDSSD

MDSSD: Multi-scale Deconvolutional Single Shot Detector for small objects

arxiv: https://arxiv.org/abs/1805.07009

Pelee

Pelee: A Real-Time Object Detection System on Mobile Devices

https://github.com/Robert-JunWang/Pelee

intro: (ICLR 2018 workshop track)
arxiv: https://arxiv.org/abs/1804.06882
github: https://github.com/Robert-JunWang/Pelee

Fire SSD

Fire SSD: Wide Fire Modules based Single Shot Detector on Edge Device

intro:low cost, fast speed and high mAP on factor edge computing devices
arxiv:https://arxiv.org/abs/1806.05363

R-FCN

R-FCN: Object Detection via Region-based Fully Convolutional Networks

arxiv: http://arxiv.org/abs/1605.06409
github: https://github.com/daijifeng001/R-FCN
github(MXNet): https://github.com/msracver/Deformable-ConvNets/tree/master/rfcn
github: https://github.com/Orpine/py-R-FCN
github: https://github.com/PureDiors/pytorch_RFCN
github: https://github.com/bharatsingh430/py-R-FCN-multiGPU
github: https://github.com/xdever/RFCN-tensorflow

R-FCN-3000 at 30fps: Decoupling Detection and Classification

https://arxiv.org/abs/1712.01802

Recycle deep features for better object detection

arxiv: http://arxiv.org/abs/1607.05066

FPN

Feature Pyramid Networks for Object Detection

intro: Facebook AI Research
arxiv: https://arxiv.org/abs/1612.03144

Action-Driven Object Detection with Top-Down Visual Attentions

arxiv: https://arxiv.org/abs/1612.06704

Beyond Skip Connections: Top-Down Modulation for Object Detection

intro: CMU & UC Berkeley & Google Research
arxiv: https://arxiv.org/abs/1612.06851

Wide-Residual-Inception Networks for Real-time Object Detection

intro: Inha University
arxiv: https://arxiv.org/abs/1702.01243

Attentional Network for Visual Object Detection

intro: University of Maryland & Mitsubishi Electric Research Laboratories
arxiv: https://arxiv.org/abs/1702.01478

Learning Chained Deep Features and Classifiers for Cascade in Object Detection

keykwords: CC-Net
intro: chained cascade network (CC-Net). 81.1% mAP on PASCAL VOC 2007
arxiv: https://arxiv.org/abs/1702.07054

DeNet: Scalable Real-time Object Detection with Directed Sparse Sampling

intro: ICCV 2017 (poster)
arxiv: https://arxiv.org/abs/1703.10295

Discriminative Bimodal Networks for Visual Localization and Detection with Natural Language Queries

intro: CVPR 2017
arxiv: https://arxiv.org/abs/1704.03944

Spatial Memory for Context Reasoning in Object Detection

arxiv: https://arxiv.org/abs/1704.04224

Accurate Single Stage Detector Using Recurrent Rolling Convolution

intro: CVPR 2017. SenseTime
keywords: Recurrent Rolling Convolution (RRC)
arxiv: https://arxiv.org/abs/1704.05776
github: https://github.com/xiaohaoChen/rrc_detection

Deep Occlusion Reasoning for Multi-Camera Multi-Target Detection

https://arxiv.org/abs/1704.05775

LCDet: Low-Complexity Fully-Convolutional Neural Networks for Object Detection in Embedded Systems

intro: Embedded Vision Workshop in CVPR. UC San Diego & Qualcomm Inc
arxiv: https://arxiv.org/abs/1705.05922

Point Linking Network for Object Detection

intro: Point Linking Network (PLN)
arxiv: https://arxiv.org/abs/1706.03646

Perceptual Generative Adversarial Networks for Small Object Detection

https://arxiv.org/abs/1706.05274

Few-shot Object Detection

https://arxiv.org/abs/1706.08249

Yes-Net: An effective Detector Based on Global Information

https://arxiv.org/abs/1706.09180

SMC Faster R-CNN: Toward a scene-specialized multi-object detector

https://arxiv.org/abs/1706.10217

Towards lightweight convolutional neural networks for object detection

https://arxiv.org/abs/1707.01395

RON: Reverse Connection with Objectness Prior Networks for Object Detection

Mimicking Very Efficient Network for Object Detection

intro: CVPR 2017. SenseTime & Beihang University
paper: http://openaccess.thecvf.com/content_cvpr_2017/papers/Li_Mimicking_Very_Efficient_CVPR_2017_paper.pdf

Residual Features and Unified Prediction Network for Single Stage Detection

https://arxiv.org/abs/1707.05031

Deformable Part-based Fully Convolutional Network for Object Detection

intro: BMVC 2017 (oral). Sorbonne UniversitÃ©s & CEDRIC
arxiv: https://arxiv.org/abs/1707.06175

Adaptive Feeding: Achieving Fast and Accurate Detections by Adaptively Combining Object Detectors

intro: ICCV 2017
arxiv: https://arxiv.org/abs/1707.06399

Recurrent Scale Approximation for Object Detection in CNN

intro: ICCV 2017
keywords: Recurrent Scale Approximation (RSA)
arxiv: https://arxiv.org/abs/1707.09531
github: https://github.com/sciencefans/RSA-for-object-detection

DSOD

DSOD: Learning Deeply Supervised Object Detectors from Scratch

intro: ICCV 2017. Fudan University & Tsinghua University & Intel Labs China
arxiv: https://arxiv.org/abs/1708.01241
github: https://github.com/szq0214/DSOD
github:https://github.com/Windaway/DSOD-Tensorflow
github:https://github.com/chenyuntc/dsod.pytorch

Learning Object Detectors from Scratch with Gated Recurrent Feature Pyramids

arxiv:https://arxiv.org/abs/1712.00886
github:https://github.com/szq0214/GRP-DSOD

Tiny-DSOD: Lightweight Object Detection for Resource-Restricted Usages

intro: BMVC 2018
arXiv: https://arxiv.org/abs/1807.11013

Object Detection from Scratch with Deep Supervision

intro: This is an extended version of DSOD
arXiv: https://arxiv.org/abs/1809.09294

RetinaNet

Focal Loss for Dense Object Detection

intro: ICCV 2017 Best student paper award. Facebook AI Research
keywords: RetinaNet
arxiv: https://arxiv.org/abs/1708.02002

CoupleNet: Coupling Global Structure with Local Parts for Object Detection

intro: ICCV 2017
arxiv: https://arxiv.org/abs/1708.02863

Incremental Learning of Object Detectors without Catastrophic Forgetting

intro: ICCV 2017. Inria
arxiv: https://arxiv.org/abs/1708.06977

Zoom Out-and-In Network with Map Attention Decision for Region Proposal and Object Detection

https://arxiv.org/abs/1709.04347

StairNet: Top-Down Semantic Aggregation for Accurate One Shot Detection

https://arxiv.org/abs/1709.05788

Dynamic Zoom-in Network for Fast Object Detection in Large Images

https://arxiv.org/abs/1711.05187

Zero-Annotation Object Detection with Web Knowledge Transfer

intro: NTU, Singapore & Amazon
keywords: multi-instance multi-label domain adaption learning framework
arxiv: https://arxiv.org/abs/1711.05954

MegDet

MegDet: A Large Mini-Batch Object Detector

intro: Peking University & Tsinghua University & Megvii Inc
arxiv: https://arxiv.org/abs/1711.07240

Receptive Field Block Net for Accurate and Fast Object Detection

intro: RFBNet
arxiv: https://arxiv.org/abs/1711.07767
github: https://github.com//ruinmessi/RFBNet

An Analysis of Scale Invariance in Object Detection - SNIP

arxiv: https://arxiv.org/abs/1711.08189
github: https://github.com/bharatsingh430/snip

Feature Selective Networks for Object Detection

https://arxiv.org/abs/1711.08879

Learning a Rotation Invariant Detector with Rotatable Bounding Box

arxiv: https://arxiv.org/abs/1711.09405
github: https://github.com/liulei01/DRBox

Scalable Object Detection for Stylized Objects

intro: Microsoft AI & Research Munich
arxiv: https://arxiv.org/abs/1711.09822

Learning Object Detectors from Scratch with Gated Recurrent Feature Pyramids

arxiv: https://arxiv.org/abs/1712.00886
github: https://github.com/szq0214/GRP-DSOD

Deep Regionlets for Object Detection

keywords: region selection network, gating network
arxiv: https://arxiv.org/abs/1712.02408

Training and Testing Object Detectors with Virtual Images

intro: IEEE/CAA Journal of Automatica Sinica
arxiv: https://arxiv.org/abs/1712.08470

Large-Scale Object Discovery and Detector Adaptation from Unlabeled Video

keywords: object mining, object tracking, unsupervised object discovery by appearance-based clustering, self-supervised detector adaptation
arxiv: https://arxiv.org/abs/1712.08832

Spot the Difference by Object Detection

intro: Tsinghua University & JD Group
arxiv: https://arxiv.org/abs/1801.01051

Localization-Aware Active Learning for Object Detection

arxiv: https://arxiv.org/abs/1801.05124

Object Detection with Mask-based Feature Encoding

arxiv: https://arxiv.org/abs/1802.03934

LSTD: A Low-Shot Transfer Detector for Object Detection

intro: AAAI 2018
arxiv: https://arxiv.org/abs/1803.01529

Pseudo Mask Augmented Object Detection

https://arxiv.org/abs/1803.05858

Revisiting RCNN: On Awakening the Classification Power of Faster RCNN

https://arxiv.org/abs/1803.06799

Learning Region Features for Object Detection

intro: Peking University & MSRA
arxiv: https://arxiv.org/abs/1803.07066

Single-Shot Bidirectional Pyramid Networks for High-Quality Object Detection

intro: Singapore Management University & Zhejiang University
arxiv: https://arxiv.org/abs/1803.08208

Object Detection for Comics using Manga109 Annotations

intro: University of Tokyo & National Institute of Informatics, Japan
arxiv: https://arxiv.org/abs/1803.08670

Task-Driven Super Resolution: Object Detection in Low-resolution Images

arxiv: https://arxiv.org/abs/1803.11316

Transferring Common-Sense Knowledge for Object Detection

arxiv: https://arxiv.org/abs/1804.01077

Multi-scale Location-aware Kernel Representation for Object Detection

Loss Rank Mining: A General Hard Example Mining Method for Real-time Detectors

intro: National University of Defense Technology
arxiv: https://arxiv.org/abs/1804.04606

Robust Physical Adversarial Attack on Faster R-CNN Object Detector

arxiv: https://arxiv.org/abs/1804.05810

RefineNet

Single-Shot Refinement Neural Network for Object Detection

intro: CVPR 2018
arxiv: https://arxiv.org/abs/1711.06897
github: https://github.com/sfzhang15/RefineDet
github: https://github.com/lzx1413/PytorchSSD
github: https://github.com/ddlee96/RefineDet_mxnet
github: https://github.com/MTCloudVision/RefineDet-Mxnet

DetNet

DetNet: A Backbone network for Object Detection

intro: Tsinghua University & Face++
arxiv: https://arxiv.org/abs/1804.06215

SSOD

Self-supervisory Signals for Object Discovery and Detection

Google Brain
arxiv:https://arxiv.org/abs/1806.03370

CornerNet

CornerNet: Detecting Objects as Paired Keypoints

intro: ECCV 2018
arXiv: https://arxiv.org/abs/1808.01244
github: https://github.com/umich-vl/CornerNet

M2Det

M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network

intro: AAAI 2019
arXiv: https://arxiv.org/abs/1811.04533
github: https://github.com/qijiezhao/M2Det

3D Object Detection

3D Backbone Network for 3D Object Detection

arXiv: https://arxiv.org/abs/1901.08373

LMNet: Real-time Multiclass Object Detection on CPU using 3D LiDARs

ZSDï¼Zero-Shot Object Detectionï¼

Zero-Shot Detection

intro: Australian National University
keywords: YOLO
arxiv: https://arxiv.org/abs/1803.07113

Zero-Shot Object Detection

arxiv: https://arxiv.org/abs/1804.04340

Zero-Shot Object Detection: Learning to Simultaneously Recognize and Localize Novel Concepts

arxiv: https://arxiv.org/abs/1803.06049

Zero-Shot Object Detection by Hybrid Region Embedding

arxiv: https://arxiv.org/abs/1805.06157

OSDï¼One-Shot Object Detectionï¼

Comparison Network for One-Shot Conditional Object Detection

arXiv: https://arxiv.org/abs/1904.02317

One-Shot Object Detection

RepMet: Representative-based metric learning for classification and one-shot object detection

intro: IBM Research AI
arxiv:https://arxiv.org/abs/1806.04728
github: TODO

Weakly Supervised Object Detection

Weakly Supervised Object Detection in Artworks

intro: ECCV 2018 Workshop Computer Vision for Art Analysis
arXiv: https://arxiv.org/abs/1810.02569
Datasets: https://wsoda.telecom-paristech.fr/downloads/dataset/IconArt_v1.zip

Cross-Domain Weakly-Supervised Object Detection through Progressive Domain Adaptation

intro: CVPR 2018
arXiv: https://arxiv.org/abs/1803.11365
homepage: https://naoto0804.github.io/cross_domain_detection/
paper: http://openaccess.thecvf.com/content_cvpr_2018/html/Inoue_Cross-Domain_Weakly-Supervised_Object_CVPR_2018_paper.html
github: https://github.com/naoto0804/cross-domain-detection

Softer-NMS

ãSofter-NMS: Rethinking Bounding Box Regression for Accurate Object Detectionã

intro: CMU & Face++
arXiv: https://arxiv.org/abs/1809.08545
github: https://github.com/yihui-he/softer-NMS

2019

Feature Selective Anchor-Free Module for Single-Shot Object Detection

intro: CVPR 2019
arXiv: https://arxiv.org/abs/1903.00621

Object Detection based on Region Decomposition and Assembly

intro: AAAI 2019
arXiv: https://arxiv.org/abs/1901.08225

Bottom-up Object Detection by Grouping Extreme and Center Points

intro: one stage 43.2% on COCO test-dev
arXiv: https://arxiv.org/abs/1901.08043
github: https://github.com/xingyizhou/ExtremeNet

ORSIm Detector: A Novel Object Detection Framework in Optical Remote Sensing Imagery Using Spatial-Frequency Channel Features

intro: IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING
arXiv: https://arxiv.org/abs/1901.07925

Consistent Optimization for Single-Shot Object Detection

intro: improves RetinaNet from 39.1 AP to 40.1 AP on COCO datase
arXiv: https://arxiv.org/abs/1901.06563

Learning Pairwise Relationship for Multi-object Detection in Crowded Scenes

arXiv: https://arxiv.org/abs/1901.03796

RetinaMask: Learning to predict masks improves state-of-the-art single-shot detection for free

arXiv: https://arxiv.org/abs/1901.03353
github: https://github.com/chengyangfu/retinamask

Region Proposal by Guided Anchoring

intro: CUHK - SenseTime Joint Lab
arXiv: https://arxiv.org/abs/1901.03278

Scale-Aware Trident Networks for Object Detection

intro: mAP of 48.4 on the COCO dataset
arXiv: https://arxiv.org/abs/1901.01892

2018

Large-Scale Object Detection of Images from Network Cameras in Variable Ambient Lighting Conditions

arXiv: https://arxiv.org/abs/1812.11901

Strong-Weak Distribution Alignment for Adaptive Object Detection

arXiv: https://arxiv.org/abs/1812.04798

AutoFocus: Efficient Multi-Scale Inference

intro: AutoFocus obtains an mAP of 47.9% (68.3% at 50% overlap) on the COCO test-dev set while processing 6.4 images per second on a Titan X (Pascal) GPU
arXiv: https://arxiv.org/abs/1812.01600

NOTE-RCNN: NOise Tolerant Ensemble RCNN for Semi-Supervised Object Detection

intro: Google Could
arXiv: https://arxiv.org/abs/1812.00124

SPLAT: Semantic Pixel-Level Adaptation Transforms for Detection

intro: UC Berkeley
arXiv: https://arxiv.org/abs/1812.00929

Grid R-CNN

intro: SenseTime
arXiv: https://arxiv.org/abs/1811.12030

Deformable ConvNets v2: More Deformable, Better Results

intro: Microsoft Research Asia
arXiv: https://arxiv.org/abs/1811.11168

Anchor Box Optimization for Object Detection

intro: Microsoft Research
arXiv: https://arxiv.org/abs/1812.00469

Efficient Coarse-to-Fine Non-Local Module for the Detection of Small Objects

intro: https://arxiv.org/abs/1811.12152

NOTE-RCNN: NOise Tolerant Ensemble RCNN for Semi-Supervised Object Detection

arXiv: https://arxiv.org/abs/1812.00124

Learning RoI Transformer for Detecting Oriented Objects in Aerial Images

arXiv: https://arxiv.org/abs/1812.00155

Integrated Object Detection and Tracking with Tracklet-Conditioned Detection

intro: Microsoft Research Asia
arXiv: https://arxiv.org/abs/1811.11167

Deep Regionlets: Blended Representation and Deep Learning for Generic Object Detection

arXiv: https://arxiv.org/abs/1811.11318

Gradient Harmonized Single-stage Detector

intro: AAAI 2019
arXiv: https://arxiv.org/abs/1811.05181

CFENet: Object Detection with Comprehensive Feature Enhancement Module

intro: ACCV 2018
github: https://github.com/qijiezhao/CFENet

DeRPN: Taking a further step toward more general object detection

intro: AAAI 2019
arXiv: https://arxiv.org/abs/1811.06700
github: https://github.com/HCIILAB/DeRPN

Hybrid Knowledge Routed Modules for Large-scale Object Detection

intro: Sun Yat-Sen University & Huawei Noahâs Ark Lab
arXiv: https://arxiv.org/abs/1810.12681
github: https://github.com/chanyn/HKRM

ãReceptive Field Block Net for Accurate and Fast Object Detectionã

intro: ECCV 2018
arXiv: https://arxiv.org/abs/1711.07767
github: https://github.com/ruinmessi/RFBNet

Deep Feature Pyramid Reconfiguration for Object Detection

intro: ECCV 2018
arXiv: https://arxiv.org/abs/1808.07993

Unsupervised Hard Example Mining from Videos for Improved Object Detection

intro: ECCV 2018
arXiv: https://arxiv.org/abs/1808.04285

Acquisition of Localization Confidence for Accurate Object Detection

intro: ECCV 2018
arXiv: https://arxiv.org/abs/1807.11590
github: https://github.com/vacancy/PreciseRoIPooling

Toward Scale-Invariance and Position-Sensitive Region Proposal Networks

intro: ECCV 2018
arXiv: https://arxiv.org/abs/1807.09528

MetaAnchor: Learning to Detect Objects with Customized Anchors

arxiv: https://arxiv.org/abs/1807.00980

Relation Network for Object Detection

intro: CVPR 2018
arxiv: https://arxiv.org/abs/1711.11575
github:https://github.com/msracver/Relation-Networks-for-Object-Detection

Quantization Mimic: Towards Very Tiny CNN for Object Detection

Tsinghua University1 & The Chinese University of Hong Kong2 &SenseTime3
arxiv: https://arxiv.org/abs/1805.02152

Learning Rich Features for Image Manipulation Detection

intro: CVPR 2018 Camera Ready
arxiv: https://arxiv.org/abs/1805.04953

SNIPER: Efficient Multi-Scale Training

arxiv:https://arxiv.org/abs/1805.09300
github:https://github.com/mahyarnajibi/SNIPER

Soft Sampling for Robust Object Detection

intro: the robustness of object detection under the presence of missing annotations
arxiv:https://arxiv.org/abs/1806.06986

Cost-effective Object Detection: Active Sample Mining with Switchable Selection Criteria

intro: TNNLS 2018
arxiv:https://arxiv.org/abs/1807.00147
code: http://kezewang.com/codes/ASM_ver1.zip

Other

R3-Net: A Deep Network for Multi-oriented Vehicle Detection in Aerial Images and Videos

arxiv: https://arxiv.org/abs/1808.05560
youtube: https://youtu.be/xCYD-tYudN0

Detection Toolbox

Detectron(FAIR): Detectron is Facebook AI Research's software system that implements state-of-the-art object detection algorithms, including Mask R-CNN. It is written in Python and powered by the Caffe2 deep learning framework.
Detectron2: Detectron2 is FAIR's next-generation research platform for object detection and segmentation.
maskrcnn-benchmark(FAIR): Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.
mmdetection(SenseTime&CUHK): mmdetection is an open source object detection toolbox based on PyTorch. It is a part of the open-mmlab project developed by Multimedia Laboratory, CUHK.

Top Related Projects

Convert designs to code with AI

Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.

Try Visual Copilot

Top Related Projects

Quick Overview

Pros

Cons

Competitor Comparisons

Pros of Detectron2

Cons of Detectron2

Code Comparison

Pros of models

Cons of models

Code comparison

Summary

Pros of Mask_RCNN

Cons of Mask_RCNN

Code Comparison

Summary

Pros of mmdetection

Cons of mmdetection

Code comparison

Summary

Pros of YOLOv5

Cons of YOLOv5

Code Comparison

Summary

Pros of darknet

Cons of darknet

Code comparison

Summary

Convert designs to code with AI

README

object-detection

Survey

Papers&Codes

R-CNN

Fast R-CNN

Faster R-CNN

Mask R-CNN

Light-Head R-CNN

Cascade R-CNN

SPP-Net

YOLO

YOLOv2

YOLOv3

YOLT

SSD

DSSD

FSSD

ESSD

MDSSD

Pelee

Fire SSD

R-FCN

FPN

DSOD

RetinaNet

MegDet

RefineNet

DetNet

SSOD

CornerNet

M2Det

3D Object Detection

ZSDï¼Zero-Shot Object Detectionï¼

OSDï¼One-Shot Object Detectionï¼

Weakly Supervised Object Detection

Softer-NMS

2019

2018

Other

Detection Toolbox

Top Related Projects

Convert designs to code with AI

ZSDï¼Zero-Shot Object Detectionï¼

OSDï¼One-Shot Object Detectionï¼