Convert Figma logo to code with AI

meituan logoYOLOv6

YOLOv6: a single-stage object detection framework dedicated to industrial applications.

5,732
1,037
5,732
269

Top Related Projects

51,450

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

13,305

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

21,700

YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )

OpenMMLab Detection Toolbox and Benchmark

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

1,992

implementation of paper - You Only Learn One Representation: Unified Network for Multiple Tasks (https://arxiv.org/abs/2105.04206)

Quick Overview

YOLOv6 is an object detection model developed by Meituan, designed for industrial applications. It offers state-of-the-art performance while maintaining high efficiency, making it suitable for various scenarios from embedded devices to large-scale server clusters.

Pros

  • High performance-to-speed ratio, suitable for real-time applications
  • Supports various hardware platforms, including NVIDIA, TensorRT, and NCNN
  • Provides different model sizes (N/T/S/M/L) for different application needs
  • Regularly updated with new features and improvements

Cons

  • Limited documentation compared to some other YOLO versions
  • Fewer pre-trained models available compared to more established object detection frameworks
  • May require more fine-tuning for specific use cases
  • Relatively new, so the community and ecosystem are still growing

Code Examples

  1. Loading a pre-trained YOLOv6 model:
from yolov6.utils.events import load_yaml
from yolov6.layers.common import DetectBackend
from yolov6.utils.nms import non_max_suppression

# Load model
model = DetectBackend('yolov6s.pt', device='cpu')
model.model.eval()
  1. Performing inference on an image:
import cv2
import torch

# Load and preprocess image
img = cv2.imread('image.jpg')
img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)
img = torch.from_numpy(img).permute(2, 0, 1).float().div(255.0).unsqueeze(0)

# Run inference
with torch.no_grad():
    pred = model(img)
    pred = non_max_suppression(pred, conf_thres=0.25, iou_thres=0.45, classes=None, agnostic=False, max_det=1000)
  1. Visualizing detection results:
from yolov6.utils.plotting import plot_box_and_label

# Draw bounding boxes on the image
for i, det in enumerate(pred):
    if len(det):
        for *xyxy, conf, cls in reversed(det):
            plot_box_and_label(img[0], max(round(sum(img[0].shape) / 2 * 0.003), 2), xyxy, label=f'{names[int(cls)]} {conf:.2f}')

# Display the result
cv2.imshow('YOLOv6 Detection', img[0].permute(1, 2, 0).numpy())
cv2.waitKey(0)

Getting Started

  1. Clone the repository:

    git clone https://github.com/meituan/YOLOv6.git
    cd YOLOv6
    
  2. Install dependencies:

    pip install -r requirements.txt
    
  3. Download a pre-trained model:

    wget https://github.com/meituan/YOLOv6/releases/download/0.3.0/yolov6s.pt
    
  4. Run inference on an image:

    python tools/infer.py --weights yolov6s.pt --source path/to/image.jpg
    

Competitor Comparisons

51,450

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Pros of YOLOv5

  • More established and widely adopted in the community
  • Extensive documentation and tutorials available
  • Broader range of pre-trained models for various tasks

Cons of YOLOv5

  • Slightly lower inference speed compared to YOLOv6
  • May require more computational resources for training

Code Comparison

YOLOv5:

from yolov5 import YOLOv5

model = YOLOv5('yolov5s.pt')
results = model('image.jpg')

YOLOv6:

from yolov6.core.inferer import Inferer

inferer = Inferer(model='yolov6s.pt', device='cpu')
results = inferer.infer('image.jpg')

Both repositories offer similar ease of use for inference, with YOLOv5 having a slightly more straightforward API. YOLOv5 provides a more comprehensive ecosystem with additional tools and utilities, while YOLOv6 focuses on improved speed and efficiency.

YOLOv5 is better suited for users who prioritize community support and a wide range of pre-trained models. YOLOv6 may be preferred by those seeking cutting-edge performance and faster inference times, especially on edge devices or in real-time applications.

13,305

Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

Pros of YOLOv7

  • Higher accuracy and performance on benchmark datasets
  • More extensive documentation and community support
  • Includes additional features like instance segmentation

Cons of YOLOv7

  • Larger model size, potentially slower inference
  • More complex architecture, which may be harder to understand and modify
  • Less focus on mobile and edge device deployment

Code Comparison

YOLOv7:

class YOLOv7(nn.Module):
    def __init__(self, nc=80):
        super().__init__()
        self.model = parse_model(yaml_file, ch=[3])
        self.nc = nc

YOLOv6:

class YOLOv6(nn.Module):
    def __init__(self, config, channels=3, num_classes=None, anchors=None):
        super().__init__()
        self.backbone = build_backbone(config, channels, False)
        self.neck = build_neck(config)

The code snippets show that YOLOv7 uses a YAML file for model configuration, while YOLOv6 uses a more modular approach with separate backbone and neck components. YOLOv7's implementation may be more flexible, but YOLOv6's structure could be easier to customize for specific use cases.

Both repositories offer state-of-the-art object detection capabilities, with YOLOv7 focusing on high performance and additional features, while YOLOv6 emphasizes efficiency and mobile deployment. The choice between them depends on specific project requirements and hardware constraints.

21,700

YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )

Pros of darknet

  • More established and widely used, with a larger community and extensive documentation
  • Supports a broader range of YOLO versions (YOLOv3, YOLOv4, etc.)
  • Offers pre-trained models for various tasks and datasets

Cons of darknet

  • Written in C, which may be less accessible for some developers compared to YOLOv6's Python implementation
  • Generally slower inference speed compared to YOLOv6's optimized architecture
  • Less focus on mobile and edge device deployment

Code Comparison

darknet (C):

layer make_yolo_layer(int batch, int w, int h, int n, int total, int *mask, int classes)
{
    int i;
    layer l = {0};
    l.type = YOLO;

    l.n = n;
    l.total = total;
    l.batch = batch;
    l.h = h;
    l.w = w;
    l.c = n*(classes + 4 + 1);
    l.out_w = l.w;
    l.out_h = l.h;
    l.out_c = l.c;
    l.classes = classes;
    l.cost = calloc(1, sizeof(float));
    l.biases = calloc(total*2, sizeof(float));
    if(mask) l.mask = mask;
    else{
        l.mask = calloc(n, sizeof(int));
        for(i = 0; i < n; ++i){
            l.mask[i] = i;
        }
    }
    l.bias_updates = calloc(n*2, sizeof(float));
    l.outputs = h*w*n*(classes + 4 + 1);
    l.inputs = l.outputs;
    l.truths = 90*(4 + 1);
    l.delta = calloc(batch*l.outputs, sizeof(float));
    l.output = calloc(batch*l.outputs, sizeof(float));

    l.forward = forward_yolo_layer;
    l.backward = backward_yolo_layer;
#ifdef GPU
    l.forward_gpu = forward_yolo_layer_gpu;
    l.backward_gpu = backward_yolo_layer_gpu;
    l.output_gpu = cuda_make_array(l.output, batch*l.outputs);
    l.delta_gpu = cuda_make_array(l.delta, batch*l.outputs);
#endif

    fprintf(stderr, "yolo\n");
    srand(0);

    return l;
}

YOLOv6 (Python):

class YOLOv6(nn.Module):
    def __init__(self, config):
        super().__init__()
        self.backbone = build_backbone(config)
        self.neck = build_neck(config)
        self.head = build_head(config)

    def forward(self, x):
        x = self.backbone(x)
        x = self.neck(x)
        x = self.head(x)
        return x

OpenMMLab Detection Toolbox and Benchmark

Pros of mmdetection

  • Extensive model zoo with a wide variety of pre-trained models
  • Highly modular and customizable architecture
  • Comprehensive documentation and active community support

Cons of mmdetection

  • Steeper learning curve due to its complexity
  • Potentially slower inference speed compared to YOLOv6
  • Larger codebase and higher resource requirements

Code Comparison

mmdetection:

from mmdet.apis import init_detector, inference_detector

config_file = 'configs/faster_rcnn/faster_rcnn_r50_fpn_1x_coco.py'
checkpoint_file = 'checkpoints/faster_rcnn_r50_fpn_1x_coco_20200130-047c8118.pth'
model = init_detector(config_file, checkpoint_file, device='cuda:0')
result = inference_detector(model, 'test.jpg')

YOLOv6:

from yolov6.utils.events import LOGGER
from yolov6.core.inferer import Inferer

model = Inferer('yolov6s.pt', device='cuda:0')
img = 'test.jpg'
result = model.infer(img, conf_thres=0.25, iou_thres=0.45, classes=None)

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Pros of Detectron2

  • More comprehensive and flexible framework for object detection, segmentation, and other CV tasks
  • Extensive documentation and community support
  • Built on PyTorch, offering easier customization and integration with other PyTorch models

Cons of Detectron2

  • Steeper learning curve due to its complexity and extensive features
  • Generally slower inference speed compared to YOLOv6
  • Requires more computational resources for training and inference

Code Comparison

Detectron2:

from detectron2.engine import DefaultPredictor
from detectron2.config import get_cfg

cfg = get_cfg()
cfg.merge_from_file(model_zoo.get_config_file("COCO-Detection/faster_rcnn_R_50_FPN_3x.yaml"))
predictor = DefaultPredictor(cfg)
outputs = predictor(image)

YOLOv6:

from yolov6.utils.events import LOGGER, load_yaml
from yolov6.core.inferer import Inferer

cfg = load_yaml('configs/yolov6s.py')
inferer = Inferer(cfg, img_size=640)
results = inferer.infer(img_path)
1,992

implementation of paper - You Only Learn One Representation: Unified Network for Multiple Tasks (https://arxiv.org/abs/2105.04206)

Pros of YOLOR

  • Implements a unified network for object detection, offering potential performance improvements
  • Provides pre-trained models for various tasks, including object detection and instance segmentation
  • Supports multiple backbones and offers flexibility in model architecture

Cons of YOLOR

  • Less frequent updates and maintenance compared to YOLOv6
  • May have higher computational requirements due to its unified network approach
  • Documentation could be more comprehensive for easier implementation and customization

Code Comparison

YOLOv6:

from yolov6.core.evaler import Evaler
from yolov6.utils.config import Config

cfg = Config.fromfile('configs/yolov6s.py')
evaler = Evaler(cfg, img_size=640)
evaler.eval()

YOLOR:

from yolor.models.models import *
from yolor.utils.datasets import *
from yolor.utils.utils import *

model = Darknet('cfg/yolor_p6.cfg', img_size=640)
model.load_state_dict(torch.load('yolor_p6.pt')['model'])

Both repositories offer YOLO-based object detection models, but YOLOv6 focuses on efficiency and deployment optimization, while YOLOR emphasizes a unified network approach. YOLOv6 provides more recent updates and extensive documentation, making it potentially easier to implement and customize for specific use cases.

Convert Figma logo designs to code with AI

Visual Copilot

Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.

Try Visual Copilot

README

English | 简体中文


Open In Colab Open In Kaggle

YOLOv6

Implementation of paper:

What's New

  • [2023.09.15] Release YOLOv6-Segmentation. 🚀 Performance
  • [2023.04.28] Release YOLOv6Lite models on mobile or CPU. ⭐️ Mobile Benchmark
  • [2023.03.10] Release YOLOv6-Face. 🔥 Performance
  • [2023.03.02] Update base models to version 3.0.
  • [2023.01.06] Release P6 models and enhance the performance of P5 models. ⭐️ Benchmark
  • [2022.11.04] Release base models to simplify the training and deployment process.
  • [2022.09.06] Customized quantization methods. 🚀 Quantization Tutorial
  • [2022.09.05] Release M/L models and update N/T/S models with enhanced performance.
  • [2022.06.23] Release N/T/S models with excellent performance.

Benchmark

ModelSizemAPval
0.5:0.95
SpeedT4
trt fp16 b1
(fps)
SpeedT4
trt fp16 b32
(fps)
Params
(M)
FLOPs
(G)
YOLOv6-N64037.577911874.711.4
YOLOv6-S64045.033948418.545.3
YOLOv6-M64050.017522634.985.8
YOLOv6-L64052.89811659.6150.7
YOLOv6-N6128044.922828110.449.8
YOLOv6-S6128050.39810841.4198.0
YOLOv6-M6128055.2475579.6379.5
YOLOv6-L6128057.22629140.4673.4
Table Notes
  • All checkpoints are trained with self-distillation except for YOLOv6-N6/S6 models trained to 300 epochs without distillation.
  • Results of the mAP and speed are evaluated on COCO val2017 dataset with the input resolution of 640×640 for P5 models and 1280x1280 for P6 models.
  • Speed is tested with TensorRT 7.2 on T4.
  • Refer to Test speed tutorial to reproduce the speed results of YOLOv6.
  • Params and FLOPs of YOLOv6 are estimated on deployed models.
Legacy models
ModelSizemAPval
0.5:0.95
SpeedT4
trt fp16 b1
(fps)
SpeedT4
trt fp16 b32
(fps)
Params
(M)
FLOPs
(G)
YOLOv6-N64035.9300e
36.3400e
80212344.311.1
YOLOv6-T64040.3300e
41.1400e
44965915.036.7
YOLOv6-S64043.5300e
43.8400e
35849517.244.2
YOLOv6-M64049.517923334.382.2
YOLOv6-L-ReLU64051.711314958.5144.0
YOLOv6-L64052.59812158.5144.0
  • Speed is tested with TensorRT 7.2 on T4.

Quantized model 🚀

ModelSizePrecisionmAPval
0.5:0.95
SpeedT4
trt b1
(fps)
SpeedT4
trt b32
(fps)
YOLOv6-N RepOpt640INT834.811141828
YOLOv6-N640FP1635.98021234
YOLOv6-T RepOpt640INT839.87411167
YOLOv6-T640FP1640.3449659
YOLOv6-S RepOpt640INT843.3619924
YOLOv6-S640FP1643.5377541
  • Speed is tested with TensorRT 8.4 on T4.
  • Precision is figured on models for 300 epochs.

Mobile Benchmark

ModelSizemAPval
0.5:0.95
sm8350
(ms)
mt6853
(ms)
sdm660
(ms)
Params
(M)
FLOPs
(G)
YOLOv6Lite-S320*32022.47.9911.9941.860.550.56
YOLOv6Lite-M320*32025.19.0813.2747.950.790.67
YOLOv6Lite-L320*32028.011.3716.2061.401.090.87
YOLOv6Lite-L320*19225.07.029.6636.131.090.52
YOLOv6Lite-L224*12818.93.634.9917.761.090.24
Table Notes
  • From the perspective of model size and input image ratio, we have built a series of models on the mobile terminal to facilitate flexible applications in different scenarios.
  • All checkpoints are trained with 400 epochs without distillation.
  • Results of the mAP and speed are evaluated on COCO val2017 dataset, and the input resolution is the Size in the table.
  • Speed is tested on MNN 2.3.0 AArch64 with 2 threads by arm82 acceleration. The inference warm-up is performed 10 times, and the cycle is performed 100 times.
  • Qualcomm 888(sm8350), Dimensity 720(mt6853) and Qualcomm 660(sdm660) correspond to chips with different performances at the high, middle and low end respectively, which can be used as a reference for model capabilities under different chips.
  • Refer to Test NCNN Speed tutorial to reproduce the NCNN speed results of YOLOv6Lite.

Quick Start

Install
git clone https://github.com/meituan/YOLOv6
cd YOLOv6
pip install -r requirements.txt
Reproduce our results on COCO

Please refer to Train COCO Dataset.

Finetune on custom data

Single GPU

# P5 models
python tools/train.py --batch 32 --conf configs/yolov6s_finetune.py --data data/dataset.yaml --fuse_ab --device 0
# P6 models
python tools/train.py --batch 32 --conf configs/yolov6s6_finetune.py --data data/dataset.yaml --img 1280 --device 0

Multi GPUs (DDP mode recommended)

# P5 models
python -m torch.distributed.launch --nproc_per_node 8 tools/train.py --batch 256 --conf configs/yolov6s_finetune.py --data data/dataset.yaml --fuse_ab --device 0,1,2,3,4,5,6,7
# P6 models
python -m torch.distributed.launch --nproc_per_node 8 tools/train.py --batch 128 --conf configs/yolov6s6_finetune.py --data data/dataset.yaml --img 1280 --device 0,1,2,3,4,5,6,7
  • fuse_ab: add anchor-based auxiliary branch and use Anchor Aided Training Mode (Not supported on P6 models currently)
  • conf: select config file to specify network/optimizer/hyperparameters. We recommend to apply yolov6n/s/m/l_finetune.py when training on your custom dataset.
  • data: prepare dataset and specify dataset paths in data.yaml ( COCO, YOLO format coco labels )
  • make sure your dataset structure as follows:
├── coco
│   ├── annotations
│   │   ├── instances_train2017.json
│   │   └── instances_val2017.json
│   ├── images
│   │   ├── train2017
│   │   └── val2017
│   ├── labels
│   │   ├── train2017
│   │   ├── val2017
│   ├── LICENSE
│   ├── README.txt

YOLOv6 supports different input resolution modes. For details, see How to Set the Input Size.

Resume training

If your training process is corrupted, you can resume training by

# single GPU training.
python tools/train.py --resume

# multi GPU training.
python -m torch.distributed.launch --nproc_per_node 8 tools/train.py --resume

Above command will automatically find the latest checkpoint in YOLOv6 directory, then resume the training process.

Your can also specify a checkpoint path to --resume parameter by

# remember to replace /path/to/your/checkpoint/path to the checkpoint path which you want to resume training.
--resume /path/to/your/checkpoint/path

This will resume from the specific checkpoint you provide.

Evaluation

Reproduce mAP on COCO val2017 dataset with 640×640 or 1280x1280 resolution

# P5 models
python tools/eval.py --data data/coco.yaml --batch 32 --weights yolov6s.pt --task val --reproduce_640_eval
# P6 models
python tools/eval.py --data data/coco.yaml --batch 32 --weights yolov6s6.pt --task val --reproduce_640_eval --img 1280
  • verbose: set True to print mAP of each classes.
  • do_coco_metric: set True / False to enable / disable pycocotools evaluation method.
  • do_pr_metric: set True / False to print or not to print the precision and recall metrics.
  • config-file: specify a config file to define all the eval params, for example: yolov6n_with_eval_params.py
Inference

First, download a pretrained model from the YOLOv6 release or use your trained model to do inference.

Second, run inference with tools/infer.py

# P5 models
python tools/infer.py --weights yolov6s.pt --source img.jpg / imgdir / video.mp4
# P6 models
python tools/infer.py --weights yolov6s6.pt --img 1280 1280 --source img.jpg / imgdir / video.mp4

If you want to inference on local camera or web camera, you can run:

# P5 models
python tools/infer.py --weights yolov6s.pt --webcam --webcam-addr 0
# P6 models
python tools/infer.py --weights yolov6s6.pt --img 1280 1280 --webcam --webcam-addr 0

webcam-addr can be local camera number id or rtsp address.

Deployment
Tutorials
Third-party resources

FAQ(Continuously updated)

If you have any questions, welcome to join our WeChat group to discuss and exchange.