41 lines (30 loc) · 1.21 KB

Supported Model Types

Auto-generated from vision-core TaskFactory documentation. Do not edit manually; run python scripts/sync_supported_model_types.py.

Source: /home/oli/repos/vision-inference/build/_deps/vision-core-src/README.md

The TaskFactory supports the following model type strings:

Object Detection:

"yolo", "yolov7e2e", "yolov10", "yolo26", "yolov4" - YOLO-based variants
"yolonas" - YOLO-NAS
"rtdetr" - RT-DETR family (RT-DETR v1, v2, and v4; excludes v3; includes D-FINE and DEIM v1/v2)
"rtdetrul" - RT-DETR (Ultralytics implementation)
"rfdetr" - RF-DETR

Instance Segmentation:

"yoloseg" - YOLOv5/YOLOv8/YOLO11
"yolov10seg"- YOLOv10
"yolo26seg" - YOLO26
"rfdetrseg" - RF-DETR

Classification:

"torchvision-classifier" - Torchvision models (ResNet, EfficientNet, etc.)
"tensorflow-classifier" - TensorFlow/Keras models
"vit-classifier" - Vision Transformers

Video Classification:

"videomae" - VideoMAE
"vivit" - ViViT
"timesformer" - TimeSformer

Optical Flow:

"raft" - RAFT optical flow

Pose Estimation:

"vitpose" - ViTPose

Depth Estimation:

"depth_anything_v2", "depth-anything-v2" - Depth Anything V2