HyperAI
HyperAI超神経
ホーム
プラットフォーム
ドキュメント
ニュース
論文
チュートリアル
データセット
百科事典
SOTA
LLMモデル
GPU ランキング
学会
検索
サイトについて
利用規約
プライバシーポリシー
日本語
HyperAI
HyperAI超神経
Toggle Sidebar
サイトを検索…
⌘
K
Command Palette
Search for a command to run...
プラットフォーム
ホーム
SOTA
オブジェクト検出
Object Detection On Coco 2017
Object Detection On Coco 2017
評価指標
mAP
評価結果
このベンチマークにおける各モデルのパフォーマンス結果
Columns
モデル名
mAP
Paper Title
UniRepLKNet-XL++
56.4
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
UniRepLKNet-L++
55.8
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
UniRepLKNet-B++
54.8
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
UniRepLKNet-S++
54.3
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
MixMIM-L
54.1
MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers
UniRepLKNet-S
53
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
MixMIM-B
52.2
MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers
UniRepLKNet-T
51.7
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
BiFormer-B (IN1k pretrain, MaskRCNN 12ep)
48.6
BiFormer: Vision Transformer with Bi-Level Routing Attention
DeBiFormer-B (IN1k pretrain, MaskRCNN 12ep)
48.5
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention
BiFormer-S (IN1k pretrain, MaskRCNN 12ep)
47.8
BiFormer: Vision Transformer with Bi-Level Routing Attention
DeBiFormer-S (IN1k pretrain, MaskRCNN 12ep)
47.5
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention
DeBiFormer-B (IN1k pretrain, Retina)
47.1
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention
DeBiFormer-S (IN1k pretrain, Retina)
45.6
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention
YOLO-Drone
35.45
YOLO-Drone:Airborne real-time detection of dense small objects from high-altitude perspective
DyHead (SAP)
-
Stochastic Subsampling With Average Pooling
Lpixel
-
Paint Transformer: Feed Forward Neural Painting with Stroke Prediction
MaxViT-T
-
MaxViT: Multi-Axis Vision Transformer
DAT-T++
-
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention
MaxViT-S
-
MaxViT: Multi-Axis Vision Transformer
0 of 24 row(s) selected.
Previous
Next
Object Detection On Coco 2017 | SOTA | HyperAI超神経