Dense Captioning On Visual Genome
Evaluation Metric
mAP
Results
Performance of each model on this benchmark
Model | mAP | Paper Title | Repository |
---|---|---|---|
ControlCap | 18.2 | ControlCap: Controllable Region-level Captioning | - |
GRiT (ViT-B) | 15.5 | GRiT: A Generative Region-to-text Transformer for Object Understanding | |
CAG-Net | 10.5 | Context and Attribute Grounded Dense Captioning | - |
FCLN | 5.4 | DenseCap: Fully Convolutional Localization Networks for Dense Captioning | |