HyperAI超神经

Scene Text Recognition On Svtp

评估指标

Accuracy

评测结果

各个模型在此基准测试上的表现结果

模型名称
Accuracy
Paper TitleRepository
CLIP4STR-L97.4CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model-
DPAN89.0Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition
CLIP4STR-B97.2CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model-
CLIP4STR-L (DataComp-1B)98.1CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model-
SIGA_T90.5Self-supervised Implicit Glyph Attention for Text Recognition
MATRN90.6Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features
CCD-ViT-Base96.1Self-supervised Character-to-Character Distillation for Text Recognition-
CCD-ViT-Small92.7Self-supervised Character-to-Character Distillation for Text Recognition-
CCD-ViT-Tiny91.6Self-supervised Character-to-Character Distillation for Text Recognition-
DTrOCR 105M98.6DTrOCR: Decoder-only Transformer for Optical Character Recognition
CDistNet (Ours)89.77CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition
PARSeq95.7±0.9Scene Text Recognition with Permuted Autoregressive Sequence Models
CPPD96.7Context Perception Parallel Decoder for Scene Text Recognition
DiffusionSTR89.2DiffusionSTR: Diffusion Model for Scene Text Recognition-
CLIP4STR-L*98.13An Empirical Study of Scaling Law for OCR
S-GTR90.6Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition
MGP-STR98.3Multi-Granularity Prediction for Scene Text Recognition
0 of 17 row(s) selected.