Scene Text Recognition On Svtp

评估指标

Accuracy

评测结果

各个模型在此基准测试上的表现结果

模型名称
Accuracy
Paper TitleRepository
CLIP4STR-L97.4CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model-
DPAN89.0Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition
CLIP4STR-B97.2CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model-
CLIP4STR-L (DataComp-1B)98.1CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model-
SIGA_T90.5Self-supervised Implicit Glyph Attention for Text Recognition-
MATRN90.6Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features-
CCD-ViT-Base96.1Self-supervised Character-to-Character Distillation for Text Recognition-
CCD-ViT-Small92.7Self-supervised Character-to-Character Distillation for Text Recognition-
CCD-ViT-Tiny91.6Self-supervised Character-to-Character Distillation for Text Recognition-
DTrOCR 105M98.6DTrOCR: Decoder-only Transformer for Optical Character Recognition-
CDistNet (Ours)89.77CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition-
PARSeq95.7±0.9Scene Text Recognition with Permuted Autoregressive Sequence Models-
CPPD96.7Context Perception Parallel Decoder for Scene Text Recognition-
DiffusionSTR89.2DiffusionSTR: Diffusion Model for Scene Text Recognition-
CLIP4STR-L*98.13An Empirical Study of Scaling Law for OCR-
S-GTR90.6Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition-
MGP-STR98.3Multi-Granularity Prediction for Scene Text Recognition-
0 of 17 row(s) selected.
Scene Text Recognition On Svtp | SOTA | HyperAI超神经