HyperAI초신경

Scene Text Recognition On Svtp

평가 지표

Accuracy

평가 결과

이 벤치마크에서 각 모델의 성능 결과

모델 이름
Accuracy
Paper TitleRepository
CLIP4STR-L97.4CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model-
DPAN89.0Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition
CLIP4STR-B97.2CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model-
CLIP4STR-L (DataComp-1B)98.1CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model-
SIGA_T90.5Self-supervised Implicit Glyph Attention for Text Recognition
MATRN90.6Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features
CCD-ViT-Base96.1Self-supervised Character-to-Character Distillation for Text Recognition-
CCD-ViT-Small92.7Self-supervised Character-to-Character Distillation for Text Recognition-
CCD-ViT-Tiny91.6Self-supervised Character-to-Character Distillation for Text Recognition-
DTrOCR 105M98.6DTrOCR: Decoder-only Transformer for Optical Character Recognition
CDistNet (Ours)89.77CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition
PARSeq95.7±0.9Scene Text Recognition with Permuted Autoregressive Sequence Models
CPPD96.7Context Perception Parallel Decoder for Scene Text Recognition
DiffusionSTR89.2DiffusionSTR: Diffusion Model for Scene Text Recognition-
CLIP4STR-L*98.13An Empirical Study of Scaling Law for OCR
S-GTR90.6Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition
MGP-STR98.3Multi-Granularity Prediction for Scene Text Recognition
0 of 17 row(s) selected.