HyperAI
HyperAI초신경
홈
플랫폼
문서
뉴스
연구 논문
튜토리얼
데이터셋
백과사전
SOTA
LLM 모델
GPU 랭킹
컨퍼런스
전체 검색
소개
서비스 약관
개인정보 처리방침
한국어
HyperAI
HyperAI초신경
Toggle Sidebar
전체 사이트 검색...
⌘
K
Command Palette
Search for a command to run...
플랫폼
홈
SOTA
장면 텍스트 인식
Scene Text Recognition On Svt
Scene Text Recognition On Svt
평가 지표
Accuracy
평가 결과
이 벤치마크에서 각 모델의 성능 결과
Columns
모델 이름
Accuracy
Paper Title
CLIP4STR-H (DFN-5B)
99.1
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
DTrOCR 105M
98.9
DTrOCR: Decoder-only Transformer for Optical Character Recognition
CLIP4STR-B*
98.76
An Empirical Study of Scaling Law for OCR
CLIP4STR-L (DataComp-1B)
98.6
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
MGP-STR
98.6
Multi-Granularity Prediction for Scene Text Recognition
CLIP4STR-L
98.5
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
CPPD
98.5
Context Perception Parallel Decoder for Scene Text Recognition
CLIP4STR-B
98.3
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
PARSeq
97.9±0.2
Scene Text Recognition with Permuted Autoregressive Sequence Models
CCD-ViT-Base(ARD_2.8M)
97.8
Self-supervised Character-to-Character Distillation for Text Recognition
CCD-ViT-Small(ARD_2.8M)
96.4
Self-supervised Character-to-Character Distillation for Text Recognition
CCD-ViT-Tiny(ARD_2.8M)
96.0
Self-supervised Character-to-Character Distillation for Text Recognition
S-GTR
95.8
Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition
SIGA_T
95.1
Self-supervised Implicit Glyph Attention for Text Recognition
MATRN
95
Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features
Yet Another Text Recognizer
94.7
Why You Should Try the Real Data for the Scene Text Recognition
NRTR+TPS++
94.6
TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition
DPAN
93.9
Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition
CDistNet (Ours)
93.82
CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition
DiffusionSTR
93.6
DiffusionSTR: Diffusion Model for Scene Text Recognition
0 of 37 row(s) selected.
Previous
Next