HyperAI

Image Generation on TextAtlasEval

Metrics

StyledTextSynth Clip Score
StyledTextSynth FID
StyledTextSynth OCR (Accuracy)
StyledTextSynth OCR (Cer)
StyledTextSynth OCR (F1 Score)
TextScenesHQ Clip Score
TextScenesHQ FID
TextScenesHQ OCR (Accuracy)
TextScenesHQ OCR (Cer)
TextScenesHQ OCR (F1 Score)
TextVisionBlend Clip Score
TextVisionBlend FID
TextVisionBlend OCR (Accuracy)
TextVisionBlend OCR (Cer)
TextVisionBlend OCR (F1 Score)
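
The OCR metrics above compare the text read back from a generated image with the text the prompt asked for. The page does not describe the benchmark's own OCR pipeline or normalization, so the following is only a minimal sketch of how a character error rate (CER) and a word-level F1 score are commonly computed. The function names `levenshtein`, `cer`, and `word_f1` are hypothetical helpers, and lowercasing plus whitespace tokenization are assumptions rather than the benchmark's documented procedure.

```python
from collections import Counter


def levenshtein(a: str, b: str) -> int:
    """Edit distance between two strings (insertions, deletions, substitutions)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: edit distance divided by reference length."""
    if not reference:
        # Assumption: an empty reference scores 0 only if the OCR output is empty too.
        return 0.0 if not hypothesis else 1.0
    return levenshtein(reference, hypothesis) / len(reference)


def word_f1(reference: str, hypothesis: str) -> float:
    """Word-level F1 over lowercased, whitespace-split tokens (assumed tokenization)."""
    ref = Counter(reference.lower().split())
    hyp = Counter(hypothesis.lower().split())
    overlap = sum((ref & hyp).values())  # multiset intersection
    if overlap == 0:
        return 0.0
    precision = overlap / sum(hyp.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    target = "grand opening sale"   # text requested in the prompt
    ocr_out = "grand sale"          # text an OCR model read from the image
    print(f"CER: {cer(target, ocr_out):.2f}")
    print(f"F1:  {word_f1(target, ocr_out):.2f}")
```

Lower CER is better, while higher accuracy and F1 are better. In the table on this page, the accuracy and F1 values appear to be percentages and the CER values appear to be fractions, so a score from this sketch would need the matching scale before being compared with them.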

Results

Performance results of various models on this benchmark

Comparison table
| Model name | StyledTextSynth Clip Score | StyledTextSynth FID | StyledTextSynth OCR (Accuracy) | StyledTextSynth OCR (Cer) | StyledTextSynth OCR (F1 Score) | TextScenesHQ Clip Score | TextScenesHQ FID | TextScenesHQ OCR (Accuracy) | TextScenesHQ OCR (Cer) | TextScenesHQ OCR (F1 Score) | TextVisionBlend Clip Score | TextVisionBlend FID | TextVisionBlend OCR (Accuracy) | TextVisionBlend OCR (Cer) | TextVisionBlend OCR (F1 Score) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| infinity-mm-scaling-multimodal-performance | 0.2727 | 84.95 | 0.80 | 0.93 | 1.42 | 0.2346 | 71.59 | 1.06 | 0.88 | 1.74 | 0.1979 | 95.69 | 2.98 | 0.83 | 3.44 |
| Model 2 | 0.2938 | 90.70 | 30.58 | 0.78 | 38.25 | 0.3367 | 86.73 | 69.26 | - | 51.63 | 0.1938 | 153.21 | 8.38 | 0.93 | 7.94 |
| Model 3 | 0.2849 | 71.09 | 27.21 | 0.73 | 33.86 | 0.2363 | 64.44 | 19.03 | 0.73 | 24.45 | 0.1846 | 118.85 | 14.55 | 0.88 | 16.25 |
| Model 4 | 0.2938 | 80.33 | 15.82 | 0.73 | 21.40 | 0.3197 | - | 35.07 | 0.57 | 37.94 | 0.1697 | - | 41.54 | 0.57 | 44.22 |
| pixart-s-weak-to-strong-training-of-diffusion | 0.2764 | 82.83 | 0.42 | 0.90 | 0.62 | 0.2347 | 72.62 | 0.34 | 0.91 | 0.53 | 0.1891 | 81.29 | 2.40 | 0.83 | 1.57 |
| textdiffuser-2-unleashing-the-power-of | 0.2510 | 114.31 | 0.76 | 0.99 | 1.46 | 0.2252 | 84.10 | 0.66 | 0.96 | 1.25 | - | - | - | - | - |
| anytext-multilingual-visual-text-generation | 0.2501 | 117.71 | 0.35 | 0.98 | 0.66 | 0.2174 | 101.32 | 0.42 | 0.95 | 0.8 | - | - | - | - | - |