HyperAI

Image Generation On Textatlaseval

المقاييس

StyledTextSynth Clip Score
StyledTextSynth FID
StyledTextSynth OCR (Accuracy)
StyledTextSynth OCR (Cer)
StyledTextSynth OCR (F1 Score)
TextScenesHQ Clip Score
TextScenesHQ FID
TextScenesHQ OCR (Accuracy)
TextScenesHQ OCR (Cer)
TextScenesHQ OCR (F1 Score)
TextVisionBlend Clip Score
TextVisionBlend FID
TextVisionBlend OCR (Accuracy)
TextVisionBlend OCR (Cer)
TextVsionBlend OCR (F1 Score)

النتائج

نتائج أداء النماذج المختلفة على هذا المعيار القياسي

جدول المقارنة
اسم النموذجStyledTextSynth Clip ScoreStyledTextSynth FIDStyledTextSynth OCR (Accuracy)StyledTextSynth OCR (Cer)StyledTextSynth OCR (F1 Score)TextScenesHQ Clip ScoreTextScenesHQ FIDTextScenesHQ OCR (Accuracy)TextScenesHQ OCR (Cer)TextScenesHQ OCR (F1 Score)TextVisionBlend Clip ScoreTextVisionBlend FIDTextVisionBlend OCR (Accuracy)TextVisionBlend OCR (Cer)TextVsionBlend OCR (F1 Score)
infinity-mm-scaling-multimodal-performance0.272784.950.800.931.420.234671.591.060.881.740.197995.692.980.833.44
النموذج 20.293890.7030.580.7838.250.336786.7369.26-51.630.1938153.218.380.937.94
النموذج 30.284971.0927.210.7333.860.236364.4419.030.7324.450.1846118.8514.550.8816.25
النموذج 40.293880.3315.820.7321.400.3197-35.070.5737.940.1697-41.540.5744.22
pixart-s-weak-to-strong-training-of-diffusion0.276482.830.420.900.620.234772.620.340.910.530.189181.292.400.831.57
textdiffuser-2-unleashing-the-power-of0.2510114.310.760.991.460.225284.100.660.961.25-----
anytext-multilingual-visual-text-generation0.2501117.710.350.980.660.2174101.320.420.950.8-----