HyperAI

Text To Image Generation On Coco

Métriques

FID
Inception score

Résultats

Résultats de performance de divers modèles sur ce benchmark

Tableau comparatif
Nom du modèleFIDInception score
improving-text-to-image-synthesis-using23.9325.70
fusedream-training-free-text-to-image21.1634.26
shifted-diffusion-for-text-to-image10.6-
retrieval-augmented-multimodal-language29.5-
stackgan-realistic-image-synthesis-with74.058.45
stylegan-t-unlocking-the-power-of-gans-for13.9-
re-imagen-retrieval-augmented-text-to-image5.25-
gligen-open-set-grounded-text-to-image5.82-
nuwa-visual-synthesis-pre-training-for-neural-18.7
nuwa-visual-synthesis-pre-training-for-neural27.517.9
lafite-towards-language-free-training-for8.1232.34
make-a-scene-scene-based-text-to-image11.84-
hierarchical-text-conditional-image10.39-
photorealistic-text-to-image-diffusion-models7.27-
vector-quantized-diffusion-model-for-text-to13.86-
data-extrapolation-for-text-to-image5.00-
lafite-towards-language-free-training-for26.9426.02
ediffi-text-to-image-diffusion-models-with-an6.95-
l-verse-bidirectional-generation-between45.8-
gligen-open-set-grounded-text-to-image6.38-
raphael-text-to-image-generation-via-large6.61-
knn-diffusion-image-generation-via-large12.5-
cogview2-faster-and-better-text-to-image17.7-
cogview-mastering-text-to-image-generation27.118.2
victr-visual-information-captured-text-10.38
nuwa-visual-synthesis-pre-training-for-neural12.9 27.2
fusedream-training-free-text-to-image21.8934.67
ernie-vilg-2-0-improving-text-to-image6.75-
victr-visual-information-captured-text32.3732.37
generating-multiple-objects-at-spatially55.3012.12
all-are-worth-words-a-vit-backbone-for-score5.95-
re-imagen-retrieval-augmented-text-to-image6.88-
dm-gan-dynamic-memory-generative-adversarial32.6430.49
vector-quantized-diffusion-model-for-text-to19.75-
nuwa-visual-synthesis-pre-training-for-neural 26.032.2
kandinsky-an-improved-text-to-image-synthesis8.03-
ernie-vilg-unified-generative-pre-training14.7-
retrieval-augmented-multimodal-language15.7-
scaling-up-gans-for-text-to-image-synthesis9.09-
retrieval-augmented-multimodal-language28-
victr-visual-information-captured-text29.2628.18
scaling-up-gans-for-text-to-image-synthesis7.28-
l-verse-bidirectional-generation-between37.2-
nuwa-visual-synthesis-pre-training-for-neural 35.223.3
tr0n-translator-networks-for-0-shot-plug-and10.9-
stylegan-t-unlocking-the-power-of-gans-for7.3-
improving-text-to-image-synthesis-using20.7933.34
retrieval-augmented-multimodal-language12.63-
swinv2-imagen-hierarchical-vision-transformer7.2131.46
fusedream-training-free-text-to-image21.1634.26
chatpainter-improving-text-to-image-9.74
shifted-diffusion-for-text-to-image10.88-
nuwa-visual-synthesis-pre-training-for-neural 27.118.2
improving-diffusion-based-image-synthesis-16.21-
19101332124.7027.88
all-are-worth-words-a-vit-backbone-for-score5.48-
long-and-short-guidance-in-score-identity8.15-
cross-modal-contrastive-learning-for-text-to9.33-
galip-generative-adversarial-clips-for-text12.54-
make-a-scene-scene-based-text-to-image7.55-
cogview2-faster-and-better-text-to-image24-
recurrent-affine-transformation-for-text-to14.6-
simple-diffusion-end-to-end-diffusion-for8.3-
truncated-diffusion-probabilistic-models6.29-
glide-towards-photorealistic-image-generation12.24-
generating-multiple-objects-at-spatially33.3524.76
nuwa-visual-synthesis-pre-training-for-neural9.330.5
high-resolution-image-synthesis-with-latent12.63-
gligen-open-set-grounded-text-to-image5.61-