Image Reconstruction on ImageNet
Evaluation metrics
FID
LPIPS
PSNR
SSIM
Evaluation results
Performance of each model on this benchmark
| Model | FID | LPIPS | PSNR | SSIM | Paper Title | Repository |
|---|---|---|---|---|---|---|
| Taming-VQGAN (16x16) | 3.64 | 0.177 | 19.93 | 0.542 | Taming Transformers for High-Resolution Image Synthesis | |
| Open-Magvit2 (16x16) | 1.17 | - | 21.90 | - | Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation | |
| TiTok-S-128 | 1.71 | - | - | - | An Image is Worth 32 Tokens for Reconstruction and Generation | |
| VQGAN-LC (16x16) | 2.62 | 0.120 | 23.80 | 0.589 | Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99% | |
| OptVQ (16x16x8) | 0.91 | 0.066 | 27.57 | 0.729 | Preventing Local Pitfalls in Vector Quantization via Optimal Transport | |
| MaskBit (16x16) | 1.66 | - | - | - | MaskBit: Embedding-free Image Generation via Bit Tokens | |
| IBQ (16x16) | 1.00 | 0.2030 | - | - | Taming Scalable Visual Tokenizer for Autoregressive Image Generation | - |
| ViT-VQGAN (16x16) | 1.28 | - | - | - | Vector-quantized Image Modeling with Improved VQGAN | |
| MaskGIT-VQGAN (16x16) | 2.28 | - | - | - | MaskGIT: Masked Generative Image Transformer | |
| RQ-VAE (8x8x16) | 1.83 | - | - | - | Autoregressive Image Generation using Residual Quantization | |
| OptVQ (16x16x4) | 1.00 | 0.076 | 26.59 | 0.717 | Preventing Local Pitfalls in Vector Quantization via Optimal Transport | |
| Mo-VQGAN (16x16x4) | 1.12 | 0.113 | 22.42 | 0.673 | MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation | |