Multimodal Side-Tuning (ResNet50) | 90.30 | Multimodal Side-Tuning for Document Classification | |
Multimodal Side-Tuning (MobileNetV2) | 90.50 | Multimodal Side-Tuning for Document Classification | |
Eff-GNN + Word2Vec + Image Embedding | 77.5 | Efficient Document Image Classification Using Region-Based Graph Neural Network | - |