HyperAIHyperAI
2 months ago

Fine-grained Visual Textual Alignment for Cross-Modal Retrieval using Transformer Encoders

Nicola Messina; Giuseppe Amato; Andrea Esuli; Fabrizio Falchi; Claudio Gennaro; Stéphane Marchand-Maillet
Fine-grained Visual Textual Alignment for Cross-Modal Retrieval using Transformer Encoders
Abstract