HyperAIHyperAI

Command Palette

Search for a command to run...

Cross-Modality Fusion Transformer for Multispectral Object Detection

Fang Qingyun Han Dapeng Wang Zhaokui

Abstract

Multispectral image pairs can provide the combined information, making objectdetection applications more reliable and robust in the open world. To fullyexploit the different modalities, we present a simple yet effectivecross-modality feature fusion approach, named Cross-Modality Fusion Transformer(CFT) in this paper. Unlike prior CNNs-based works, guided by the transformerscheme, our network learns long-range dependencies and integrates globalcontextual information in the feature extraction stage. More importantly, byleveraging the self attention of the transformer, the network can naturallycarry out simultaneous intra-modality and inter-modality fusion, and robustlycapture the latent interactions between RGB and Thermal domains, therebysignificantly improving the performance of multispectral object detection.Extensive experiments and ablation studies on multiple datasets demonstratethat our approach is effective and achieves state-of-the-art detectionperformance. Our code and models are available athttps://github.com/DocF/multispectral-object-detection.


Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing

HyperAI Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp