Date

3 years ago

A modality is a specific channel through which people receive information, such as text, images, or sound. Because multimedia data often carries several types of information at once (a video, for example, typically conveys text, visual, and auditory information simultaneously), multimodal learning has gradually become the main approach to multimedia content analysis and understanding.

Multimodal learning mainly includes the following research directions:

Multimodal representation learning: studies how to encode the semantic information contained in data from multiple modalities as real-valued vectors.
Inter-modal mapping: studies how to map the information in one modality's data to another modality.
Alignment: studies how to identify correspondences between the components and elements of different modalities.
Fusion: studies how to integrate models and features across different modalities.
Collaborative learning: studies how to transfer knowledge learned in information-rich modalities to information-poor modalities, so that learning in each modality can assist the others. Typical methods include multimodal zero-shot learning and domain adaptation.
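
As a rough illustration of three of these directions, the sketch below treats each modality as an already-computed real-valued vector (representation), matches vectors across modalities by cosine similarity (alignment), and combines modalities either by concatenating features or by averaging per-modality scores (fusion). The function names, toy vectors, and weights are hypothetical and chosen only for illustration; real systems learn these representations with neural networks rather than writing them by hand.

```python
import math


def cosine(u, v):
    """Cosine similarity between two real-valued vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def align(queries, candidates):
    """Alignment sketch: for each query vector, return the index of the most
    similar candidate vector from the other modality."""
    return [
        max(range(len(candidates)), key=lambda j: cosine(q, candidates[j]))
        for q in queries
    ]


def early_fusion(*features):
    """Feature-level fusion sketch: concatenate per-modality feature vectors."""
    fused = []
    for f in features:
        fused.extend(f)
    return fused


def late_fusion(scores, weights):
    """Decision-level fusion sketch: weighted average of per-modality scores."""
    return sum(s * w for s, w in zip(scores, weights)) / sum(weights)


if __name__ == "__main__":
    # Hypothetical embeddings for two captions and two video frames.
    captions = [[1.0, 0.0], [0.0, 1.0]]
    frames = [[0.1, 0.9], [0.8, 0.2]]
    print(align(captions, frames))             # index of the best frame per caption
    print(early_fusion([0.9, 0.1], [0.3]))     # one joint feature vector
    print(late_fusion([0.2, 0.8], [1.0, 3.0])) # combined score from two modalities
```

Concatenation keeps all per-modality features available to a downstream model, while score averaging lets each modality be modeled separately and combined only at decision time; which one fits better depends on the task and the data.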

References

【1】AI Review Column - Review of Multimodal Learning Research Progress (Zhihu)

