Multimodal Contrastive Learning With Joint Example Selection (JEST)

Date

a year ago

Multimodal Contrastive Learning with Joint Example Selection (JEST) is an algorithm proposed by the DeepMind research team in 2024. It was introduced in the paper "Data curation via joint example selection further accelerates multimodal learning". JEST aims to address the high energy consumption of training large language models (such as ChatGPT). It significantly reduces the computing resources and time required by selecting high-quality sub-batches from large-scale "super-batches" for training.

The core idea of the JEST algorithm is to combine multimodal contrastive learning with joint example selection. It first scores the learnability of a sub-batch as a whole, rather than of individual examples, then samples according to that score to select the sub-batch most useful for learning. This improves training efficiency and speeds up multimodal learning. With filtering ratios of 50%, 80%, and 90%, JEST needs only 2 billion, 1 billion, and 670 million training examples, respectively, to match the final performance of the uniform-sampling baseline trained on 3 billion examples, which is roughly 1.5, 3, and 4.5 times fewer examples.
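The selection step can be illustrated with a small NumPy sketch. This is a simplified illustration under stated assumptions, not the paper's exact procedure: it assumes a pairwise sigmoid contrastive loss (so the batch loss decomposes into per-pair terms), takes "learnability" to mean the learner's loss minus the loss of a separate reference model, and uses random embeddings as stand-ins for real model outputs. The function names `pairwise_sigmoid_loss` and `select_sub_batch` are hypothetical.

```python
import numpy as np

def pairwise_sigmoid_loss(img, txt, temperature=1.0):
    """Loss matrix whose entry [i, j] is the sigmoid loss for image i vs. text j."""
    logits = img @ txt.T / temperature
    labels = 2.0 * np.eye(len(img)) - 1.0          # +1 for matching pairs, -1 otherwise
    return np.logaddexp(0.0, -labels * logits)     # equals -log(sigmoid(label * logit))

def select_sub_batch(scores, keep, n_chunks, rng):
    """Sample `keep` indices in chunks, conditioning each chunk on earlier picks."""
    n = scores.shape[0]
    chunk = keep // n_chunks
    diag = np.diag(scores)
    selected = np.zeros(n, dtype=bool)
    for _ in range(n_chunks):
        # Score of a candidate: its own pair term plus its interaction terms
        # with every example that is already in the sub-batch.
        cond = diag + scores[:, selected].sum(axis=1) + scores[selected, :].sum(axis=0)
        cond[selected] = -np.inf                   # never pick an example twice
        probs = np.exp(cond - cond[~selected].max())
        probs /= probs.sum()
        new = rng.choice(n, size=chunk, replace=False, p=probs)
        selected[new] = True
    return np.flatnonzero(selected)

def unit_rows(x):
    """Normalize each row to unit length."""
    return x / np.linalg.norm(x, axis=1, keepdims=True)

rng = np.random.default_rng(0)
super_batch, dim = 80, 8

# Stand-in embeddings for the learner and the reference model (assumed, random).
learner_img, learner_txt = (unit_rows(rng.normal(size=(super_batch, dim))) for _ in range(2))
ref_img, ref_txt = (unit_rows(rng.normal(size=(super_batch, dim))) for _ in range(2))

# Learnability: high where the learner still struggles but the reference does not.
scores = pairwise_sigmoid_loss(learner_img, learner_txt) - pairwise_sigmoid_loss(ref_img, ref_txt)

# Keep 16 of 80 examples, i.e. a filtering ratio of 80%.
sub_batch = select_sub_batch(scores, keep=16, n_chunks=4, rng=rng)
print(sub_batch)
```

Sampling in chunks is what makes the selection joint in this sketch: each later chunk is drawn with scores that depend on the examples already chosen, so the sub-batch is scored as a set rather than as independent examples.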

In addition, JEST exploits the synergy between multi-resolution training and online batch selection, further reducing computational cost.
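The text does not spell out where the saving comes from. One plausible reading, offered here as an assumption, is that a patch-based image encoder sees fewer tokens at a coarser resolution, so scoring the super-batch coarsely costs less than scoring it at full resolution. The sketch below works through that token arithmetic. The image size, patch sizes, and the rough assumption that cost scales linearly with token count are all illustrative, not values from the source.

```python
def num_patches(image_size: int, patch_size: int) -> int:
    """Number of tokens a patch-based encoder sees for a square image."""
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    return (image_size // patch_size) ** 2

def relative_cost(image_size: int, fine_patch: int, coarse_patch: int) -> float:
    """Coarse-to-fine token ratio, used as a rough proxy for relative compute."""
    return num_patches(image_size, coarse_patch) / num_patches(image_size, fine_patch)

# Hypothetical setting: 224-pixel images, patch size 16 (fine) vs. 32 (coarse).
fine_tokens = num_patches(224, 16)
coarse_tokens = num_patches(224, 32)
ratio = relative_cost(224, 16, 32)
print(fine_tokens, coarse_tokens, ratio)
```

Doubling the patch size quarters the token count, so under the linear-cost assumption the scoring pass over the super-batch would cost about a quarter as much per image.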


