Date

6 months ago

A Multi-dimensional Data Selection Method for Pre-training Language Models (Meta-rater) was proposed by Shanghai Artificial Intelligence Laboratory and East China Normal University on June 4, 2025. It aims to integrate the four dimensions of professionalism, readability, reasoning, and cleanliness with existing quality indicators by learning optimal weights.Meta-rater: A Multi-dimensional Data Selection Method for Pre-training Language Models", which won the ACL 25 Best Theme Paper Award.

Meta-rater uses a surrogate model to train a regression model and predict the validation set loss, thereby identifying the optimal quality score combination. Experimental results show that Meta-rater can triple the convergence speed of a 1.3 billion parameter model and improve downstream task performance by 3.23%. This advantage is scalable to a 7.2 billion parameter model.

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Date

6 months ago

Related Wiki

MultiPL-MoE Architecture

MultiPL-MoE is an effective method for extending low-source programming languages in the post-pre-training stage.

2 months ago

Group Variance Strategy Optimization GVPO

Given the limitations of existing fine-tuning techniques such as GRPO, GVPO has emerged as a reliable and versatile post-training paradigm.

3 months ago

Fully Homomorphic Encryption (FHE)

FHE is widely used in scenarios such as cloud computing security, federated learning, medical data analysis, and financial data collaboration.

3 months ago

Cache-to-Cache (C2C)

C2C enables direct semantic communication by transforming and fusing key-value (KV) caches between models.

2 months ago

Gated Attention

The Tongyi Qianwen team systematically studied the role of gating mechanisms in standard softmax attention.

2 months ago

Agentic Context Engineering

ACE enables agents to improve themselves by dynamically optimizing the input context.

3 months ago

Guess – Think – Answer

GTA significantly outperforms standard SFT baselines and state-of-the-art RL methods in multiple text classification benchmarks.

3 months ago

Exponential-Gaussian Mixture Network EGMN

EGMN successfully captured the potential interaction effects between user preferences and video features.

3 months ago

RewardMap, a multi-stage Reinforcement Learning Framework

RewardMap enhances the capabilities of multimodal large language models in structured vision tasks.

2 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

Multi-dimensional pre-training Data Screening Framework Meta-rater

Build AI with AI

HyperAI Newsletters

Command Palette

Multi-dimensional pre-training Data Screening Framework Meta-rater

Related Wiki

MultiPL-MoE Architecture

Group Variance Strategy Optimization GVPO

Fully Homomorphic Encryption (FHE)

Cache-to-Cache (C2C)

Gated Attention

Agentic Context Engineering

Guess – Think – Answer

Exponential-Gaussian Mixture Network EGMN

RewardMap, a multi-stage Reinforcement Learning Framework

Build AI with AI

HyperAI Newsletters

Command Palette

Multi-dimensional pre-training Data Screening Framework Meta-rater

Related Wiki

MultiPL-MoE Architecture

Group Variance Strategy Optimization GVPO

Fully Homomorphic Encryption (FHE)

Cache-to-Cache (C2C)

Gated Attention

Agentic Context Engineering

Guess – Think – Answer

Exponential-Gaussian Mixture Network EGMN

RewardMap, a multi-stage Reinforcement Learning Framework

Build AI with AI

HyperAI Newsletters

Related Wiki

MultiPL-MoE Architecture

Group Variance Strategy Optimization GVPO

Fully Homomorphic Encryption (FHE)

Cache-to-Cache (C2C)

Gated Attention

Agentic Context Engineering

Guess – Think – Answer

Exponential-Gaussian Mixture Network EGMN

RewardMap, a multi-stage Reinforcement Learning Framework

Related Wiki

MultiPL-MoE Architecture

Group Variance Strategy Optimization GVPO

Fully Homomorphic Encryption (FHE)

Cache-to-Cache (C2C)

Gated Attention

Agentic Context Engineering

Guess – Think – Answer

Exponential-Gaussian Mixture Network EGMN

RewardMap, a multi-stage Reinforcement Learning Framework