3 months ago

Fei Zhao, Chonggang Lu, Haofu Qian, Fangcheng Shi, Zijie Meng, Jianzhao Huang, Xu Tang, Zheyong Xie, Zheyu Ye, Zhe Xu, and 2 more

Abstract

As a key medium for human interaction and information exchange, social networking services (SNS) pose unique challenges for large language models (LLMs): heterogeneous workloads, fast-shifting norms and slang, and multilingual, culturally diverse corpora that induce sharp distribution shift. Supervised fine-tuning (SFT) can specialize models but often triggers a "seesaw" between in-distribution gains and out-of-distribution robustness, especially for smaller models. To address these challenges, we introduce RedOne 2.0, an SNS-oriented LLM trained with a progressive, RL-prioritized post-training paradigm designed for rapid and stable adaptation. The pipeline consists of three stages: (1) Exploratory Learning on curated SNS corpora to establish initial alignment and identify systematic weaknesses; (2) Targeted Fine-Tuning that selectively applies SFT to the diagnosed gaps while mixing in a small fraction of general data to mitigate forgetting; and (3) Refinement Learning that re-applies RL with SNS-centric signals to consolidate improvements and harmonize trade-offs across tasks. Across various tasks spanning three categories, our 4B-scale model delivers an average improvement of about 2.41 over the 7B sub-optimal baseline. Additionally, RedOne 2.0 achieves an average performance lift of about 8.74 over the base model with less than half the data required by the SFT-centric method RedOne, evidencing superior data efficiency and stability at compact scales. Overall, RedOne 2.0 establishes a competitive, cost-effective baseline for domain-specific LLMs in SNS scenarios, advancing capability without sacrificing robustness.
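The data selection behind stage (2) can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the score threshold, and the 10% general-data fraction in the usage note are all hypothetical, since the abstract only says that SFT targets the diagnosed gaps and that a small fraction of general data is mixed in.

```python
import random
from typing import Dict, List


def diagnose_weak_tasks(scores: Dict[str, float], threshold: float) -> List[str]:
    """Return (sorted) the tasks whose evaluation score is below the threshold.

    Hypothetical stand-in for the weakness diagnosis after Exploratory Learning.
    """
    return sorted(task for task, score in scores.items() if score < threshold)


def build_targeted_mix(
    sns_examples: Dict[str, List[str]],
    general_examples: List[str],
    weak_tasks: List[str],
    general_fraction: float,
    seed: int = 0,
) -> List[str]:
    """Build an SFT set from weak-task SNS data plus a small share of general data.

    general_fraction is the assumed share of general examples in the final mix.
    """
    if not 0.0 <= general_fraction < 1.0:
        raise ValueError("general_fraction must be in [0, 1)")
    rng = random.Random(seed)
    # Keep only the SNS examples that belong to the diagnosed weak tasks.
    targeted = [ex for task in weak_tasks for ex in sns_examples.get(task, [])]
    # Choose the general count so it makes up general_fraction of the final mix,
    # capped by how many general examples are available.
    n_general = round(len(targeted) * general_fraction / (1.0 - general_fraction))
    n_general = min(n_general, len(general_examples))
    mix = targeted + rng.sample(general_examples, n_general)
    rng.shuffle(mix)
    return mix
```

For instance, with nine examples from weak tasks and `general_fraction=0.1`, the function adds one general example, so general data forms a tenth of the resulting mix. The actual ratio and the diagnosis criterion used for RedOne 2.0 are not stated in the abstract.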

Supervised Fine-Tuning

LLM

Model Training

Method/Architecture

RedOne 2.0: Rethinking Domain-specific LLM Post-Training in Social Networking Services
