HyperAIHyperAI

HelpSteer3 Human Preference Dataset

Date

2 months ago

Size

247.99 MB

Organization

NVIDIA

Publish URL

huggingface.co

Paper URL

arxiv.org

License

CC BY 4.0

HelpSteer3 is a human preference dataset released by NVIDIA in 2025. The related paper results are "HelpSteer3-Preference: Open Human-Annotated Preference Data across Diverse Tasks and Languages", which aims to improve the model's responsiveness to user prompts through human feedback and reinforcement learning techniques.

The dataset contains 40,476 preference samples, each of which includes a domain, language, context, two responses, an overall preference score between the two responses, and personal preference scores from up to three annotators. It includes multilingual data (Chinese, Korean, French, Spanish, Japanese, German, Russian, Portuguese, Italian, Vietnamese, and Dutch).

HelpSteer3.torrent
Seeding 1Downloading 0Completed 20Total Downloads 87
  • HelpSteer3/
    • README.md
      1.4 KB
    • README.txt
      2.79 KB
      • data/
        • HelpSteer3.zip
          247.99 MB