HelpSteer3 Human Preference Dataset
Date
2 months ago
Size
247.99 MB
Publish URL
Paper URL
License
CC BY 4.0
HelpSteer3 is a human preference dataset released by NVIDIA in 2025. The related paper results are "HelpSteer3-Preference: Open Human-Annotated Preference Data across Diverse Tasks and Languages", which aims to improve the model's responsiveness to user prompts through human feedback and reinforcement learning techniques.
The dataset contains 40,476 preference samples, each of which includes a domain, language, context, two responses, an overall preference score between the two responses, and personal preference scores from up to three annotators. It includes multilingual data (Chinese, Korean, French, Spanish, Japanese, German, Russian, Portuguese, Italian, Vietnamese, and Dutch).
HelpSteer3.torrent
Seeding 1Downloading 0Completed 20Total Downloads 87