Orca-Math-200K Microsoft Math Word Problems Dataset
Date
a year ago
Size
70.88 MB
Publish URL
Tags
Categories
Orca-Math-200K is a high-quality synthetic dataset created by Microsoft that contains approximately 200,000 elementary school math questions. All answers in this dataset are generated using Azure GPT4-Turbo.
The researchers created multiple agents to assist in the construction of the dataset, which involved seed set construction, Agent-Ask Me Anything question generation, Agent-proposer-editor collaborative generation, DMath dataset import, dataset enhancement, and iterative learning. The dataset aims to improve the mathematical capabilities of language models in order to provide a solid foundation for language models in mathematical problem solving.
orca-math-word.torrent
Seeding 2Downloading 1Completed 122Total Downloads 226