
Orca-Math-200K Microsoft Math Word Problems Dataset

Date: a year ago
Size: 70.88 MB
Organization: Microsoft
Publish URL: huggingface.co

Orca-Math-200K is a synthetic dataset created by Microsoft that contains approximately 200,000 grade-school math word problems. All answers in the dataset were generated with GPT-4 Turbo on Azure.

The researchers used multiple agents to build the dataset. Construction involved assembling a seed set, generating questions with an "Ask Me Anything" agent, generating further questions through collaboration between a proposer agent and an editor agent, importing the DMath dataset, enhancing the resulting data, and iterative learning. The dataset is intended to improve the mathematical problem-solving ability of language models.

orca-math-word.torrent
Seeding: 2 | Downloading: 1 | Completed: 122 | Total downloads: 226
  • orca-math-word/
    • README.md (1.34 KB)
    • README.txt (2.68 KB)
    • data/
      • orca-math-word-200k.zip (70.88 MB)
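
The listing above does not say what is inside orca-math-word-200k.zip, so a reasonable first step after downloading is to inspect the archive before extracting it. The following is a minimal sketch using only Python's standard library. The helper name summarize_archive and the local path are assumptions made for illustration; the path simply mirrors the torrent layout shown above.

```python
import zipfile
from pathlib import Path


def summarize_archive(zip_path):
    """Return (member name, uncompressed size in bytes) for each file in a zip archive.

    Hypothetical helper for illustration; it makes no assumption about
    the format of the files stored inside the archive.
    """
    with zipfile.ZipFile(zip_path) as archive:
        return [
            (info.filename, info.file_size)
            for info in archive.infolist()
            if not info.is_dir()  # skip directory entries
        ]


if __name__ == "__main__":
    # Assumed download location, mirroring the torrent listing; adjust as needed.
    path = Path("orca-math-word/data/orca-math-word-200k.zip")
    if path.exists():
        for name, size in summarize_archive(path):
            print(f"{size / 1_000_000:8.2f} MB  {name}")
    else:
        print(f"{path} not found; download the archive first.")
```

Listing the members first shows which file format the data uses, which in turn determines how to read the question and answer records.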