HyperAIHyperAI

Command Palette

Search for a command to run...

seq-monkey Sequence Monkey Open Source Dataset 1.0

Date

2 years ago

Size

10.73 GB

Organization

Publish URL

github.com

Sequence Monkey is a large-scale language model provided by Mobvoi.The Sequence Monkey dataset is a data set used to train the Sequence Monkey model. Part of the dataset is now open to the public.

The 1.0 version of the dataset covers the following areas: Chinese general text corpus, ancient poetry modern translation corpus, and text generation corpus. Among them, the Chinese general text corpus is 13 million pieces of data extracted from the sequence monkey training set and is open to the public. The ancient poetry modern translation open source dataset is a dataset of ancient and modern text translations, with 680,000 poems open. The text generation fine-tuning dataset has 5,000 question-and-answer data open, which can be used for word error detection, word error correction, and text polishing tasks.

seq-monkey.torrent
Seeding 1Downloading 0Completed 382Total Downloads 859
  • seq-monkey/
    • README.md
      1.36 KB
    • README.txt
      2.72 KB
      • data/
        • seq-monkey-data-main 2.zip
          10.73 GB

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp