HyperAIHyperAI

Command Palette

Search for a command to run...

ChatHaruhi-RolePlaying role-playing Dialogue Dataset

Date

2 years ago

Size

93.83 MB

Paper URL

arxiv.org

Featured Image

* This dataset supports online use.Click here to jump.

ChatHaruhi is a dataset containing 32 Chinese/English TV/anime characters and over 54k simulated dialogues.

Role-playing chatbots built with large language models have attracted widespread attention, but more advanced technology is needed to imitate specific fictional characters. The researchers proposed an algorithm to control the language model through improved prompts and memory of characters extracted from scripts. By collecting corpora from movies, novels, and scripts and performing structured extraction, the researchers collected more than 23,000 dialogue messages. These dialogue data can be used to train and test role-playing language models. At the same time, using the algorithm proposed by the researchers and with the help of GPT3 and GPT4, the researchers generated more than 27,000 additional dialogues for these characters.

ChatHaruhi-RolePlaying.torrent
Seeding 1Downloading 0Completed 351Total Downloads 1,106
  • ChatHaruhi-RolePlaying/
    • README.md
      1.45 KB
    • README.txt
      2.9 KB
      • data/
        • ChatHaruhi-RolePlaying.zip
          93.83 MB

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp