HyperAIHyperAI

Command Palette

Search for a command to run...

DPO-zh-en-emoji Emoji Question Answering Dataset

Date

a year ago

Size

5.59 MB

Organization

* This dataset is available online.Click here to jump.

Dataset Introduction

The DPO-zh-en-emoji dataset is a dataset specially designed for fine-tuning large language models launched by shareAI in 2024, where "DPO" stands for Direct Preference Optimization. This dataset contains a large amount of question-answer pair data. Each question has two versions of the answer, Chinese and English, and the answers are integrated with fun and humorous elements, including the use of emojis. The research team carefully selected some questions from Zhihu, logical reasoning, and idiots as queries, and used the llama3 70b instruct model to sample and generate a Chinese version of the answer and an English version of the answer for each query. Such a design helps to activate the language style preferences of multilingual chat models and improve the quality of model-generated content and its compliance with human preferences.

DPO-zh-en-emoji.torrent
Seeding 2Downloading 0Completed 131Total Downloads 342
  • DPO-zh-en-emoji/
    • README.md
      1.58 KB
    • README.txt
      3.16 KB
      • data/
        • DPO-zh-en-emoji.zip
          5.59 MB

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp