Date

2 years ago

Size

13.98 MB

Tags

Text Generation

Supervised Fine-Tuning

LLM

Natural Language Processing

Model Training

The Alpaca-Cleaned dataset is a cleaned version of the original Alpaca dataset released by Stanford University in 2024. The original Alpaca is a dataset of 52,000 instructions and demonstrations generated by the engine of OpenAI (text-davinci-003). This instruction data can be used to perform instruction tuning on language models, making them better at following instructions. This dataset solves some problems in the original Alpaca, such as hallucinatory answers, merged instructions, empty outputs, and inconsistent input fields, thereby improving the quality and consistency of the data. The Alpaca-Cleaned dataset has a variety of application scenarios, including text generation, question-answering systems, natural language understanding, and code understanding and generation. Its features include quality optimization, performance improvement, rich model resources, and open source and community support. It encourages community participation, continuous updates and improvements, and promotes the development of the NLP field.

Citation

@misc{alpaca, author = {Rohan Taori and Ishaan Gulrajani and Tianyi Zhang and Yann Dubois and Xuechen Li and Carlos Guestrin and Percy Liang and Tatsunori B. Hashimoto}, title = {Stanford Alpaca: An Instruction-following LLaMA model}, year = {2023}, publisher = {GitHub}, journal = {GitHub repository}, howpublished = {\url{https://github.com/tatsu-lab/stanford\_alpaca}}, }

Alpaca-Cleaned.torrent

Seeding 1Downloading 0Completed 311Total Downloads 401

Alpaca-Cleaned/
- README.md
  1.57 KB
- README.txt
  3.15 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Download

Discuss on Discord

Date

2 years ago

Size

13.98 MB

Citation

Alpaca-Cleaned.torrent

Seeding 1Downloading 0Completed 311Total Downloads 401

Alpaca-Cleaned/
- README.md
  1.57 KB
- README.txt
  3.15 KB

Related Datasets

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Download

Discuss on Discord

Date

2 years ago

Size

13.98 MB

Citation

Alpaca-Cleaned.torrent

Seeding 1Downloading 0Completed 311Total Downloads 401

Alpaca-Cleaned/
- README.md
  1.57 KB
- README.txt
  3.15 KB

Related Datasets

Movie Feelings Dataset

12 days ago

Rice Leaf Diseases Dataset

a month ago

Transfermarkt Football Dataset

2 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

Alpaca-Cleaned Instruction fine-tuning Dataset

Citation

Build AI with AI

HyperAI Newsletters

Command Palette

Alpaca-Cleaned Instruction fine-tuning Dataset

Citation

Related Datasets

Movie Feelings Dataset

Rice Leaf Diseases Dataset

Transfermarkt Football Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

Alpaca-Cleaned Instruction fine-tuning Dataset

Citation

Related Datasets

Movie Feelings Dataset

Rice Leaf Diseases Dataset

Transfermarkt Football Dataset

Build AI with AI

HyperAI Newsletters

Related Datasets

Movie Feelings Dataset

Rice Leaf Diseases Dataset

Transfermarkt Football Dataset

Related Datasets

Movie Feelings Dataset

Rice Leaf Diseases Dataset

Transfermarkt Football Dataset