Date

2 years ago

Size

2.89 MB

Organization

Publish URL

github.com

Paper URL

arxiv.org

Tags

LLM

Multimodal Representation

OceanInstruct is a large-scale language model instruction dataset designed specifically for the field of ocean science. It contains 20,000 instructions and aims to provide training data for large-scale language models in the ocean field. These instructions cover a wide range of ocean science knowledge, ensuring that the model has professional capabilities in ocean science question answering, content generation, and underwater embodied intelligence. This dataset was used to train the OceanGPT model, which performs well in ocean science question answering, content generation, and other aspects. The OceanGPT model outperforms the baseline language model on multiple tasks, showing its advantage in handling ocean tasks that require expertise. This dataset was open sourced by Zhejiang University in 2024, and the related paper results are "OceanGPT: A Large Language Model for Ocean Science Tasks". The address of the super neuro report isSelected for ACL 2024! Zhejiang University launches the first ocean language model OceanGPT, making underwater embodied intelligence a reality". In addition, OceanBench also proposed OceanBench oceanography benchmark evaluation dataset, which is a benchmark evaluation dataset specifically for oceanographic tasks. This dataset includes a total of 15 ocean-related tasks, such as question answering and description tasks, and is designed to comprehensively evaluate the capabilities of large language models (LLMs) in the field of oceanography. The samples in OceanBench are generated from seed datasets in an automated way and manually verified by experts to ensure the professionalism and accuracy of the data.

OceanInstruct.torrent

Seeding 2Downloading 0Completed 207Total Downloads 389

OceanInstruct/
- README.md
  1.48 KB
- README.txt
  2.96 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Related Datasets

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Download

Discuss on Discord

Date

2 years ago

Size

2.89 MB

Organization

Publish URL

github.com

Paper URL

arxiv.org

Related Datasets

Nemotron Personas France (French Synthetic Personas Dataset)

2 months ago

Nemotron-Personas-Brazil Brazilian Synthetic Character Dataset

4 months ago

RoVid-X Robot Video Generation Dataset

2 months ago

Patient Segmentation Dataset

4 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

OceanInstruct Ocean Large Model Instruction Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

OceanInstruct Ocean Large Model Instruction Dataset

Related Datasets

Nemotron Personas France (French Synthetic Personas Dataset)

Nemotron-Personas-Brazil Brazilian Synthetic Character Dataset

RoVid-X Robot Video Generation Dataset

Patient Segmentation Dataset

Build AI with AI

HyperAI Newsletters

Command Palette

OceanInstruct Ocean Large Model Instruction Dataset

Related Datasets

Nemotron Personas France (French Synthetic Personas Dataset)

Nemotron-Personas-Brazil Brazilian Synthetic Character Dataset

RoVid-X Robot Video Generation Dataset

Patient Segmentation Dataset

Build AI with AI

HyperAI Newsletters

Related Datasets

Nemotron Personas France (French Synthetic Personas Dataset)

Nemotron-Personas-Brazil Brazilian Synthetic Character Dataset

RoVid-X Robot Video Generation Dataset

Patient Segmentation Dataset

Related Datasets

Nemotron Personas France (French Synthetic Personas Dataset)

Nemotron-Personas-Brazil Brazilian Synthetic Character Dataset

RoVid-X Robot Video Generation Dataset

Patient Segmentation Dataset