Date: 2 years ago
Size: 16.04 MB
Paper URL: arxiv.org

MedCalc-Bench is a dataset designed to evaluate the medical calculation capabilities of large language models (LLMs). It was jointly released in 2024 by nine institutions, including the National Library of Medicine, the National Institutes of Health, and the University of Virginia. The accompanying paper, "MEDCALC-BENCH: Evaluating Large Language Models for Medical Calculations", was accepted at NeurIPS 2024.

The dataset contains 10,055 training instances and 1,047 test instances covering 55 different calculation tasks. Each instance includes a patient note, a question asking for a specific clinical value, the final answer value, and a step-by-step solution. The dataset fields are row number, calculator ID, calculator name, category, output type, note ID, note type, patient note, question, relevant entities, ground truth answer, lower limit, upper limit, and ground truth explanation. Together these give a model the context it needs for the calculation and give an evaluator an acceptable range for the answer.

The purpose of MedCalc-Bench is to measure and improve the linguistic and computational reasoning abilities of LLMs in medical settings. The training split can be used to fine-tune LLMs, and the test split to evaluate their performance on medical calculation tasks.
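
As a rough illustration of how the lower and upper values listed among the fields might be used, the sketch below scores a numeric prediction as correct when it falls inside that range. This is an assumption about the scoring rule, not the authors' evaluation code. The `Instance` class, its attribute names, and the example numbers are hypothetical and do not come from the dataset files.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class Instance:
    """Hypothetical, trimmed-down view of one record (attribute names are assumptions)."""
    calculator_name: str
    question: str
    ground_truth: float
    lower_limit: float
    upper_limit: float


def is_correct(predicted: float, inst: Instance) -> bool:
    """Treat a prediction as correct if it lies within [lower_limit, upper_limit]."""
    return inst.lower_limit <= predicted <= inst.upper_limit


def accuracy(predictions: Sequence[float], instances: Sequence[Instance]) -> float:
    """Fraction of predictions that fall inside their instance's accepted range."""
    if len(predictions) != len(instances):
        raise ValueError("predictions and instances must have the same length")
    if not instances:
        return 0.0
    hits = sum(is_correct(p, i) for p, i in zip(predictions, instances))
    return hits / len(instances)


# Invented example values, for illustration only.
example = Instance(
    calculator_name="Body Mass Index (BMI)",
    question="What is the patient's BMI in kg/m^2?",
    ground_truth=22.86,
    lower_limit=21.72,
    upper_limit=24.0,
)

print(is_correct(22.9, example))                      # inside the range
print(accuracy([22.9, 30.0], [example, example]))     # one of two inside the range
```

Checking against a range rather than an exact value tolerates small rounding differences in intermediate steps, which is one plausible reason the dataset carries both limits alongside the answer.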

MedCalc-Bench.torrent

Seeding: 1, Downloading: 0, Completed: 213, Total Downloads: 477

MedCalc-Bench/
- README.md
  1.94 KB
- README.txt
  3.88 KB

