Date

a year ago

Organization

Publish URL

github.com

Paper URL

arxiv.org

Total amount of data:

Crystal composition modality (Composition): about 4.7M tokens
Crystal structure modality (CIF): about 615.5M tokens
Text descriptions: about 3.1B tokens

The dataset was built by collecting original CIF files and material properties from several mainstream materials databases, then automatically generating natural-language structure descriptions from the crystal structures. The result is a set of multi-modal samples in a unified format. Each sample record contains the material ID, the chemical formula, property values (such as band gap, formation energy, density, and elastic modulus), and other information.

LLM4Mat-Bench aims to bring materials science and natural language processing together and to support research and applications in task-specific model evaluation, property prediction, and instruction fine-tuning. Its multi-source, multi-modal, large-scale nature makes it an important reference benchmark for research on materials language models.
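As a rough illustration of the record contents and token counts described above, the sketch below models one sample as a Python dataclass and computes each modality's share of the total tokens. The class name `MaterialRecord`, its field names, the helper `modality_shares`, and the example values are all hypothetical; they are not the dataset's actual schema or loader API.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional

# Approximate token counts per modality, as quoted in the description.
TOKENS = {
    "composition": 4.7e6,
    "cif": 615.5e6,
    "description": 3.1e9,
}


@dataclass
class MaterialRecord:
    """Hypothetical layout of one sample; field names are assumptions."""
    material_id: str
    formula: str
    properties: Dict[str, float] = field(default_factory=dict)
    cif: Optional[str] = None          # raw CIF text, if available
    description: Optional[str] = None  # generated structure description


def modality_shares(tokens: Dict[str, float]) -> Dict[str, float]:
    """Return each modality's fraction of the total token count."""
    total = sum(tokens.values())
    return {name: count / total for name, count in tokens.items()}


# Placeholder record; the ID and band gap are illustrative, not dataset values.
record = MaterialRecord(
    material_id="example-0001",
    formula="NaCl",
    properties={"band_gap_eV": 5.0},
)

shares = modality_shares(TOKENS)
for name, share in shares.items():
    print(f"{name}: {share:.2%}")
```

Under these quoted counts, the text descriptions account for the large majority of tokens, so any token budget for fine-tuning is dominated by that modality.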

Citation

```bib
@article{rubungo2025llm4mat,
  title={LLM4Mat-bench: benchmarking large language models for materials property prediction},
  author={Rubungo, Andre Niyongabo and Li, Kangming and Hattrick-Simpers, Jason and Dieng, Adji Bousso},
  journal={Machine Learning: Science and Technology},
  volume={6},
  number={2},
  pages={020501},
  year={2025},
  publisher={IOP Publishing}
}
```






LLM4Mat-Bench Crystal Structure Dataset
