Date

9 months ago

Size

7.03 GB

Organization

Paper URL

2509.15293

License

Apache 2.0

Tags

Embodied Intelligence

Action Recognition

FoMER Bench is the Foundation Model Embodied Reasoning (FoMER) benchmark, released in 2025 by Mohamed bin Zayed University of Artificial Intelligence, Linköping University, and Australian National University. It accompanies the paper "How Good are Foundation Models in Step-by-Step Embodied Reasoning?" and aims to evaluate the reasoning ability of large multimodal models (LMMs) in complex embodied decision-making scenarios. The dataset contains over 1,100 examples with detailed step-by-step reasoning, spanning 10 tasks and 8 embodiments. It covers three different robot types and multiple robot modes, enabling evaluation of LMM capabilities on tasks such as next-step action prediction, action affordance, physical common sense, temporal reasoning, tool use and manipulation, risk assessment, and robot navigation. The data includes multiple-choice questions (MCQs), true/false (TF) questions, and open-ended questions. Each example comes with an input observation (a video or image frame plus a text prompt), multiple candidate actions, and the corresponding step-by-step reasoning trace.
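To make the example structure described above concrete, here is a minimal Python sketch of how one record might be represented and how final answers for the MCQ and TF questions could be scored. The field names (`question_type`, `media_path`, `candidates`, `reasoning_steps`) and the exact-match scoring rule are assumptions made for illustration; they are not taken from the dataset files or the paper, so check the README shipped with the download for the actual schema.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Example:
    """Hypothetical record layout; the real FoMER files may use other field names."""
    question_type: str                      # assumed labels: "mcq", "tf", or "open"
    prompt: str                             # text prompt shown to the model
    media_path: str                         # path to the video or image frame
    candidates: List[str] = field(default_factory=list)       # candidate actions
    answer: str = ""                        # reference final answer
    reasoning_steps: List[str] = field(default_factory=list)  # reference trace


def is_correct(example: Example, prediction: str) -> Optional[bool]:
    """Exact-match check for MCQ/TF answers; returns None for open-ended items."""
    if example.question_type not in ("mcq", "tf"):
        return None  # open-ended answers need a separate judging step
    return prediction.strip().lower() == example.answer.strip().lower()


def accuracy(examples: List[Example], predictions: List[str]) -> float:
    """Fraction of gradable (MCQ/TF) examples answered correctly."""
    graded = [is_correct(e, p) for e, p in zip(examples, predictions)]
    graded = [g for g in graded if g is not None]
    return sum(graded) / len(graded) if graded else 0.0
```

This sketch only scores the final answer. Open-ended responses and the step-by-step reasoning traces would need a different evaluation procedure, which is not covered here.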

Citation

@misc{dissanayake2025goodfoundationmodelsstepbystep,
  title={How Good are Foundation Models in Step-by-Step Embodied Reasoning?},
  author={Dinura Dissanayake and Ahmed Heakl and Omkar Thawakar and Noor Ahsan and Ritesh Thawkar and Ketan More and Jean Lahoud and Rao Anwer and Hisham Cholakkal and Ivan Laptev and Fahad Shahbaz Khan and Salman Khan},
  year={2025},
  eprint={2509.15293},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2509.15293},
}

FoMER.torrent

Seeding: 1, Downloading: 0, Completed: 9, Total Downloads: 100

FoMER/
- README.md
  1.79 KB
- README.txt
  3.59 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Related Datasets

MAKIEVAL Multilingual Cultural Knowledge Assessment Dataset

2 hours ago

Verbatim Spans Query Condition Evidence Extraction Dataset

in 6 hours

SAM 3D Artist Objects 3D Object Reconstruction Dataset

5 days ago

Nemotron-SFT-Math-v4 Mathematical Inference SFT Dataset

7 days ago

ChartNet Chart Understanding Multimodal Dataset

25 days ago

SMOL Multilingual Translation Parallel Dataset

a month ago

chi-bench Medical Intelligent Agent Benchmark Evaluation Dataset

13 days ago

MathNet Multimodal Mathematical Benchmark Inference Dataset

a month ago

QCalEval Quantum Calibration Graph Understanding Dataset

2 months ago

RSRCC Remote Sensing Area Change Understanding Benchmark Dataset

8 days ago

World Model Bench Dataset

2 months ago

GPT-5.4-step-by-step-reasoning Dataset

2 months ago
