HyperAI超神经

Reasoning

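Reasoning is the process of solving problems and making decisions through logical analysis and inference. Its goal is to improve the accuracy and efficiency of decisions by ensuring that conclusions follow soundly from existing data and knowledge. In artificial intelligence, reasoning is especially valuable: it helps machines understand complex situations and learn and optimize autonomously, and it is widely applied in intelligent assistants, autonomous driving, medical diagnosis, and similar scenarios.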

3D Human Reconstruction

29 papers | 10 benchmarks

Common Sense Reasoning

81 papers | 24 benchmarks

Temporal Sequences

1 paper | 1 benchmark

Physical Intuition

2 papers | 1 benchmark

Natural Language Visual Grounding

13 papers | 1 benchmark

Zero-Shot Video Question Answer

65 papers | 16 benchmarks

Visual Reasoning

45 papers | 12 benchmarks

Decision Making

1 paper | 1 benchmark

Odd One Out

2 papers | 1 benchmark

Video-based Generative Performance Benchmarking

19 papers | 1 benchmark

Abstract Algebra

1 paper | 1 benchmark

Program Repair

5 papers | 3 benchmarks

Image Paragraph Captioning

10 papers | 1 benchmark

Video-based Generative Performance Benchmarking (Contextual Understanding)

16 papers | 1 benchmark

Video-based Generative Performance Benchmarking (Correctness of Information)

16 papers | 1 benchmark

Video-based Generative Performance Benchmarking (Detail Orientation)

16 papers | 1 benchmark

Video-based Generative Performance Benchmarking (Temporal Understanding)

16 papers | 1 benchmark

Video-based Generative Performance Benchmarking (Consistency)

16 papers | 1 benchmark

Visual Commonsense Reasoning

2 papers | 7 benchmarks

Formal Logic

1 paper | 1 benchmark

Automated Theorem Proving

27 papers | 9 benchmarks

Arithmetic Reasoning

55 papers | 5 benchmarks

Error Understanding

4 papers | 2 benchmarks

Mathematical Induction

2 papers | 1 benchmark

Physical Commonsense Reasoning

1 paper | 1 benchmark

Analogical Similarity

2 papers | 1 benchmark

Elementary Mathematics

1 paper | 1 benchmark

Logical Reasoning

7 papers | 10 benchmarks

Math Word Problem Solving

70 papers | 13 benchmarks

Multimodal Reasoning

3 papers | 3 benchmarks

Visual Entailment

11 papers | 3 benchmarks

Human Judgment Correlation

3 papers | 2 benchmarks

High School Mathematics

1 paper | 1 benchmark

College Mathematics

1 paper | 1 benchmark

Crass AI

3 papers | 1 benchmark

Analytic Entailment

2 papers | 1 benchmark

Crash Blossom

2 papers | 1 benchmark

Entailed Polarity

2 papers | 1 benchmark

Evaluating Information Essentiality

2 papers | 1 benchmark

Human Judgment Classification

2 papers | 1 benchmark

Identify Odd Metaphor

2 papers | 1 benchmark

Logical Args

2 papers | 1 benchmark

Metaphor Boolean

2 papers | 1 benchmark

Presuppositions As NLI

2 papers | 1 benchmark

Commonsense Reasoning for RL

1 paper | 1 benchmark

Professional Accounting

1 paper | 1 benchmark