Reasoning
Reasoning是指通过逻辑分析和推理来解决问题和做出决策的过程。其目标是提高决策的准确性和效率,确保基于现有数据和知识得出合理结论。在人工智能领域,Reasoning的应用价值尤为突出,能够帮助机器理解复杂情境,进行自主学习和优化,广泛应用于智能助手、自动驾驶、医疗诊断等场景。
3D Human Reconstruction
29 篇论文 | 10 个基准测试
Common Sense Reasoning
81 篇论文 | 24 个基准测试
Temporal Sequences
1 篇论文 | 1 个基准测试
Physical Intuition
2 篇论文 | 1 个基准测试
Natural Language Visual Grounding
13 篇论文 | 1 个基准测试
Zero-Shot Video Question Answer
65 篇论文 | 16 个基准测试
Visual Reasoning
45 篇论文 | 12 个基准测试
Decision Making
1 篇论文 | 1 个基准测试
Odd One Out
2 篇论文 | 1 个基准测试
Video-based Generative Performance Benchmarking
19 篇论文 | 1 个基准测试
Abstract Algebra
1 篇论文 | 1 个基准测试
Program Repair
5 篇论文 | 3 个基准测试
Image Paragraph Captioning
10 篇论文 | 1 个基准测试
Video-based Generative Performance Benchmarking (Contextual Understanding)
16 篇论文 | 1 个基准测试
Video-based Generative Performance Benchmarking (Correctness of Information)
16 篇论文 | 1 个基准测试
Video-based Generative Performance Benchmarking (Detail Orientation))
16 篇论文 | 1 个基准测试
Video-based Generative Performance Benchmarking (Temporal Understanding)
16 篇论文 | 1 个基准测试
Video-based Generative Performance Benchmarking (Consistency)
16 篇论文 | 1 个基准测试
Visual Commonsense Reasoning
2 篇论文 | 7 个基准测试
Formal Logic
1 篇论文 | 1 个基准测试
Automated Theorem Proving
27 篇论文 | 9 个基准测试
Arithmetic Reasoning
55 篇论文 | 5 个基准测试
Error Understanding
4 篇论文 | 2 个基准测试
Mathematical Induction
2 篇论文 | 1 个基准测试
Physical Commonsense Reasoning
1 篇论文 | 1 个基准测试
Analogical Similarity
2 篇论文 | 1 个基准测试
Elementary Mathematics
1 篇论文 | 1 个基准测试
Logical Reasoning
7 篇论文 | 10 个基准测试
Math Word Problem Solving
70 篇论文 | 13 个基准测试
Multimodal Reasoning
3 篇论文 | 3 个基准测试
Visual Entailment
11 篇论文 | 3 个基准测试
Human Judgment Correlation
3 篇论文 | 2 个基准测试
High School Mathematics
1 篇论文 | 1 个基准测试
College Mathematics
1 篇论文 | 1 个基准测试
Crass AI
3 篇论文 | 1 个基准测试
Analytic Entailment
2 篇论文 | 1 个基准测试
Crash Blossom
2 篇论文 | 1 个基准测试
Entailed Polarity
2 篇论文 | 1 个基准测试
Evaluating Information Essentiality
2 篇论文 | 1 个基准测试
Human Judgment Classification
2 篇论文 | 1 个基准测试
Identify Odd Metapor
2 篇论文 | 1 个基准测试
Logical Args
2 篇论文 | 1 个基准测试
Metaphor Boolean
2 篇论文 | 1 个基准测试
Presuppositions As NLI
2 篇论文 | 1 个基准测试
Commonsense Reasoning for RL
1 篇论文 | 1 个基准测试
Professional Accounting
1 篇论文 | 1 个基准测试