HyperAI초신경

Robot Manipulation On Simpler Env

평가 지표

Variant Aggregation
Variant Aggregation-Move Near
Variant Aggregation-Open/Close Drawer
Variant Aggregation-Pick Coke Can
Visual Matching
Visual Matching-Move Near
Visual Matching-Open/Close Drawer
Visual Matching-Pick Coke Can

평가 결과

이 벤치마크에서 각 모델의 성능 결과

모델 이름
Variant Aggregation
Variant Aggregation-Move Near
Variant Aggregation-Open/Close Drawer
Variant Aggregation-Pick Coke Can
Visual Matching
Visual Matching-Move Near
Visual Matching-Open/Close Drawer
Visual Matching-Pick Coke Can
Paper TitleRepository
Octo-Base0.0120.0310.0110.0060.1680.0420.2270.170Octo: An Open-Source Generalist Robot Policy-
RT-2-X0.6610.7920.3530.8230.6060.7790.2500.787RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
SoFar0.6760.7400.2970.9070.7490.9170.4030.923SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
SpatialVLA0.6880.7170.3620.8950.7190.6960.5930.810SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model-
OpenVLA0.4110.4770.1770.5450.2770.4620.3560.163OpenVLA: An Open-Source Vision-Language-Action Model
RoboVLM0.4630.5600.0850.6830.5630.6630.2680.727Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models-
TraceVLA0.4500.5640.3100.6000.4600.6000.2400.560TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies-
RT-1-X0.3970.3230.2940.4900.5340.3170.5970.567RT-1: Robotics Transformer for Real-World Control at Scale
0 of 8 row(s) selected.