Robot Manipulation On Simpler Env
Métriques
Variant Aggregation
Variant Aggregation-Move Near
Variant Aggregation-Open/Close Drawer
Variant Aggregation-Pick Coke Can
Visual Matching
Visual Matching-Move Near
Visual Matching-Open/Close Drawer
Visual Matching-Pick Coke Can
Résultats
Résultats de performance de divers modèles sur ce benchmark
Tableau comparatif
Nom du modèle | Variant Aggregation | Variant Aggregation-Move Near | Variant Aggregation-Open/Close Drawer | Variant Aggregation-Pick Coke Can | Visual Matching | Visual Matching-Move Near | Visual Matching-Open/Close Drawer | Visual Matching-Pick Coke Can |
---|---|---|---|---|---|---|---|---|
octo-an-open-source-generalist-robot-policy | 0.012 | 0.031 | 0.011 | 0.006 | 0.168 | 0.042 | 0.227 | 0.170 |
rt-2-vision-language-action-models-transfer | 0.661 | 0.792 | 0.353 | 0.823 | 0.606 | 0.779 | 0.250 | 0.787 |
sofar-language-grounded-orientation-bridges | 0.676 | 0.740 | 0.297 | 0.907 | 0.749 | 0.917 | 0.403 | 0.923 |
spatialvla-exploring-spatial-representations | 0.688 | 0.717 | 0.362 | 0.895 | 0.719 | 0.696 | 0.593 | 0.810 |
openvla-an-open-source-vision-language-action | 0.411 | 0.477 | 0.177 | 0.545 | 0.277 | 0.462 | 0.356 | 0.163 |
towards-generalist-robot-policies-what | 0.463 | 0.560 | 0.085 | 0.683 | 0.563 | 0.663 | 0.268 | 0.727 |
tracevla-visual-trace-prompting-enhances | 0.450 | 0.564 | 0.310 | 0.600 | 0.460 | 0.600 | 0.240 | 0.560 |
rt-1-robotics-transformer-for-real-world | 0.397 | 0.323 | 0.294 | 0.490 | 0.534 | 0.317 | 0.597 | 0.567 |