HyperAI
HyperAI
Startseite
Plattform
Dokumentation
Neuigkeiten
Forschungsarbeiten
Tutorials
Datensätze
Wiki
SOTA
LLM-Modelle
GPU-Rangliste
Veranstaltungen
Suche
Über
Nutzungsbedingungen
Datenschutzrichtlinie
Deutsch
HyperAI
HyperAI
Toggle Sidebar
Seite durchsuchen…
⌘
K
Command Palette
Search for a command to run...
Plattform
Startseite
SOTA
Visions- und Sprachnavigation
Vision And Language Navigation On Touchdown
Vision And Language Navigation On Touchdown
Metriken
Task Completion (TC)
Ergebnisse
Leistungsergebnisse verschiedener Modelle zu diesem Benchmark
Columns
Modellname
Task Completion (TC)
Paper Title
FLAME
40.20
FLAME: Learning to Navigate with Multimodal LLM in Urban Environments
ORAR + junction type + heading delta
29.1
Analyzing Generalization of Vision and Language Navigation to Unseen Outdoor Areas
ORAR
24.2
Analyzing Generalization of Vision and Language Navigation to Unseen Outdoor Areas
ARC + L2STOP
16.68
Learning to Stop: A Simple yet Effective Approach to Urban Vision-Language Navigation
VLN Transformer +M-50 +style
16.2
Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation
VLN Transformer
14.9
Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation
ARC
14.13
Learning to Stop: A Simple yet Effective Approach to Urban Vision-Language Navigation
Retouch-RConcat
12.8
Retouchdown: Adding Touchdown to StreetLearn as a Shareable Resource for Language Grounding Tasks in Street View
Gated Attention (GA)
11.9
Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation
RConcat
11.8
Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation
RConcat
10.7
Touchdown: Natural Language Navigation and Spatial Reasoning in Visual Street Environments
Gated Attention (GA)
5.5
Touchdown: Natural Language Navigation and Spatial Reasoning in Visual Street Environments
0 of 12 row(s) selected.
Previous
Next
Vision And Language Navigation On Touchdown | SOTA | HyperAI