Search for a command to run...
Room-Across-Room: Multilingual Vision-and-Language Navigation with Dense Spatiotemporal Grounding