Spatial Reasoning with Vision-Language Models in Ego-Centric Multi-View Scenes