Search for a command to run...
SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities