VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction