Search for a command to run...
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI