Search for a command to run...
Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation