Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning