Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges