On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification