Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback