Instruction Following on IFEval | SOTA | HyperAI

Performance results of various models on this benchmark

Inst-level loose-accuracy

Inst-level strict-accuracy

Prompt-level loose-accuracy

Prompt-level strict-accuracy
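
The four metrics above differ in how results are aggregated (per instruction or per prompt) and in how leniently a response is checked (strict or loose). Below is a minimal sketch of the aggregation step only, assuming the usual IFEval definitions: instruction-level accuracy is the share of individual instructions followed, and prompt-level accuracy is the share of prompts whose instructions were all followed. The function names and the verdict lists are hypothetical and for illustration only; they are not results from this leaderboard, and the strict and loose checkers themselves are not shown.

```python
from typing import List


def inst_level_accuracy(results: List[List[bool]]) -> float:
    """Share of individual instructions that were followed, over all prompts."""
    flat = [ok for prompt in results for ok in prompt]
    return sum(flat) / len(flat)


def prompt_level_accuracy(results: List[List[bool]]) -> float:
    """Share of prompts for which every instruction was followed."""
    return sum(all(prompt) for prompt in results) / len(results)


# Hypothetical verdicts: one inner list per prompt, one bool per instruction.
# "strict" checks the raw response; "loose" is assumed to also accept lightly
# normalized variants of the response, so it can only pass more instructions.
strict = [[True, True], [True, False], [True]]
loose = [[True, True], [True, True], [True]]

for name, verdicts in (("strict", strict), ("loose", loose)):
    print(
        f"{name}: inst-level={inst_level_accuracy(verdicts):.3f}, "
        f"prompt-level={prompt_level_accuracy(verdicts):.3f}"
    )
```

Prompt-level accuracy can never exceed instruction-level accuracy under the same checking mode, because a single missed instruction fails the whole prompt.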
