HyperAI

Leistungsergebnisse verschiedener Modelle auf diesem Benchmark

Inst-level loose-accuracy

Inst-level strict-accuracy

Prompt-level loose-accuracy

Prompt-level strict-accuracy

4 Zeilen insgesamt

Instruction Following On Ifeval | SOTA | HyperAI