Instruction Following On Ifeval | SOTA | HyperAI

Résultats de performance de différents modèles sur ce benchmark

Inst-level loose-accuracy

Inst-level strict-accuracy

Prompt-level loose-accuracy

Prompt-level strict-accuracy

4 lignes au total