HyperAI

Auto Debugging On Big Bench Lite

Metrics

Exact string match

Results

Performance results of various models on this benchmark

Comparison Table
Model NameExact string match
palm-scaling-language-modeling-with-pathways-138.2
palm-scaling-language-modeling-with-pathways-114.7
palm-scaling-language-modeling-with-pathways-138.2