| GPT-3 175B (zero-shot) | - | 45.5 | - | Language Models are Few-Shot Learners | |
| LLaMA 33B (zero-shot) | - | 48.3 | 64.1 | LLaMA: Open and Efficient Foundation Language Models | |
| LLaMA 65B (zero-shot) | - | 51.6 | 67.9 | LLaMA: Open and Efficient Foundation Language Models | |
| GPT-3 175B (zero-shot) | - | - | 58.4 | Language Models are Few-Shot Learners | |
| BLOOM 176B (one-shot) | - | 39.14 | 52.3 | BloombergGPT: A Large Language Model for Finance | |
| GPT-NeoX (one-shot) | - | 34.33 | 41.23 | BloombergGPT: A Large Language Model for Finance | |
| OPT 66B (one-shot) | - | 37.02 | 47.42 | BloombergGPT: A Large Language Model for Finance | |
| PaLM 8B (zero-shot) | - | 42.3 | 57.9 | PaLM: Scaling Language Modeling with Pathways | |
| BloombergGPT (one-shot) | - | 41.74 | 54.32 | BloombergGPT: A Large Language Model for Finance | |
| PaLM 540B (zero-shot) | - | 49.1 | 68.1 | PaLM: Scaling Language Modeling with Pathways | |