HyperAI

Few Shot Text Classification On Raft

Metriken

Over
ADE
Avg
B77
NIS
OSE
SOT
SRI
TAI
TC
TEH
ToS

Ergebnisse

Leistungsergebnisse verschiedener Modelle zu diesem Benchmark

Modellname
Over
ADE
Avg
B77
NIS
OSE
SOT
SRI
TAI
TC
TEH
ToS
Paper TitleRepository
GPT-3 zero-shot0.3780.1630.2920.0000.5720.3230.6280.0270.3620.2900.3030.164RAFT: A Real-World Few-Shot Text Classification Benchmark
T-Few0.950.8040.7580.6950.8330.6760.9150.5080.7360.8790.5860.75Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning
Plurality-class0.3370.4460.3310.0000.3530.1640.2710.4930.3440.3910.3660.471RAFT: A Real-World Few-Shot Text Classification Benchmark
GPT-20.4980.6000.4580.1210.5610.2450.3800.4920.6120.7230.3110.498RAFT: A Real-World Few-Shot Text Classification Benchmark
AdaBoost0.8380.5430.5140.0230.6260.4750.4550.5060.5560.6250.4430.560RAFT: A Real-World Few-Shot Text Classification Benchmark
BART MNLI zero-shot0.4620.2340.3820.3320.6150.3600.6440.0260.4690.4000.5430.122RAFT: A Real-World Few-Shot Text Classification Benchmark
GPT-30.9370.6860.6270.2990.6790.4310.7690.5160.6560.8210.5260.574RAFT: A Real-World Few-Shot Text Classification Benchmark
GPT-Neo0.6810.4520.4810.1490.4080.3430.4060.4930.6050.6360.5540.565RAFT: A Real-World Few-Shot Text Classification Benchmark
Human (crowdsourced)0.9170.8300.7350.6070.8570.6460.9080.4680.6090.8970.7220.627RAFT: A Real-World Few-Shot Text Classification Benchmark
0 of 9 row(s) selected.