Few-Shot Text Classification on RAFT

Evaluation Metrics

Over
ADE
Avg
B77
NIS
OSE
SOT
SRI
TAI
TC
TEH
ToS

Evaluation Results

Performance of each model on this benchmark

| Model | Over | ADE | Avg | B77 | NIS | OSE | SOT | SRI | TAI | TC | TEH | ToS | Paper Title |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| GPT-3 zero-shot | 0.378 | 0.163 | 0.292 | 0.000 | 0.572 | 0.323 | 0.628 | 0.027 | 0.362 | 0.290 | 0.303 | 0.164 | RAFT: A Real-World Few-Shot Text Classification Benchmark |
| T-Few | 0.95 | 0.804 | 0.758 | 0.695 | 0.833 | 0.676 | 0.915 | 0.508 | 0.736 | 0.879 | 0.586 | 0.75 | Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning |
| Plurality-class | 0.337 | 0.446 | 0.331 | 0.000 | 0.353 | 0.164 | 0.271 | 0.493 | 0.344 | 0.391 | 0.366 | 0.471 | RAFT: A Real-World Few-Shot Text Classification Benchmark |
| GPT-2 | 0.498 | 0.600 | 0.458 | 0.121 | 0.561 | 0.245 | 0.380 | 0.492 | 0.612 | 0.723 | 0.311 | 0.498 | RAFT: A Real-World Few-Shot Text Classification Benchmark |
| AdaBoost | 0.838 | 0.543 | 0.514 | 0.023 | 0.626 | 0.475 | 0.455 | 0.506 | 0.556 | 0.625 | 0.443 | 0.560 | RAFT: A Real-World Few-Shot Text Classification Benchmark |
| BART MNLI zero-shot | 0.462 | 0.234 | 0.382 | 0.332 | 0.615 | 0.360 | 0.644 | 0.026 | 0.469 | 0.400 | 0.543 | 0.122 | RAFT: A Real-World Few-Shot Text Classification Benchmark |
| GPT-3 | 0.937 | 0.686 | 0.627 | 0.299 | 0.679 | 0.431 | 0.769 | 0.516 | 0.656 | 0.821 | 0.526 | 0.574 | RAFT: A Real-World Few-Shot Text Classification Benchmark |
| GPT-Neo | 0.681 | 0.452 | 0.481 | 0.149 | 0.408 | 0.343 | 0.406 | 0.493 | 0.605 | 0.636 | 0.554 | 0.565 | RAFT: A Real-World Few-Shot Text Classification Benchmark |
| Human (crowdsourced) | 0.917 | 0.830 | 0.735 | 0.607 | 0.857 | 0.646 | 0.908 | 0.468 | 0.609 | 0.897 | 0.722 | 0.627 | RAFT: A Real-World Few-Shot Text Classification Benchmark |