HyperAI超神経

Few Shot Text Classification On Raft

評価指標

Over
ADE
Avg
B77
NIS
OSE
SOT
SRI
TAI
TC
TEH
ToS

評価結果

このベンチマークにおける各モデルのパフォーマンス結果

モデル名
Over
ADE
Avg
B77
NIS
OSE
SOT
SRI
TAI
TC
TEH
ToS
Paper TitleRepository
GPT-3 zero-shot0.3780.1630.2920.0000.5720.3230.6280.0270.3620.2900.3030.164RAFT: A Real-World Few-Shot Text Classification Benchmark
T-Few0.950.8040.7580.6950.8330.6760.9150.5080.7360.8790.5860.75Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning
Plurality-class0.3370.4460.3310.0000.3530.1640.2710.4930.3440.3910.3660.471RAFT: A Real-World Few-Shot Text Classification Benchmark
GPT-20.4980.6000.4580.1210.5610.2450.3800.4920.6120.7230.3110.498RAFT: A Real-World Few-Shot Text Classification Benchmark
AdaBoost0.8380.5430.5140.0230.6260.4750.4550.5060.5560.6250.4430.560RAFT: A Real-World Few-Shot Text Classification Benchmark
BART MNLI zero-shot0.4620.2340.3820.3320.6150.3600.6440.0260.4690.4000.5430.122RAFT: A Real-World Few-Shot Text Classification Benchmark
GPT-30.9370.6860.6270.2990.6790.4310.7690.5160.6560.8210.5260.574RAFT: A Real-World Few-Shot Text Classification Benchmark
GPT-Neo0.6810.4520.4810.1490.4080.3430.4060.4930.6050.6360.5540.565RAFT: A Real-World Few-Shot Text Classification Benchmark
Human (crowdsourced)0.9170.8300.7350.6070.8570.6460.9080.4680.6090.8970.7220.627RAFT: A Real-World Few-Shot Text Classification Benchmark
0 of 9 row(s) selected.