Question Answering on UniProtQA
Metrics
BLEU-2
BLEU-4
METEOR
ROUGE-1
ROUGE-2
ROUGE-L
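
The benchmark's official scoring script is not reproduced here; as a rough illustration, the sketch below computes the same six metrics with the `nltk` and `rouge-score` packages. Whitespace tokenization and smoothed sentence-level BLEU are assumptions on my part, so the actual evaluation may tokenize and aggregate differently.

```python
import nltk
from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction
from nltk.translate.meteor_score import meteor_score
from rouge_score import rouge_scorer

nltk.download("wordnet", quiet=True)  # METEOR needs WordNet data

def score_answer(prediction: str, reference: str) -> dict:
    """Score one generated answer against one reference answer."""
    pred_tokens = prediction.split()   # assumption: simple whitespace tokenization
    ref_tokens = reference.split()
    smooth = SmoothingFunction().method1  # avoids zero BLEU on short answers

    scores = {
        # BLEU-n: geometric mean of 1..n-gram precisions with a brevity penalty
        "BLEU-2": sentence_bleu([ref_tokens], pred_tokens,
                                weights=(0.5, 0.5), smoothing_function=smooth),
        "BLEU-4": sentence_bleu([ref_tokens], pred_tokens,
                                weights=(0.25, 0.25, 0.25, 0.25),
                                smoothing_function=smooth),
        # METEOR: unigram matching with stemming and WordNet synonyms
        "METEOR": meteor_score([ref_tokens], pred_tokens),
    }
    # ROUGE-1/2: n-gram overlap F1; ROUGE-L: longest-common-subsequence F1
    rouge = rouge_scorer.RougeScorer(["rouge1", "rouge2", "rougeL"],
                                     use_stemmer=True).score(reference, prediction)
    scores["ROUGE-1"] = rouge["rouge1"].fmeasure
    scores["ROUGE-2"] = rouge["rouge2"].fmeasure
    scores["ROUGE-L"] = rouge["rougeL"].fmeasure
    return scores

print(score_answer("the protein binds ATP", "this protein binds ATP and GTP"))
```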
Results
Performance of various models on the UniProtQA benchmark.
Comparison Table
Model Name | BLEU-2 | BLEU-4 | METEOR | ROUGE-1 | ROUGE-2 | ROUGE-L |
---|---|---|---|---|---|---|
Llama 2 (Open Foundation and Fine-Tuned Chat Models) | 0.019 | 0.002 | 0.052 | 0.103 | 0.060 | 0.009 |
BioMedGPT (Open Multimodal Generative Pre-trained Transformer for BioMedicine) | 0.571 | 0.535 | 0.754 | 0.743 | 0.759 | 0.622 |