Command Palette

Search for a command to run...

Question Answering On Mapeval Api 1

평가 지표

Accuracy (%)

평가 결과

이 벤치마크에서 각 모델의 성능 결과

Paper TitleRepository
Claude-3.5-Sonnet (ReAct)64.00MapEval: A Map-Based Evaluation of Geo-Spatial Reasoning in Foundation Models
GPT-3.5-Turbo (Chameleon)49.33MapEval: A Map-Based Evaluation of Geo-Spatial Reasoning in Foundation Models
0 of 2 row(s) selected.