HalluQA Chinese Large Model Hallucination Evaluation Dataset
Date
2 years ago

This repository contains the data and evaluation scripts for the HalluQA (Chinese Hallucination Question-Answering) benchmark. The full data for HalluQA is in HalluQA.json. The paper introducing HalluQA reports detailed experimental results on several Chinese large language models. HalluQA contains 450 carefully designed adversarial questions that span multiple domains and take into account Chinese historical culture, customs, and social phenomena.
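As a quick sanity check after downloading, the data file can be loaded and its entries counted. The sketch below is a minimal illustration and not part of the repository's evaluation scripts. It assumes that HalluQA.json is UTF-8 encoded and that its top level is a JSON list with one record per question; the record fields are not described here, so the code does not rely on any of them. The helper name `load_questions` is hypothetical.

```python
import json
from pathlib import Path


def load_questions(path):
    """Load a benchmark file and return its records as a list.

    Assumption: the top level of the JSON file is a list of records.
    Any other top-level type is rejected rather than guessed at.
    """
    with Path(path).open(encoding="utf-8") as f:
        data = json.load(f)
    if not isinstance(data, list):
        raise ValueError(f"expected a JSON list, got {type(data).__name__}")
    return data


if __name__ == "__main__":
    data_file = Path("HalluQA.json")  # file name taken from the description above
    if data_file.exists():
        questions = load_questions(data_file)
        # The description states 450 questions; compare against the actual count.
        print(f"Loaded {len(questions)} entries")
    else:
        print(f"{data_file} not found; download it from the repository first")
```

If the printed count differs from 450, the file may be incomplete or may use a different top-level layout than assumed here.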