Date

2 years ago

Size

982.62 KB

Organization

Publish URL

github.com

Paper URL

arxiv.org

Tags

Intelligent Question Answering

LLM

Natural Language Processing

AI for Science

Model Training

With the rapid development of large language models (LLMs), more and more people have begun to worry about the risks they may bring. As a result, the "safety and alignment" of large models has received great attention.

100PoisonMpts is the industry's first open-source Chinese dataset for large language model governance. The first group of annotators for "100 bottles of poison for AI" consists of more than a dozen well-known experts and scholars, including environmental sociology expert Fan Yechao, famous sociologist Li Yinhe, psychologist Li Songwei, and human rights law expert Liu Xiaonan. Each annotator posed 100 tricky questions designed to induce biased or discriminatory answers, then annotated the large model's answers, completing an attack-and-defense cycle with AI from "poisoning" to "detoxification".

The first batch of domain data centers on AI anti-discrimination, empathy, and deliberative expression. It covers dimensions such as jurisprudence, psychology, children's education, accessibility, little-known facts, intimate relationships, and environmental fairness, and includes the questions raised by the experts together with answers written or approved by them.

The research team explored the self-alignment of large models based on expert principles. For the specific methods and experimental analysis, refer to the technical report "Self-alignment of Large Models Based on Expert Principles". The team also proposed a benchmark for assessing the values of Chinese large language models, based on two evaluation criteria: safety and responsibility. For more information, read the paper "CValues: Measuring the Values of Chinese Large Language Models from Safety to Responsibility".

Citation

@misc{xu2023cvalues, title={CValues: Measuring the Values of Chinese Large Language Models from Safety to Responsibility}, author={Guohai Xu and Jiayi Liu and Ming Yan and Haotian Xu and Jinghui Si and Zhuoran Zhou and Peng Yi and Xing Gao and Jitao Sang and Rong Zhang and Ji Zhang and Chao Peng and Fei Huang and Jingren Zhou}, year={2023}, eprint={2307.09705}, archivePrefix={arXiv}, primaryClass={cs.CL} }

100PoisonMpts.torrent

Seeding: 1, Downloading: 0, Completed: 251, Total Downloads: 603

100PoisonMpts/
- README.md
  2.07 KB
- README.txt
  4.13 KB

This dataset is contributed by community users and is intended for educational and informational purposes only. If any content involves copyright infringement, please contact us at [email protected] for prompt review and removal.

Related Datasets

MAKIEVAL Multilingual Cultural Knowledge Assessment Dataset

21 hours ago

Verbatim Spans Query Condition Evidence Extraction Dataset

in 3 hours

RHELM Long-Term Memory Assessment Dataset

2 days ago

SAM 3D Artist Objects 3D Object Reconstruction Dataset

7 days ago

FigureBench Scientific Illustration Generation Benchmark Dataset

9 days ago

Movie Feelings Dataset

14 days ago

EAVSD E-commerce Advertising Video Storyboard Dataset

a month ago

chi-bench Medical Intelligent Agent Benchmark Evaluation Dataset

15 days ago

QCalEval Quantum Calibration Graph Understanding Dataset

2 months ago

Transfermarkt Football Dataset

2 months ago

MDPBench Multilingual Document Parsing Benchmark Dataset

10 days ago

ToolACE Complex Tools Learning Dialogue Dataset

3 months ago
