Files:
- README.md
- dataset.py
- dataset_halluqa.json
- dataset_halluqa_mc.json
- eval_base.py
- eval_halluqa_mc.py

HalluQA

Information

Paper: Evaluating Hallucinations in Chinese Large Language Models
Institutions:
- Fudan University
- Shanghai AI Laboratory
arXiv: https://arxiv.org/abs/2310.03368
GitHub: https://github.com/OpenMOSS/HalluQA

Evaluators

| Evaluator | Metric | Description |
| --- | --- | --- |
| TODO | TODO | Generation task |
| `HalluQAMCEvaluator` | Accuracy | Multi-choice task |
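
As a rough illustration of the accuracy metric listed for the multi-choice task, the sketch below scores predicted option labels against reference labels. It is not the `HalluQAMCEvaluator` implementation: the function name `multiple_choice_accuracy` is hypothetical, and it assumes each prediction and each reference answer is a single option label such as `"A"`.

```python
from typing import Sequence


def multiple_choice_accuracy(predictions: Sequence[str], answers: Sequence[str]) -> float:
    """Return the fraction of predictions that match the reference option label.

    Hypothetical helper for illustration; labels are compared after
    trimming whitespace and ignoring case.
    """
    if len(predictions) != len(answers):
        raise ValueError("predictions and answers must have the same length")
    if not answers:
        # No questions to score; report 0.0 rather than dividing by zero.
        return 0.0
    correct = sum(
        pred.strip().upper() == gold.strip().upper()
        for pred, gold in zip(predictions, answers)
    )
    return correct / len(answers)


if __name__ == "__main__":
    # Two of the three predicted labels match the references.
    print(multiple_choice_accuracy(["A", "b", "C"], ["A", "B", "D"]))
```

Normalizing case and whitespace is a design choice made here for robustness; an evaluator that requires exact label matches would skip that step.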

Citation

@article{DBLP:journals/corr/abs-2310-03368,
  author       = {Qinyuan Cheng and
                  Tianxiang Sun and
                  Wenwei Zhang and
                  Siyin Wang and
                  Xiangyang Liu and
                  Mozhi Zhang and
                  Junliang He and
                  Mianqiu Huang and
                  Zhangyue Yin and
                  Kai Chen and
                  Xipeng Qiu},
  title        = {Evaluating Hallucinations in Chinese Large Language Models},
  journal      = {CoRR},
  volume       = {abs/2310.03368},
  year         = {2023},
  url          = {https://doi.org/10.48550/arXiv.2310.03368},
  doi          = {10.48550/arXiv.2310.03368},
  eprinttype   = {arXiv},
  eprint       = {2310.03368},
  timestamp    = {Thu, 19 Oct 2023 13:12:52 +0200},
  biburl       = {https://dblp.org/rec/journals/corr/abs-2310-03368.bib},
  bibsource    = {dblp computer science bibliography, https://dblp.org}
}
