Hallucination Evaluation

12 papers with code • 0 benchmarks • 1 dataset

Evaluate the ability of LLMs to generate text free of hallucinations, or assess their capability to recognize hallucinations.

Most implemented papers

TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space

ictnlp/truthx 27 Feb 2024

During inference, TruthX effectively enhances the truthfulness of LLMs by editing their internal representations in the truthful space.
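
The excerpt describes inference-time editing of internal representations. Below is a minimal sketch of that general idea, not the actual TruthX implementation: a precomputed "truthful" direction is added to one layer's hidden states through a forward hook. The names `truthful_direction`, `edit_strength`, and `layer_idx`, and the use of GPT-2, are illustrative assumptions.

```python
# Hypothetical sketch of inference-time representation editing.
# Not the TruthX method: the edit direction here is a random stand-in,
# whereas TruthX learns it in a truthful latent space.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder model for illustration
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()

layer_idx = 6          # assumed layer to edit
edit_strength = 2.0    # assumed scaling factor
hidden_size = model.config.hidden_size
truthful_direction = torch.randn(hidden_size)
truthful_direction /= truthful_direction.norm()

def edit_hook(module, inputs, output):
    # GPT-2 blocks return a tuple whose first element is the hidden states;
    # shift them along the (assumed) truthful direction.
    hidden = output[0] + edit_strength * truthful_direction
    return (hidden,) + output[1:]

handle = model.transformer.h[layer_idx].register_forward_hook(edit_hook)

prompt = "The capital of Australia is"
inputs = tokenizer(prompt, return_tensors="pt")
with torch.no_grad():
    out = model.generate(**inputs, max_new_tokens=10)
print(tokenizer.decode(out[0], skip_special_tokens=True))

handle.remove()  # restore the unedited model
```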

PhD: A Prompted Visual Hallucination Evaluation Dataset

jiazhen-code/intrinsichallu 17 Mar 2024

The rapid growth of Large Language Models (LLMs) has driven the development of Large Vision-Language Models (LVLMs).