
MKA: Leveraging Cross-Lingual Consensus for Model Abstention

This repository provides the code for the paper MKA: Leveraging Cross-Lingual Consensus for Model Abstention.

The results reported in the paper can be accessed here. For Gemma 9B's performance on English with the low-resource auxiliary-language set, open English/results/low_res_gemma-2-9b-it_200_paraphrase-multilingual-mpnet-base-v2.json. The samples object contains the model prompts, responses, translations, similarity scores, and the MKA decision; the runs object holds stats for the experiments. For a confidence cutoff of 0.64 with this model-language combination, the stats are:

{
  "metrics": {
    "abstention_metrics": {
      "answered": 144,
      "abstentions": 56,
      "answer_rate": 0.72,
      "abstention_rate": 0.28,
      "correct_confidences": 72,
      "incorrect_confidences": 72,
      "correctly_abstained": 49,
      "incorrectly_abstained": 7,
      "answered_accuracy": 0.5,
      "correctly_abstained_rate": 0.875,
      "accuracy": 0.36,
      "effective_accuracy": 0.4356
    },
    "mean_confidence": 0.7677821376174688,
    "confidence_std": 0.2160755827962758,
  }
}
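
To read these numbers programmatically, here is a minimal sketch in Python that loads a results file and prints the abstention stats. It assumes the samples/runs layout described above; the exact nesting inside runs is an assumption, so adjust the key paths to match the file.

import json

# Path from the example above; swap in any model-language combination.
path = "English/results/low_res_gemma-2-9b-it_200_paraphrase-multilingual-mpnet-base-v2.json"

with open(path) as f:
    results = json.load(f)

# "samples" holds per-question prompts, responses, translations,
# similarity scores, and the MKA decision.
print(len(results["samples"]), "samples")

# "runs" holds experiment stats; assumed here to be one entry per
# confidence cutoff, each with a "metrics" block like the one shown above.
for run in results["runs"]:
    m = run["metrics"]["abstention_metrics"]
    print(m["answer_rate"], m["abstention_rate"], m["effective_accuracy"])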

Requirements

pip install --upgrade pip
pip install datasets ctranslate2 sentence-transformers "sglang[all]>=0.4.2.post2"
pip install sgl-kernel --force-reinstall --no-deps
pip install "sglang[all]>=0.4.2.post2" --find-links https://flashinfer.ai/whl/cu124/torch2.5/flashinfer/

Get the translation model

Quantize if necessary by adding --quantization int8:

ct2-transformers-converter --model facebook/nllb-200-distilled-1.3B --output_dir nllb-200-distilled-1.3B
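
Once converted, the model can be loaded through CTranslate2's standard NLLB translation workflow. A minimal sketch; the sentence, the language codes (eng_Latn, npi_Deva), and the device are illustrative:

import ctranslate2
from transformers import AutoTokenizer

# Tokenize with the original Hugging Face tokenizer; src_lang is a FLORES-200 code.
tokenizer = AutoTokenizer.from_pretrained(
    "facebook/nllb-200-distilled-1.3B", src_lang="eng_Latn"
)
translator = ctranslate2.Translator("nllb-200-distilled-1.3B", device="cpu")

source = tokenizer.convert_ids_to_tokens(
    tokenizer.encode("Where is the Eiffel Tower located?")
)
# NLLB expects the target language code as a prefix token; the first
# output token is that prefix, so strip it before decoding.
results = translator.translate_batch([source], target_prefix=[["npi_Deva"]])
target = results[0].hypotheses[0][1:]
print(tokenizer.decode(tokenizer.convert_tokens_to_ids(target)))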

Log into Hugging Face for gated model access:

huggingface-cli login --token $HF_TOKEN

Run

Supply the number of samples and the seed to use.

bash run.sh 200 97 # n_samples, seed

Custom Setup

Edit config.py to change the prompting models, the similarity model, the answer-similarity threshold, and the prompt to use (see the sketch below).
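
The field names below are hypothetical (check config.py for the actual ones); the values are taken from the example above. A sketch of the kinds of settings involved:

# Hypothetical config.py; the real variable names in this repo may differ.
PROMPTING_MODELS = ["google/gemma-2-9b-it"]  # models to prompt (served via sglang)
SIMILARITY_MODEL = "paraphrase-multilingual-mpnet-base-v2"  # sentence-transformers encoder
SIMILARITY_THRESHOLD = 0.64  # answers at or above this similarity count as agreeing
PROMPT = "Answer the following question concisely: {question}"  # hypothetical template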

Citation

If you find this work useful, please cite:

@misc{duwal2025mka,
      title={MKA: Leveraging Cross-Lingual Consensus for Model Abstention}, 
      author={Sharad Duwal},
      year={2025},
      eprint={2503.23687},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2503.23687}, 
}

Acknowledgement

This work is part of the meta-study for the AI Researcher Project at the Stanford NLP Group, which supported it.

March 2025
