Code to reproduce the experiments of the NeurIPS 2024 paper Questioning the Survey Responses of Large Language Models.
We survey language models with the American Community Survey. The experiment results can be downloaded from here.
Use the following Jupyter notebooks:
- Main text: figures.ipynb
- Appendix: appendix.ipynb
- Prompt ablations: prompt-ablations/
- Survey ablations: survey-ablations/
The relevant files to reproduce the experiments in the main text are:
- script_fill_individual.py: obtain language models' responses to individual survey questions for language models from HugginFace's model hub.
- script_fill_openai.py: obtain GPT-3's responses to individual survey questions.
For the sequential generation experiments in the appendix, use:
- script_fill_sequential.py: sample language models' responses to entire survey. questionnaires, where questions are presented sequentially while keeping previous answers in-context.
- discriminator_test.py: perform the discriminator test on the model generated data.
Please cite as:
@article{dominguez2024questioning,
title={Questioning the Survey Responses of Large Language Models},
author={Dominguez-Olmedo, Ricardo and Hardt, Moritz and Mendler-D{\"u}nner, Celestine},
journal={Advances in Neural Information Processing Systems},
year={2024}
}