Data and code for the paper, Quantifying the Uniqueness of Donald Trump in Presidential Discourse.
@misc{zhou2024quantifying,
title={Quantifying the Uniqueness of Donald Trump in Presidential Discourse},
author={Karen Zhou and Alexander A. Meitus and Milo Chase and Grace Wang and Anne Mykland and William Howell and Chenhao Tan},
year={2024},
eprint={2401.01405},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
If you use the data please also cite the American Presidency Project:
@article{woolley1999american,
title={The American presidency project},
author={Woolley, [dataset] John T and Peters, Gerhard},
journal={Santa Barbara, CA. Available from World Wide Web: http://www. presidency. ucsb. edu/ws},
year={1999}
}
We share our sentence-delimited corpora of presidential debates, State of the Union addresses, and campaign speeches that we scrape and process from the American Presidency Project.
Please see data/README.md for more details.
We define language as "divisive" if it intends to impugn and delegitimize the speaker's target, e.g., by attacking their intelligence, integrity, or intentions. Such labels are expressly designed to put the target on defense and accentuate differences and distance between parties.
The methodology of lexicon construction is described in our paper. In this repo, we provide two files:
majority-words.txt
- contains the 178 terms that ≥3 out of 4 annotators agree are divisive. This set is used for analysis in the paper.unanimous-words.txt
- contains the 123 terms that all 4 annotators agree are divisive.
Model training: see models/README.md
Get scores from model: run analysis/score_uniqueness.py
. See analysis/run_score_uniqueness.sh
for an example of how to run this script.
Plot scores: run analysis/plot_uniqueness.py
. See analysis/run_plot_uniqueness.sh
for an example of how to run this script.
Run analysis/plot_divisive_lexicon.py
. See analysis/run_plot_lexicon.sh
for an example of how to run this script.
Run analysis/plot_fw_overlap.py
. See analysis/run_fw_overlap.sh
for an example of how to run this script.