This repository contains the proof-of-concept project for identifying the correlation between the citation contexts of citing papers and the reproducibility of the cited (original) paper within the field of Artificial Intelligence. It provides the code and data needed to reproduce the results for the work titled "Can citations tell us about a paper's reproducibility? A case study of machine learning papers." The project requires Python with GPU-based processing capabilities and the TensorFlow, Keras, and PyTorch frameworks.
.
├── data # All the data files required to reproduce the results
├── documents # Documentation related files
├── notebooks # .ipynb notebook files
├── plots # Visualizations stored location
└── README.md
All the required dependencies are included in the requirements.txt file. To prevent dependency conflicts, do not install TensorFlow and Keras manually; installing keras-nlp via requirements.txt automatically downloads and installs compatible TensorFlow and Keras versions. The codebase has been tested with the Python library versions listed below; a quick version check is sketched after the list.
- tensorflow==2.16.1
- keras==3.1.1
- keras-core==0.1.7
- keras-nlp==0.8.2
- torch==1.13.0
- transformers==4.39.2
- pandas==2.0.3
- ipykernel==6.29.3
- openpyxl==3.1.2
- numpy==1.24.3
- scikit-learn==1.3.1
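As a sanity check, the snippet below (which only assumes the dependencies above are installed in the active virtual environment) prints the installed versions so they can be compared against the tested versions:

```python
# Quick sanity check of installed dependency versions.
# Minor version differences may still work, but the notebooks were
# tested with the versions listed above.
import tensorflow as tf
import keras
import keras_nlp
import torch
import transformers
import pandas as pd
import numpy as np
import sklearn

for name, module in [
    ("tensorflow", tf),
    ("keras", keras),
    ("keras-nlp", keras_nlp),
    ("torch", torch),
    ("transformers", transformers),
    ("pandas", pd),
    ("numpy", np),
    ("scikit-learn", sklearn),
]:
    print(f"{name}: {module.__version__}")
```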
All data files required to reproduce the results are available in the data directory.
- Clone the GitHub repository: https://github.com/lamps-lab/ccair-ai-reproducibility
- Create a Python virtual environment: https://docs.python.org/3/library/venv.html
- Activate the venv, navigate to the cloned repository, and install the dependencies from the requirements.txt file using `pip install -r requirements.txt`
- Use either the available data in the data directory or create the datasets from scratch by running the below Jupyter notebooks in sequential order (available inside the notebooks directory); a small data-inspection sketch follows this list.
  - R_001_Creating_the_RS_superset.ipynb
  - R_001_Extract_Citing_Paper_Details_from_S2GA.ipynb
  - R_001_JSON_to_csv_contexts_conversion.ipynb

  Note: If you are using the existing data in the data directory, you can skip this step.
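If you use the prepared data, a minimal way to inspect one of the files with pandas is sketched below; the file name is a placeholder, so substitute an actual CSV file from the data directory:

```python
# Minimal sketch: peek at one of the prepared data files.
# NOTE: "data/example_citation_contexts.csv" is a placeholder name;
# replace it with an actual file from the data directory.
import pandas as pd

df = pd.read_csv("data/example_citation_contexts.csv")
print(df.shape)              # number of rows and columns
print(df.columns.tolist())   # column names
print(df.head())             # first few records
```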
After the environment setup, execute the below Jupyter notebooks in sequential order (available inside the notebooks directory). A lightweight sketch of the 5-fold evaluation protocol follows this list.
- R_001_M1_to_M5_Sentiment_Analysis_models.ipynb
  - This will generate the performance measures for the selected five open-source multiclass sentiment analysis models (Table 3).
- R_001_M6_3_class_sentiment_classification.ipynb
  - This will custom-train a multiclass DistilBERT sentiment classifier and perform 5-fold cross-validation for model evaluation. At the end of the evaluation, it generates the predicted class labels {'negative', 'neutral', 'positive'} for all 41,244 citation contexts (Table 4).
- R_001_M7_1_binary_classification_related_not_related.ipynb
  - This will custom-train a binary classifier and perform 5-fold cross-validation for model evaluation. At the end of the evaluation, it generates the predicted class labels {'related', 'not-related'} for all 41,244 citation contexts (Table 4).
- R_001_M7_2_binary_sentiment_classification.ipynb
  - This will custom-train a binary classifier and perform 5-fold cross-validation for model evaluation. At the end of the evaluation, it generates the predicted class labels {'negative', 'positive'} for only the reproducibility-related citation contexts filtered by M7.1 (Table 4).
- R_001_Visualizations.ipynb
  - This will parse all the data files created by the previous notebooks and generate the results in Table 2, Figure 3, and Figure 4.
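For orientation, here is a minimal sketch of the 5-fold cross-validation protocol used for model evaluation in the notebooks above. The notebooks fine-tune DistilBERT-based classifiers; this sketch swaps in a lightweight TF-IDF + logistic regression stand-in with toy data, so it illustrates the evaluation loop only and is not the actual training code.

```python
# Sketch of the 5-fold cross-validation protocol used for model evaluation.
# The actual notebooks fine-tune DistilBERT classifiers; a TF-IDF +
# logistic regression stand-in and toy labels are used here purely to
# illustrate the evaluation loop.
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score
from sklearn.model_selection import StratifiedKFold
from sklearn.pipeline import make_pipeline

# Toy citation contexts and sentiment labels (placeholders, not project data).
texts = [
    "We could not reproduce the reported accuracy of the original model.",
    "Our results closely match those reported in the original paper.",
    "We follow the experimental setup described in this work.",
    "The released code failed to run on our hardware.",
    "Replication confirmed the main findings of the study.",
    "We adopt the same benchmark splits as prior work.",
] * 5
labels = np.array([0, 1, 1, 0, 1, 1] * 5)  # 0 = negative, 1 = positive

skf = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)
scores = []
for fold, (train_idx, test_idx) in enumerate(skf.split(texts, labels), start=1):
    model = make_pipeline(TfidfVectorizer(), LogisticRegression(max_iter=1000))
    model.fit([texts[i] for i in train_idx], labels[train_idx])
    preds = model.predict([texts[i] for i in test_idx])
    score = f1_score(labels[test_idx], preds, average="macro")
    scores.append(score)
    print(f"Fold {fold}: macro-F1 = {score:.3f}")

print(f"Mean macro-F1 across folds: {np.mean(scores):.3f}")
```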
@inproceedings{acm-rep-24,
author = {Obadage, Rochana R. and Rajtmajer, Sarah M. and Wu, Jian},
title = {SHORT: Can citations tell us about a paper's reproducibility? A case study of machine learning papers},
year = {2024},
isbn = {9798400705304},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3641525.3663628},
doi = {10.1145/3641525.3663628},
abstract = {The iterative character of work in machine learning (ML) and artificial intelligence (AI) and reliance on comparisons against benchmark datasets emphasize the importance of reproducibility in that literature. Yet, resource constraints and inadequate documentation can make running replications particularly challenging. Our work explores the potential of using downstream citation contexts as a signal of reproducibility. We introduce a sentiment analysis framework applied to citation contexts from papers involved in Machine Learning Reproducibility Challenges in order to interpret the positive or negative outcomes of reproduction attempts. Our contributions include training classifiers for reproducibility-related contexts and sentiment analysis, and exploring correlations between citation context sentiment and reproducibility scores. Study data, software, and an artifact appendix are publicly available at https://github.com/lamps-lab/ccair-ai-reproducibility.},
booktitle = {Proceedings of the 2nd ACM Conference on Reproducibility and Replicability},
pages = {96–100},
numpages = {5},
keywords = {Citation Contexts, Machine Learning, Reproducibility, Science of Science, Sentiment Analysis},
location = {Rennes, France},
series = {ACM REP '24}
}

Rochana R. Obadage
03/29/2024
