paper link: https://arxiv.org/abs/2304.11411
The LCS datset and UKM movie database is available at Google Drive. Please apply for access by contacting [email protected] with your institutional email address and clearly state your institution, your research advisor (if any), and your use case of the data.
We also provided preprocessed data for the Kaggle dataset, which can be directly download from Google Drive.
conda env create -f environment.yml
would generate a conda environment called spoiler
that should be able to run the code.
First create the conda environment.
Download preprocessed data for the Kaggle dataset from Google Drive to the repository directory and unzip it.
cd src
python train.py --device <your_device_id>
If you find this dataset or codebase useful in your research, please cite the following paper.
@article{wang2023detecting,
title={Detecting Spoilers in Movie Reviews with External Movie Knowledge and User Networks},
author={Wang, Heng and Zhang, Wenqian and Bai, Yuyang and Tan, Zhaoxuan and Feng, Shangbin and Zheng, Qinghua and Luo, Minnan},
journal={arXiv preprint arXiv:2304.11411},
year={2023}
}