Official code for "Constructing Word-Context-Coupled Space Aligned with Associative Knowledge Relations for Interpretable Language Modeling", Findings in ACL2023.
You can find our paper on Link.
Cite as: Fanyu Wang and Zhenping Xie. 2023. Constructing Word-Context-Coupled Space Aligned with Associative Knowledge Relations for Interpretable Language Modeling. In Findings of the Association for Computational Linguistics: ACL 2023, pages 8414–8427, Toronto, Canada. Association for Computational Linguistics.
Fanyu Wang: Personal Page
Zhenping Xie: Personal Page
ColeGroup (Chinese Only): Home Page
You can customize the settings in config.py, where "mode" refer to sentiment classification and spelling correction tasks.
You can use your personal dataset for AKN initialization or you can find our AKN weight in ./akn/akn_download.txt.
Initialization py file for finetuning BERT model and training for mapping network.
Clustering py file for abstraction of context-level semantics.
Tasks completion py files for sentiment classification and spelling correction tasks.
CHNST dataset for sentiment classification task. We do preprocessing operation.
SIGHAN15 dataset for spelling correction task. Evaluation only.
Weibo dataset for sentiment classification task. We do preprocessing operation.
We upload our preprocessed training dataset on Google Drive. The trainset are constructed based on HyBird and SIGHAN15-Trainset.