Improving Event Representation Learning via Generating and Utilizing Synthetic Data

We are pleased to release the official implementation of our paper titled "Improving Event Representation Learning via Generating and Utilizing Synthetic Data", which was submitted to the journal of Information Processing & Management.

News

Dec 12 2024, the code, dataset and checkpoints are coming soon!
Dec 14 2024, the code has been released, the dataset and checkpoints are coming soon!
Jan 27 2025, the paper has been accepted by Information Processing & Management! 🎉🎉🎉

Quick Start

Installation

To run a docker container:

docker run ubuntu:22.04

To install pip requirements:

pip3 install \
  texar-pytorch \
  torch==1.13.1 \
  tensorflow==2.14 \
  numpy==1.26.4 \
  nltk \
  faiss-cpu \
  tiktoken \
  jupyter \
  matplotlib \
  openai \
  scipy \
  scikit-learn

Synthetic

python3 syn-dat.py \
  --anchor /data/train.json \
  --atomic /data/atomic/v4_atomic_all.csv \
  --output /data/out \
  --api-key API-KEY \
  --prompt analogy-reasoner.j2

Train

python3 main.py \
  --do-train \
  --output-dir /data/out

Test

python3 main.py \
  --do-eval \
  --checkpoint /data/checkpoint.pt

Acknowledgement

The code is developed based on SWCC. We appreciate all the authors who made their code public, which greatly facilitates this project.

Citation

@article{feng2025improving,
  title = {Improving event representation learning via generating and utilizing synthetic data},
  author = {Yubo Feng and Lishuang Li and Xueyang Qin and Beibei Zhang},
  journal = {Information Processing & Management},
  volume = {62},
  number = {4},
  pages = {104083},
  year = {2025},
  issn = {0306-4573},
  doi = {https://doi.org/10.1016/j.ipm.2025.104083},
  url = {https://www.sciencedirect.com/science/article/pii/S0306457325000251},
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
analogy-reasoner.j2		analogy-reasoner.j2
config_data.py		config_data.py
config_model.py		config_model.py
data_utils.py		data_utils.py
main.py		main.py
mcnc.py		mcnc.py
misc_utils.py		misc_utils.py
model.py		model.py
syn-dat.py		syn-dat.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Improving Event Representation Learning via Generating and Utilizing Synthetic Data

News

Quick Start

Installation

Synthetic

Train

Test

Acknowledgement

Citation

About

Releases

Packages

Languages

YuboFeng2023/LLM-CL

Folders and files

Latest commit

History

Repository files navigation

Improving Event Representation Learning via Generating and Utilizing Synthetic Data

News

Quick Start

Installation

Synthetic

Train

Test

Acknowledgement

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages