Skip to content
/ DyVo Public

EMNLP 2024: Dynamic Vocabularies For Learned Sparse Retrieval with Entities

License

Notifications You must be signed in to change notification settings

thongnt99/DyVo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DyVo

Codebase for the paper "DyVo: Dynamic Vocabularies for Learned Sparse Retrieval with Entities" EMNLP 2024

Steps to run the code:

  1. Create conda environment and install dependencies:

Create conda environment:

conda create --name lsr python=3.9.12
conda activate lsr

Install dependencies:

pip install -r requirements.txt
  1. Train and evaluatate a model
python -m lsr.train +experiment=qmlp_dmlm_emlm_laque_wapo_msmarco_pretrained_inparsv2_monot53b_distillation_l1_0.0_0.001_entw_0.05.yaml training_arguments.fp16=True 

The list of experiment configurations could be found inside the lsr/configs/experiment directory.

Citing and Authors

If you find this repository helpful, feel free to cite our paper

@inproceedings{nguyen-etal-2024-dyvo,
    title = "DyVo: Dynamic Vocabularies for Learned Sparse Retrieval with Entities",
    author = "Nguyen, Thong  and
      Chatterjee, Shubham  and
      MacAvaney, Sean  and
      Mackie, Iain  and
      Dalton, Jeff  and
      Yates, Andrew",
    booktitle = "Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing",
    month = nov,
    year = "2024"
}

About

EMNLP 2024: Dynamic Vocabularies For Learned Sparse Retrieval with Entities

Resources

License

Stars

Watchers

Forks

Languages