Code for "Boosting Visual Knowledge-Intensive Training for LVLMs through Causality-driven Visual Object Completion" (IJCAI 2025)
- Download the COCO dataset using LAVIS.
- Format the input into a JSON list. Each entry should contain:
{ "image": "image file", "text_input": "image caption" }
- Extract entities for each caption:
python cvc/data_preparation/1-0_entity_extractor.py
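The actual extraction logic lives in the script above; purely as an illustration of the idea, noun phrases can be pulled from a caption with spaCy (the model name is an assumption, and this is not necessarily the method used by the script):

```python
# Illustration only: extract candidate entities (noun chunks) from a caption with spaCy.
# The repo's 1-0_entity_extractor.py may use a different method entirely.
import spacy

nlp = spacy.load("en_core_web_sm")  # assumes the small English model is installed

caption = "A brown dog catching a frisbee on a grassy field"
doc = nlp(caption)
entities = [chunk.text for chunk in doc.noun_chunks]
print(entities)  # e.g. ['A brown dog', 'a frisbee', 'a grassy field']
```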
- Tag the causality for each entity:
python cvc/data_preparation/1-1_causality_tagger.py
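The tagging criterion is implemented by the script above. As a rough, hypothetical sketch of one way such tagging could work (prompting an LLM to judge how strongly an entity is implied by the rest of the caption), consider the following; the model name and prompt wording are assumptions, not the repo's:

```python
# Hypothetical sketch: ask an LLM to rate how strongly an entity is causally implied
# by the rest of the caption. Model name and prompt wording are assumptions.
from openai import OpenAI

client = OpenAI()  # expects OPENAI_API_KEY in the environment

def tag_causality(caption: str, entity: str) -> str:
    prompt = (
        f'Caption: "{caption}"\nEntity: "{entity}"\n'
        "If this entity were hidden in the image, how strongly could it be "
        "inferred from the remaining visual context? Answer with high or low."
    )
    response = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content.strip().lower()

print(tag_causality("A brown dog catching a frisbee on a grassy field", "frisbee"))
```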
- Use GLIP to detect bounding boxes of high-causality entities. Download the GLIP checkpoint and run the following script within the GLIP repository:
python cvc/data_preparation/2-1_detect_bbox.py
- Use SAM to mask high-causality objects:
python cvc/data_preparation/2-2_segment.py
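A minimal sketch of box-prompted segmentation with the segment-anything API is shown below; the checkpoint path, image path, and box coordinates are placeholders, and the repo's script handles the actual batching and mask application:

```python
# Minimal sketch of box-prompted segmentation with the segment-anything API.
# The checkpoint path, image path, and box coordinates are placeholders.
import cv2
import numpy as np
from segment_anything import sam_model_registry, SamPredictor

sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")
predictor = SamPredictor(sam)

# SAM expects an RGB uint8 image.
image = cv2.cvtColor(cv2.imread("example.jpg"), cv2.COLOR_BGR2RGB)
predictor.set_image(image)

# Bounding box from the GLIP step, in (x0, y0, x1, y1) pixel coordinates.
box = np.array([100, 150, 320, 400])
masks, scores, _ = predictor.predict(box=box, multimask_output=False)
print(masks.shape, scores)  # (1, H, W) boolean mask and its predicted quality
```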
- Generate the specific instruction for each high-causality entity:
python cvc/data_preparation/3_instruction_generator.py
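What a generated CVC instance might look like is sketched below; the wording and fields are purely hypothetical, and the real instructions come from the script above:

```python
# Purely hypothetical template; the real instructions come from
# 3_instruction_generator.py and may be phrased differently.
def build_cvc_instance(entity: str, masked_image: str) -> dict:
    instruction = (
        "One object in this image is hidden by a mask. Based on the visible "
        "context, reason step by step and identify the masked object."
    )
    return {"image": masked_image, "instruction": instruction, "answer": entity}

print(build_cvc_instance("frisbee", "000000123456_masked.jpg"))
```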
- Sample multiple rationales (trials) for each CVC instance:
python cvc/model_training/1_cot_generator_llava.py
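As an illustration of sampling several rationales at a non-zero temperature, the Hugging Face port of LLaVA-1.5 can be driven as below; the model id, prompt, and sampling settings are assumptions, and the repo's script uses its own setup:

```python
# Illustration only: sample multiple chain-of-thought trials for one CVC instance
# with the Hugging Face LLaVA-1.5 port. Model id, prompt, and sampling settings
# are assumptions; 1_cot_generator_llava.py uses its own configuration.
import torch
from PIL import Image
from transformers import AutoProcessor, LlavaForConditionalGeneration

model_id = "llava-hf/llava-1.5-7b-hf"
processor = AutoProcessor.from_pretrained(model_id)
model = LlavaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

image = Image.open("000000123456_masked.jpg")
prompt = (
    "USER: <image>\nOne object in this image is hidden by a mask. "
    "Reason step by step and tell me what the masked object is. ASSISTANT:"
)
inputs = processor(text=prompt, images=image, return_tensors="pt").to(model.device, torch.float16)

outputs = model.generate(
    **inputs, do_sample=True, temperature=0.7, max_new_tokens=256, num_return_sequences=4
)
trials = processor.batch_decode(outputs, skip_special_tokens=True)
```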
- Extract the final answer from each trial:
python cvc/model_training/2_answer_extractor.py
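A trivial, hypothetical way to pull a short answer out of a free-form rationale is a pattern match on the concluding sentence; the repo's script may use different heuristics:

```python
# Hypothetical illustration of final-answer extraction from a sampled rationale;
# 2_answer_extractor.py may use different heuristics.
import re

def extract_answer(rationale: str) -> str:
    match = re.search(
        r"(?:the answer is|it is)\s+(?:a |an |the )?([\w\s-]+?)[.\n]",
        rationale,
        re.IGNORECASE,
    )
    return match.group(1).strip() if match else rationale.strip().split("\n")[-1]

print(extract_answer("The shape and tail suggest an animal. The answer is a dog."))
```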
- Verify the correctness of each trial's answer using soft matching with the BGE-M3 embedding model:
python cvc/model_training/3_answer_checker.py
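The soft-matching idea is to count an extracted answer as correct when its BGE-M3 embedding is sufficiently close to the ground-truth entity. A minimal sketch with the FlagEmbedding package follows; the 0.8 threshold is illustrative, not the value used by the script:

```python
# Sketch of soft matching: an extracted answer counts as correct when its BGE-M3
# embedding is close enough to the ground-truth entity. The threshold is illustrative.
import numpy as np
from FlagEmbedding import BGEM3FlagModel

model = BGEM3FlagModel("BAAI/bge-m3")

def soft_match(predicted: str, reference: str, threshold: float = 0.8) -> bool:
    vecs = model.encode([predicted, reference])["dense_vecs"]
    a, b = vecs[0], vecs[1]
    cosine = float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))
    return cosine >= threshold

print(soft_match("a brown dog", "dog"))
```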
- Collect challenging successful CVC instances and construct the training data using hybrid formats. The resulting dataset is combined with the instruction data of LLaVA-1.5:
python cvc/model_training/4_hybrid_format.py
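A sketch of the final merge step is below; file names are placeholders, and the record schema shown is LLaVA's standard conversation format:

```python
# Sketch of the merge: append the CVC instances to the LLaVA-1.5 instruction data.
# File names are placeholders; both files are lists of LLaVA-style records such as
# {"id": ..., "image": ..., "conversations": [{"from": "human", "value": "<image>\n..."},
#                                             {"from": "gpt", "value": "..."}]}
import json

with open("cvc_train.json") as f:
    cvc_data = json.load(f)
with open("llava_v1_5_mix665k.json") as f:
    llava_data = json.load(f)

with open("mixed_train.json", "w") as f:
    json.dump(llava_data + cvc_data, f)
```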
- Download the pretrained checkpoint of LLaVA-1.5 and use the official LLaVA training script to train the model.
This project builds upon the excellent work of several open-source repositories. We sincerely thank the authors for their contributions:
- LLaVA: for the base LVLM architecture and training pipeline
- LAVIS: for dataset downloading
- GLIP: for object detection
Please make sure to install all required dependencies as specified in the respective repositories.