Bumper: Automatic Program Repair For Breaking Dependency Updates With Large Language Models

External libraries are widely used to expedite software development, but like any software component, they are updated over time, introducing new features and deprecating or removing old ones. When a library release introduces breaking changes, all of its clients must be updated to avoid disruptions; such an update is known as a breaking dependency update. Repairing these breakages is challenging and time-consuming because the error originates in the dependency, while the fix must be applied to the client codebase.

Automatic Program Repair (APR) is a research area focused on developing techniques to repair code failures without human intervention. With the advent of Large Language Models (LLMs), learning-based APR techniques have made significant progress on software repair tasks. However, their effectiveness on breaking dependency updates remains unexplored.

Bumper is an LLM-based APR pipeline for breaking dependency updates.

Publications:

Repository Contents

  • 📁 benchmarks/: Configuration scripts and base directory for benchmark files

  • 📁 libs/: Source code of the tools used for Fault Localization (FL) and context extraction (API Diffs)

  • 📁 pipeline/: Source code for the APR pipelines

  • 📁 prompts/: Prompt templates used in the different pipeline configurations

  • 📊 results/: Experimental results and analysis

  • ⚙️ benchmark.py: Python script to run a specific benchmark configuration

  • ⚙️ main.py: Python script to run a specific project, intended for debugging

  • ⚙️ replay.py: Python script to generate a patched version of a client from a result file

  • ⚡ run_experiments.bash: Bash script to run all the experiments sequentially

  • ⚡ run_experiments-parallel.bash: Bash script to run all the experiments in parallel

  • 🎛️ setup.bash: Setup script to clone the benchmark repository and perform dataset selection

  • 📄 README.md: This file.

Setup and Installation

To set up the project locally, follow these steps:

  1. Clone the repository:

    git clone https://github.com/chains-project/bumper.git
    cd bumper
  2. Create and activate a virtual environment:

    python -m venv venv
    source venv/bin/activate  # On Windows, use `venv\Scripts\activate`
  3. Install the required dependencies:

    pip install -r requirements.txt
  4. Set up the benchmark dataset:

    bash setup.bash
  5. Set up environment variables (an illustrative sketch of the file is shown after these steps):

    cp .env.example .env
  6. To use Gemini, store the Google Cloud API credentials file (g_credentials.json) in the root folder of the project.
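
As an illustrative sketch of steps 5 and 6, the environment file typically holds the API credentials for the models under evaluation. The variable names below are assumptions; the authoritative list is in .env.example.

    # Hypothetical variable names -- check .env.example for the actual keys
    OPENAI_API_KEY=sk-...   # credential used for the GPT-4 pipeline
    LLAMA_API_KEY=...       # credential used for the Llama pipeline
    # Gemini authenticates through g_credentials.json in the repository root (step 6)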

Usage

Run all the experiments

  1. To run the complete experiment set in sequence:

    bash run_experiments.bash :name
  2. Or to run the complete experiment set in parallel (at most 4 processes; example invocations follow this list):

    bash run_experiments-parallel.bash :name
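
For example, assuming a run named baseline (the name is only an illustrative label for the execution):

    bash run_experiments.bash baseline
    # or, with at most 4 processes at a time:
    bash run_experiments-parallel.bash baseline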

Run a specific experiment

To run a specific experiment, use the benchmark.py script with the required flags:

[RUN_ID=:id] [WITHOUT_APIDIFF=True] python benchmark.py -n :name -p :pipeline -m :model

IMPORTANT: When running multiple experiments in parallel, remember to set the RUN_ID environment variable to identify each execution and avoid collisions in the repair process.
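
As a sketch, two experiments can run side by side as long as their RUN_ID values differ; the :name, :pipeline, and :model placeholders below stand for the identifiers accepted by benchmark.py:

    # Only RUN_ID needs to differ between concurrent executions
    RUN_ID=run-a python benchmark.py -n :name -p :pipeline -m :model &
    RUN_ID=run-b python benchmark.py -n :name -p :pipeline -m :model &
    wait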

Results

The results of our experiments can be found in the results directory. A complete data analysis with charts is provided in the analysis Jupyter notebook. Key findings include:

  • The necessity of incorporating additional context from dependency changes.
  • The importance of error-type specific repair strategies.
  • Comparative analysis of GPT-4, Gemini, and Llama in terms of efficacy and cost-efficiency.

Figure: RQ4_projects.png

Contributing

Contributions are welcome! Please submit a pull request or open an issue to discuss your ideas or suggestions.

Contact

For any questions or inquiries, please contact us.