Add DeepAnalyze system by LIUyizheSDU · Pull Request #26 · mitdbg/Kramabench

LIUyizheSDU · 2026-02-09T08:52:40Z

This pull request introduces the new DeepAnalyze System (KramaBench) and provides all necessary scripts, configuration, and documentation to integrate, evaluate, and run it within the project. The changes include adding environment setup and execution scripts, a detailed README, a prompt template for the system, and registering the system within the project.

Key additions and changes:

DeepAnalyze System Integration:

Registered the DeepAnalyze system in the project by updating systems/__init__.py to import it, making it available for evaluation workflows.

Execution and Evaluation Scripts:

Added run.sh for running DeepAnalyze across multiple or specific workloads, including logic to clean the data directory using a snapshot, run evaluations in parallel, and summarize scores.
Added eval_response_cache.sh to re-evaluate cached responses without regenerating outputs, facilitating fast recomputation of scores.

System Prompt:

Introduced prompt.py containing the DeepAnalyze prompt template, specifying instructions and answer formatting for the system's operation.

Documentation:

Added a README.md for DeepAnalyze, explaining environment setup, running workloads, output locations, and cache evaluation.

Add DeepAnalyze system

LIUyizheSDU and others added 3 commits February 9, 2026 16:15

Add DeepAnalyze system scaffold and package export

7a0b485

Merge pull request #1 from LIUyizheSDU/feat-deepanalyze

8878fde

Add DeepAnalyze system

Merge branch 'mitdbg:main' into main

62bb252

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add DeepAnalyze system#26

Add DeepAnalyze system#26
LIUyizheSDU wants to merge 3 commits intomitdbg:mainfrom
LIUyizheSDU:main

LIUyizheSDU commented Feb 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

LIUyizheSDU commented Feb 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant