ConsumerBench

📑 Overview

ConsumerBench is a comprehensive benchmarking framework that evaluates the runtime performance of user-defined GenAI applications under realistic conditions on end-user devices.

🚀 Quick Start

# Clone the repository
git clone https://github.com/your-org/ConsumerBench.git
cd ConsumerBench

# Set up environment
conda create -n consumerbench python=3.10
conda activate consumerbench
pip install -r requirements.txt

# Run a sample benchmark
python3 benchmark_v2.py --benchmark workflow --config configs/workflow_imagegen.yml

📋 Repository Structure

ConsumerBench/
├── applications/           # Application repositories
├── configs/                # Example user configurations & workflows
├── scripts/                # Result processing and plotting scripts
├── benchmark_v2.py         # Core benchmark code
├── workflow.py             # Workflow class definition
└── globals.py              # Shared global variables

🧩 Supported Applications

💬 Chatbot

Text-to-text generation for chat and Q&A with:

Minimal frontend in benchmark_v2.py
Local backend mimicking OpenAI API
Powered by llama.cpp for efficient CPU-GPU co-execution

🔍 DeepResearch

Agent-based reasoning for complex fact gathering:

Built on open-deep-research framework
Served via LiteLLM
Located in applications/smolagents

🖼️ ImageGen

Text-to-image generation optimized for edge devices:

Utilizes stable-diffusion-webui in API mode
Located in applications/stable-diffusion-webui

🎙️ LiveCaptions

Audio-to-text transcription for real-time and offline use:

Whisper-based backend over HTTP
Front-end: applications/whisper_streaming/generate_raw_realtime.py
Back-end: applications/whisper_streaming/whisper_online_server.py

⚙️ Installation & Setup

Application Installation

Follow the README in each application directory for specific installation instructions.

Note: For llama.cpp, use these CMAKE flags:
-DGGML_CUDA=ON -DGGML_CUDA_F16=1 -DCMAKE_CUDA_ARCHITECTURES="75"

📊 Running Benchmarks

Note: Please change the paths of the applications, datasets and models to your personal paths.

Basic Benchmark

python3 benchmark_v2.py --benchmark workflow --config configs/workflow_imagegen.yml

Comprehensive Benchmark with System Metrics

./scripts/run_benchmark.sh configs/workflow_imagegen.yml 0

This script collects:

GPU metrics - Compute/memory bandwidth (DCGM)
CPU utilization - Via stat utility
CPU memory bandwidth - Via pcm-memory utility
GPU power - Via NVML utility
CPU power - Via RAPL utility

Results Analysis

Results are saved in the results directory with timestamps. PDF plots are automatically generated.

To modify Service Level Objectives (SLOs):

Chatbot: scripts/parse-results-chatbot-log.py
DeepResearch: scripts/parse-results-deepresearch-log.py
ImageGen: scripts/parse-results-imagegen-log.py
LiveCaptions: scripts/parse-results-whisper-log.py

📝 Experiment Configurations

Exclusive Execution

Application	Config
Chatbot	`configs/workflow_chatbot.yml`
LiveCaptions	`configs/workflow_live_captions.yml`
ImageGen	`configs/workflow_imagegen.yml`

CPU-only: Change device from "gpu" to "cpu" in the configs.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ConsumerBench

📑 Overview

🚀 Quick Start

📋 Repository Structure

🧩 Supported Applications

💬 Chatbot

🔍 DeepResearch

🖼️ ImageGen

🎙️ LiveCaptions

⚙️ Installation & Setup

Application Installation

📊 Running Benchmarks

Basic Benchmark

Comprehensive Benchmark with System Metrics

Results Analysis

📝 Experiment Configurations

Exclusive Execution

Concurrent Execution

Model Sharing (Inference Server)

End-to-End User Workflow

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
applications		applications
configs		configs
example_workflow		example_workflow
results		results
scripts		scripts
server_logs		server_logs
.gitmodules		.gitmodules
README.md		README.md
benchmark_v1.py		benchmark_v1.py
benchmark_v2.py		benchmark_v2.py
globals.py		globals.py
requirements.txt		requirements.txt
workflow.py		workflow.py

efeslab/ConsumerBench

Folders and files

Latest commit

History

Repository files navigation

ConsumerBench

📑 Overview

🚀 Quick Start

📋 Repository Structure

🧩 Supported Applications

💬 Chatbot

🔍 DeepResearch

🖼️ ImageGen

🎙️ LiveCaptions

⚙️ Installation & Setup

Application Installation

📊 Running Benchmarks

Basic Benchmark

Comprehensive Benchmark with System Metrics

Results Analysis

📝 Experiment Configurations

Exclusive Execution

Concurrent Execution

Model Sharing (Inference Server)

End-to-End User Workflow

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages