Paper: From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization
This project introduces EduVisAgent, a multi-agent framework designed to generate comprehensive educational visualizations from user queries, and EduVisBench, a benchmark for evaluating the quality of these visualizations. The system leverages a series of specialized language model (LM) agents that work collaboratively to create detailed teaching plans, improve them through iterative feedback, and build structured content for various presentation formats. The core of the system is the TeachAgentsIntermediateSystem, which coordinates the entire workflow.
- Multi-Agent System: A collaborative team of AI agents for generating educational content.
- Iterative Refinement: Agents provide feedback to each other to improve the quality of the generated content.
- Structured Output: Generates structured content suitable for various presentation formats.
- Evaluation Benchmark (EduVisBench): A comprehensive benchmark to assess the quality of pedagogical visualizations based on five key dimensions.
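At a high level, TeachAgentsIntermediateSystem runs a plan, critique, refine, and build loop over several LM agents. The sketch below is only a conceptual illustration of that loop; the helper names (`call_lm`, `generate_teaching_content`) are hypothetical and not the repository's actual API, whose real orchestration lives in `agents/` and `scripts/run_teach_intermediate.py`.

```python
# Conceptual sketch of the plan -> critique -> refine -> build loop.
# call_lm and generate_teaching_content are hypothetical names, not the repo's API.

def call_lm(role: str, prompt: str) -> str:
    """Placeholder for a language-model call made on behalf of one agent."""
    return f"[{role}] response to: {prompt[:50]}..."

def generate_teaching_content(question: str, rounds: int = 2) -> str:
    plan = call_lm("planner", f"Draft a teaching plan for: {question}")
    for _ in range(rounds):  # iterative refinement via agent feedback
        feedback = call_lm("reviewer", f"Critique this teaching plan:\n{plan}")
        plan = call_lm("planner", f"Revise the plan using this feedback:\n{feedback}\n\nPlan:\n{plan}")
    # A final agent turns the agreed plan into structured, presentation-ready content.
    return call_lm("builder", f"Convert the plan into structured slides/pages:\n{plan}")

if __name__ == "__main__":
    print(generate_teaching_content("Explain the concept of photosynthesis"))
```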
- Python 3.12
- OpenAI API Key
- Conda (recommended for environment management)
- Clone the repository:
  git clone https://github.com/aiming-lab/EduVisAgent
  cd EduVisAgent
- Create and activate a Conda environment:
  conda create -n eduvis python=3.12
  conda activate eduvis
- Install dependencies:
  bash install.sh
- Set up environment variables: export your OpenAI API key in your terminal session.
  export OPENAI_API_KEY="your_openai_api_key_here"
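The OpenAI Python client picks up OPENAI_API_KEY from the environment automatically, so no code changes are needed. A quick sanity check, assuming the openai v1 SDK (which install.sh presumably installs):

```python
import os
from openai import OpenAI

# OpenAI() reads OPENAI_API_KEY from the environment; fail early if it is missing.
assert os.environ.get("OPENAI_API_KEY"), "OPENAI_API_KEY is not set"

client = OpenAI()
print(client.models.list().data[0].id)  # any successful call confirms the key is valid
```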
Execute the main script to run the educational content generation system:
python scripts/run_teach_intermediate.py --question "Your educational question here"

By default, if no --question argument is provided, the script uses a predefined example query.
Output will be saved in the outputs/teach_intermediate/ directory, with subdirectory names containing a timestamp and a brief description of the query (e.g., outputs/teach_intermediate/YYYYMMDD_HHMMSS_explain_concept_photosynthesis/).
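For reference, the timestamp-plus-slug naming shown above can be reproduced with a few lines of standard-library Python. This is only an illustration of the convention, not the script's exact slugging logic:

```python
import re
from datetime import datetime
from pathlib import Path

def output_dir(question: str, root: str = "outputs/teach_intermediate") -> Path:
    """Build a timestamped output directory name in the YYYYMMDD_HHMMSS_<slug> style."""
    stamp = datetime.now().strftime("%Y%m%d_%H%M%S")
    slug = re.sub(r"[^a-z0-9]+", "_", question.lower()).strip("_")[:40]
    return Path(root) / f"{stamp}_{slug}"

print(output_dir("Explain the concept of photosynthesis"))
```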
For processing multiple educational questions from a dataset (e.g., train.jsonl), you can use the run_teach_intermediate_batch.py script. This script iterates through questions in a JSONL file and runs the full TeachAgentsIntermediateSystem for each.
python scripts/run_teach_intermediate_batch.py --data_file mydatasets/train.jsonl --limit 5
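The batch script expects one JSON object per line in the JSONL file. A minimal sketch of iterating such a file is shown below; the `question` field name is an assumption, so check your dataset's schema:

```python
import json
from pathlib import Path

def iter_questions(path: str, limit: int | None = None):
    """Yield question strings from a JSONL file (one JSON object per line)."""
    with Path(path).open(encoding="utf-8") as f:
        for i, line in enumerate(f):
            if limit is not None and i >= limit:
                break
            record = json.loads(line)
            yield record["question"]  # field name assumed, not a documented schema

for q in iter_questions("mydatasets/train.jsonl", limit=5):
    print(q)
```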
To evaluate visualizations against the benchmark, simply run the evaluation script:

python evaluation/run_evaluation.py

The script will automatically check for the EduVisBench dataset. If it's not found locally, it will be downloaded from Hugging Face before the evaluation begins. This includes the necessary JSON data and images.
The EduVisBench dataset is automatically downloaded from the Haonian/EduVisBench repository on Hugging Face when you run the evaluation script for the first time. For a detailed description of the dataset, including its structure, statistics, and data sources, please see DATASET.md.
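The download is handled inside run_evaluation.py, but if you prefer to pre-fetch the benchmark manually, something along these lines should work, assuming the huggingface_hub package is available and that evaluation/data/ is the target directory:

```python
from huggingface_hub import snapshot_download

# Pre-fetch the EduVisBench dataset (JSON data and images) from Hugging Face.
snapshot_download(
    repo_id="Haonian/EduVisBench",
    repo_type="dataset",
    local_dir="evaluation/data",  # assumed location; run_evaluation.py manages this itself
)
```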
EduVisAgent/
├── agents/ # Core agent logic
├── config/ # Configuration files
├── evaluation/ # Evaluation scripts and data
│ ├── data/ # Benchmark data (auto-downloaded)
│ └── run_evaluation.py # Script for running EduVisBench evaluation
├── image/ # Image resources for README
├── models/ # Language model interfaces
├── mydatasets/ # Directory for custom user datasets
├── scripts/ # Main executable scripts
├── utils/ # Utility functions and classes
├── install.sh # Installation script
└── README.md # This file
If you find this work useful, please cite our paper:
@article{ji2025eduvisbench,
title={From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization},
author={Ji, Haonian and Qiu, Shi and Xin, Siyang and Han, Siwei and Chen, Zhaorun and Wang, Hongyi and Zhang, Dake and Yao, Huaxiu},
journal={arXiv preprint arXiv:2505.16832},
year={2025}
}