Auto-GPT Benchmarks

This repo has been deprecated. The benchmark code is now in the main AutoGPT repo: https://github.com/Significant-Gravitas/Auto-GPT

Auto-GPT Benchmarks is built to benchmark the performance of agents, regardless of how they work.

Know objectively how well your agent performs in categories such as code, retrieval, memory, and safety.

Save time and money while doing it through smart dependencies. The best part? It's all automated.
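
This README does not spell out how "smart dependencies" work. One plausible reading is that a challenge is skipped when a challenge it depends on has not passed, so no agent run (and no API cost) is spent on a test that is unlikely to succeed. The sketch below illustrates that idea only. The names `run_challenges`, `Challenges`, and the example challenge names are hypothetical and are not taken from the benchmark code.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical structure: challenge name -> (prerequisite names, check function).
# The check function stands in for an actual agent run.
Challenges = Dict[str, Tuple[List[str], Callable[[], bool]]]


def run_challenges(challenges: Challenges) -> Dict[str, str]:
    """Run challenges in insertion order, skipping any whose prerequisites did not pass.

    Assumes prerequisites are listed before the challenges that depend on them.
    """
    results: Dict[str, str] = {}
    for name, (deps, check) in challenges.items():
        # A prerequisite that failed, was skipped, or is unknown blocks this challenge.
        if any(results.get(dep) != "passed" for dep in deps):
            results[name] = "skipped"  # check() is never called, so nothing is spent
            continue
        results[name] = "passed" if check() else "failed"
    return results


# Illustrative checks only; a real run would invoke the agent here.
example: Challenges = {
    "write_file": ([], lambda: True),
    "read_file": (["write_file"], lambda: False),
    "summarize_file": (["read_file"], lambda: True),
}

if __name__ == "__main__":
    print(run_challenges(example))
```

In this example `summarize_file` is never executed, because `read_file` failed before it.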

More agents coming soon!

.github
.vscode
agbenchmark
agent
backend
frontend @ c5c3662
notebooks
paper
reports
.env.example
.flake8
.gitignore
.gitmodules
.pre-commit-config.yaml
.python-version
LICENSE
README.md
mypy.ini
poetry.lock
pyproject.toml
run.sh
server.py