
beam-cloud/beta9


Ultrafast AI Inference

Scalable Infrastructure for Running Your AI Workloads


Installation

pip install beam-client

Features

  • Extremely Fast: Launch containers in 200ms using a custom runc runtime
  • Parallelization and Concurrency: Fan out workloads to 100s of containers
  • First-Class Developer Experience: Hot-reloading, webhooks, and scheduled jobs
  • Scale-to-Zero: Workloads are serverless by default
  • Volume Storage: Mount distributed storage volumes
  • GPU Support: Run on our cloud (4090s, H100s, and more) or bring your own GPUs
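The "Parallelization and Concurrency" feature above is the classic fan-out pattern: each input is processed independently, so work can spread across many workers (on Beam, many containers). As a local, stdlib-only sketch of the same idea, not Beam's container API:

```python
from concurrent.futures import ThreadPoolExecutor

def classify(x: int) -> int:
    # Stand-in for per-item inference work
    return x * x

def fan_out(inputs):
    # Submit every input to the pool; results come back in input order
    with ThreadPoolExecutor(max_workers=8) as pool:
        return list(pool.map(classify, inputs))
```

Beam lifts this pattern from threads to containers, so `fan_out` over 1,000 inputs becomes 1,000 container invocations instead of 8 local threads.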

Quickstart

  1. Create an account at https://beam.cloud
  2. Follow our Getting Started Guide

Creating your first inference endpoint

With Beam, everything is Python-native—no YAML, no config files, just code:

from beam import Image, endpoint


@endpoint(
    image=Image(python_version="python3.11"),
    gpu="A10G",
    cpu=1,
    memory=1024,
)
def handler():
    return {"label": "cat", "confidence": 0.97}

Deploying this snippet gives you a GPU-backed container behind an HTTPS endpoint, ready to serve requests immediately (e.g. https://my-model-v1.app.beam.cloud/).
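Once the endpoint is live, you can call it like any other HTTPS API. A minimal stdlib sketch, assuming the example URL above and a Bearer-token auth header (substitute your own deployment URL and token from the Beam dashboard):

```python
import json
import urllib.request

# Hypothetical values for illustration
URL = "https://my-model-v1.app.beam.cloud/"
TOKEN = "YOUR_BEAM_TOKEN"

def build_request(url: str, token: str, payload: dict) -> urllib.request.Request:
    # POST the JSON payload with a Bearer token in the Authorization header
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode(),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

if __name__ == "__main__":
    with urllib.request.urlopen(build_request(URL, TOKEN, {})) as resp:
        print(json.loads(resp.read()))
```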

Self-Hosting vs Cloud

Beta9 is the open-source engine powering Beam, our fully-managed cloud platform. You can self-host Beta9 for free or choose managed cloud hosting through Beam.

Contributing

We welcome contributions, big or small. These are the most helpful things for us:

Thanks to Our Contributors