The MPI Operator makes it easy to run allreduce-style distributed training on Kubernetes. Deploy the operator with the manifests in the `deploy/` directory:

```shell
kubectl create -f deploy/
```
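To confirm the deployment succeeded, you can check that the MPIJob custom resource definition was registered and that the operator pod is running. A minimal sketch; the CRD name and the operator's namespace are assumptions, so verify them against your `kubectl get crd` and `kubectl get pods` output:

```shell
# Confirm the MPIJob CRD was registered (CRD name is an assumption;
# check the full list with `kubectl get crd` if it differs).
kubectl get crd mpijobs.kubeflow.org

# Confirm the operator pod is up (namespace varies by manifest version).
kubectl get pods --all-namespaces | grep mpi-operator
```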
Launch a multi-node TensorFlow benchmarks training job:

```shell
kubectl create -f examples/tensorflow-benchmarks.yaml
```
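The manifest defines an `MPIJob` custom resource with a launcher replica that runs `mpirun` and a set of worker replicas that host the training processes. Below is a trimmed sketch of the general shape, not the exact contents of `examples/tensorflow-benchmarks.yaml`; the `apiVersion`, image, and command are assumptions and depend on the operator version:

```yaml
apiVersion: kubeflow.org/v1          # exact version depends on the installed operator
kind: MPIJob
metadata:
  name: tensorflow-benchmarks
spec:
  slotsPerWorker: 1                  # MPI slots (typically GPUs) per worker pod
  mpiReplicaSpecs:
    Launcher:
      replicas: 1                    # single pod that runs mpirun against the workers
      template:
        spec:
          containers:
          - name: tensorflow-benchmarks
            image: mpioperator/tensorflow-benchmarks   # image name is an assumption
            command: [mpirun, python, tf_cnn_benchmarks.py]
    Worker:
      replicas: 2                    # scale this up for more nodes
      template:
        spec:
          containers:
          - name: tensorflow-benchmarks
            image: mpioperator/tensorflow-benchmarks
            resources:
              limits:
                nvidia.com/gpu: 1    # one GPU per worker slot
```

Increasing `Worker.replicas` (and `slotsPerWorker` on multi-GPU nodes) is how the job scales out; the launcher itself does no training work.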
Once the job's pods are running, the training logs are available from the launcher pod.
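For example, to follow the training output, locate the launcher pod and tail its logs. The `-launcher` naming convention below is an assumption, so verify the pod name with `kubectl get pods` first:

```shell
# List the pods created for the job: one launcher plus the worker replicas.
kubectl get pods | grep tensorflow-benchmarks

# Stream output from the launcher pod (its name carries a generated suffix,
# hence the grep; adjust if your pod names differ).
kubectl logs -f $(kubectl get pods -o name | grep tensorflow-benchmarks-launcher)
```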