
Can RamaLama support Kubernetes-based inference clustering on a Mac Mini M4? #700

Open
hotwa opened this issue Feb 1, 2025 · 3 comments


hotwa commented Feb 1, 2025

I’m really impressed with the work being done on RamaLama and its ability to handle various models for inference. I have been exploring the possibility of leveraging Kubernetes (K8s) for distributed model inference in my local environment, specifically on a Mac Mini M4.

Is there currently support within RamaLama to deploy and manage model inference workloads using Kubernetes on a Mac Mini?
If not natively supported, are there any recommendations or best practices for integrating RamaLama with Kubernetes for local development/testing purposes?
Are there any known limitations or considerations when running Kubernetes-based inference clusters on ARM-based hardware (like the M4 chip) using RamaLama?

ericcurtin (Collaborator) commented Feb 1, 2025

It can be done. The first step is to install podman machine with krunkit (see https://podman.io/). There are also Kubernetes YAML generators via:

ramalama serve --generate

Most of the bits are there, just need someone to tie it all together.
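
For that first step on macOS, a minimal sketch assuming Podman 5.x, where the libkrun provider runs the Podman machine VM under krunkit; the provider name and environment variable come from the Podman documentation, not this thread:

# Select the libkrun provider so the Podman machine VM runs under krunkit
# (GPU-capable virtualization on Apple silicon)
export CONTAINERS_MACHINE_PROVIDER=libkrun
# Create and start the VM that the containerized workloads will run in
podman machine init
podman machine start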

ericcurtin (Collaborator)

Would make a great blog post!

rhatdan (Member) commented Feb 1, 2025

ramalama serve --generate kube MODEL

Will generate a Kubernetes deployment for the containerized AI model.
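
A hedged end-to-end sketch of how that generated YAML could be used; the MODEL placeholder and the MODEL.yaml filename are illustrative, and the generator's actual output location may differ:

# Generate Kubernetes YAML for the containerized model
ramalama serve --generate kube MODEL
# Apply it to a cluster with kubectl, or replay it locally with Podman
kubectl apply -f MODEL.yaml
podman kube play MODEL.yaml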
