Recommended latest image is too large for most clusters #30
Comments
Agree. It would also be good to have a small default image (one that does the minimum) just to make sure everything can be installed. Maybe put a warning about the size, since it isn't visible on the quay site.
This download was on k8s on EC2; that's a long time.
Great point. On a standard K8s node, this is going to really fill up the overlay, and when that node hits >=80% you will start getting evictions. Most orgs allocate 100-200GB on the overlay (from what I have seen), so this taking up 1/4 of the storage may cause issues for other workloads. One thing you could do is add a node with a taint and then use a toleration to limit that node to running just localai (and other resources for it). I know that can be easier said than done...
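For anyone who wants to try the dedicated-node approach, here is a minimal sketch of the pod spec side. The key/value pair dedicated=localai and the node name are hypothetical, and whether the chart exposes nodeSelector and tolerations in its values file is an assumption to check against the chart itself.

```yaml
# Assumed one-time node setup (hypothetical node name and key/value):
#   kubectl taint nodes <node-name> dedicated=localai:NoSchedule
#   kubectl label nodes <node-name> dedicated=localai
#
# Pod spec fragment for the localai workload.
spec:
  # The label selector steers the pod onto the dedicated node.
  nodeSelector:
    dedicated: localai
  # The toleration allows the pod to schedule despite the taint,
  # while pods without it are kept off that node.
  tolerations:
    - key: dedicated
      operator: Equal
      value: localai
      effect: NoSchedule
```

The taint alone only keeps other pods off the node; the nodeSelector (or a node affinity rule) is what keeps localai from landing elsewhere.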
Docs about the different image tags are here: https://localai.io/basics/getting_started/#container-images. I do agree generally; however, note that after a few rounds of enhancements the images are now much smaller. Compressed, they are on the order of ~17GB.
Nice progress on optimizing. I do get why they are larger images in general, and using a PVC for the models is really the way to relieve stress on the container host filesystem.
While trying out the helm chart, I found that the recommended image (from the example values file) is far too large to pull on our cluster. Pulling the image locally, I see that it is just shy of 70.0 Gi.
I think that if the docs made some suggestions for different image tags, it would make installing the chart easier.
I'm going forward with v2.0.0-ffmpeg-core.