
Add support for safetensors (non-GGUF) model format #642

Open
sallyom opened this issue Jan 28, 2025 · 3 comments

Comments

@sallyom
Collaborator

sallyom commented Jan 28, 2025

Instead of a single model file, ramalama will need to pull and mount a whole directory. It would be really nice to have ramalama pull the/model --format safetensors or something like that
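A minimal sketch of what the directory pull could look like, assuming the model is hosted on Hugging Face. huggingface_hub's snapshot_download fetches a whole repo rather than a single file; the allow_patterns filter shown here is an illustrative assumption, not ramalama's actual logic:

```python
# Sketch: pull a safetensors model as a directory (weights + config +
# tokenizer) so it can later be bind-mounted into the container.
from huggingface_hub import snapshot_download

def pull_safetensors(repo_id: str, dest: str) -> str:
    """Download all files a safetensors checkpoint needs into dest."""
    return snapshot_download(
        repo_id=repo_id,
        local_dir=dest,
        # Illustrative filter: weight shards plus model/tokenizer configs.
        allow_patterns=["*.safetensors", "*.json", "tokenizer.*"],
    )
```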

@bmahabirbu
Collaborator

bmahabirbu commented Feb 6, 2025

Great idea! If we're using vllm (or another runtime that supports safetensors), we can mount the folder as is. If we're using llama.cpp, we can convert the safetensors to a GGUF file and then mount that file as usual!
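For the llama.cpp path, the conversion could shell out to llama.cpp's convert_hf_to_gguf.py script. A hedged sketch, assuming the llama.cpp checkout location and the f16 output type (both are assumptions, not ramalama behavior):

```python
# Sketch: convert a safetensors checkpoint directory to a single GGUF file
# via llama.cpp's convert_hf_to_gguf.py, then mount the result as usual.
import subprocess
from pathlib import Path

def convert_to_gguf(model_dir: str, outfile: str,
                    llama_cpp_dir: str = "llama.cpp") -> Path:
    script = Path(llama_cpp_dir) / "convert_hf_to_gguf.py"
    subprocess.run(
        ["python3", str(script), model_dir,
         "--outfile", outfile, "--outtype", "f16"],
        check=True,  # raise if the converter fails
    )
    return Path(outfile)
```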

@ericcurtin
Collaborator

I agree we should implement this. I would even say drop the "--format safetensors" flag; I think it would not be too hard to automatically detect that a given model is safetensors.
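Auto-detection could be as simple as inspecting the pulled artifact: a directory containing *.safetensors files versus a single file starting with the GGUF magic bytes. A minimal sketch (not ramalama's actual detection code):

```python
# Sketch: infer the model format from what was pulled.
from pathlib import Path

def detect_model_format(path: str) -> str:
    p = Path(path)
    # safetensors checkpoints are directories of *.safetensors shards
    if p.is_dir() and any(p.glob("*.safetensors")):
        return "safetensors"
    # GGUF is a single file whose first four bytes are the magic b"GGUF"
    if p.is_file():
        with p.open("rb") as f:
            if f.read(4) == b"GGUF":
                return "gguf"
    return "unknown"
```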

@rhatdan
Member

rhatdan commented Feb 7, 2025

SGTM
