Skip to content

The file structure of model folder (/data) #1231

Closed Answered by Starlento
Starlento asked this question in Q&A
Discussion options

You must be logged in to vote

According to #815
I modified a docker-compose.yml which makes it work:
The model-id is actually a dir if you want to use the local model.

#Huggingface Inference
version: '3.5'

services:
  huggingface-inference:
    container_name: huggingface-inference
    image: ghcr.io/huggingface/text-generation-inference:latest
    shm_size: 8g
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: all
              capabilities: [gpu]
    # runtime: nvidia
    volumes:
      - ./models:/data
    ports:
      - "8000:80"
    environment:
      # - NVIDIA_VISIBLE_DEVICES=0
      # - CUDA_LAUNCH_BLOCKING=1
      - MODEL_ID=/data/zephyr-7b…

Replies: 2 comments

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Answer selected by Starlento
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
1 participant