Skip to content

Commit

Permalink
fix: increase gpu utilization
Browse files Browse the repository at this point in the history
  • Loading branch information
Avram Tudor committed Oct 10, 2024
1 parent 63513e8 commit 5f4f694
Show file tree
Hide file tree
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion run.sh
Original file line number Diff line number Diff line change
Expand Up @@ -9,5 +9,5 @@ else
cd ..
fi

export LLAMA_N_CTX=16000
export LLAMA_N_CTX=44000
poetry run python -m uvicorn skynet.main:app --reload
2 changes: 1 addition & 1 deletion skynet/modules/ttt/openai_api/app.py
Original file line number Diff line number Diff line change
Expand Up @@ -31,7 +31,7 @@ def initialize():
f'python -m {openai_api_server_path} \
--disable-log-requests \
--model {llama_path} \
--gpu_memory_utilization 0.95 \
--gpu_memory_utilization 0.99 \
--max-model-len {llama_n_ctx} \
--port {openai_api_server_port}'.split(),
shell=False,
Expand Down

0 comments on commit 5f4f694

Please sign in to comment.