Skip to content

Commit 9e532c4

Browse files
committed
Update installation instructions to prevent/resolve nccl error, update poetry lock to vllm 0.4.2, update flash attention instructions for vllm 0.4.2
1 parent 3ec9069 commit 9e532c4

File tree

3 files changed

+198
-57
lines changed

3 files changed

+198
-57
lines changed

README.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -23,6 +23,7 @@ pip install wheel
2323
export CUDA_HOME=/pkgs/cuda-12.3
2424

2525
pip install flash-attn --no-build-isolation
26+
pip install vllm-flash-attn
2627
```
2728

2829
## Launch an inference server

0 commit comments

Comments
 (0)