remove

lvhan028 · Aug 14, 2023 · 59fd0a3
1 parent 92b326e
commit 59fd0a3
Showing 1 changed file with 0 additions and 13 deletions.
diff --git a/docs/en/serving.md b/docs/en/serving.md
@@ -34,19 +34,6 @@ bash workspace/service_docker_up.sh
 
 </details>
 
-<details open>
-<summary><b>7B with INT4 weight only quantization</b></summary>
-
-```shell
-python3 -m lmdeploy.serve.turbomind.deploy llama2 /path/to/llama-2-7b-chat-hf \
-    --model_format awq \
-    --group_size 128 \
-    --quant_path /path/to/awq-quant-weight.pt
-bash workspace/service_docker_up.sh
-```
-
-</details>
-
 ## Serving [LLaMA](https://github.com/facebookresearch/llama)
 
 Weights for the LLaMA models can be obtained from by filling out [this form](https://docs.google.com/forms/d/e/1FAIpQLSfqNECQnMkycAp2jP4Z9TFX0cGR4uf7b_fBxjY_OjhJILlKGA/viewform)