diff --git a/docs/reference/models/llamacpp.md b/docs/reference/models/llamacpp.md
index 0ab3dd83b..24b0fdc97 100644
--- a/docs/reference/models/llamacpp.md
+++ b/docs/reference/models/llamacpp.md
@@ -47,7 +47,7 @@ model = models.llamacpp(
 | `n_gpu_layers`| `int` | Number of layers to offload to GPU. If -1, all layers are offloaded | `0` |
 | `split_mode` | `int` | How to split the model across GPUs. `1` for layer-wise split, `2` for row-wise split | `1` |
 | `main_gpu` | `int` | Main GPU | `0` |
-| `tensor_split` | `Optional[List[float]]` | How split tensors should be distributed accross GPUs. If `None` the model is not split. | `None` |
+| `tensor_split` | `Optional[List[float]]` | How split tensors should be distributed across GPUs. If `None`, the model is not split. | `None` |
 | `n_ctx` | `int` | Text context. Inference from the model if set to `0` | `0` |
 | `n_threads` | `Optional[int]` | Number of threads to use for generation. All available threads if set to `None`.| `None` |
 | `verbose` | `bool` | Print verbose outputs to `stderr` | `False` |