Skip to content

Commit 23904d9

Browse files
committed
update training operator control plane
Signed-off-by: mahdikhashan <[email protected]>
1 parent 94240ac commit 23904d9

File tree

1 file changed

+3
-2
lines changed

1 file changed

+3
-2
lines changed

content/en/docs/components/katib/user-guides/hp-tuning/llm-hp-optimization.md

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -26,8 +26,9 @@ You need to install the following Katib components to run code in this guide:
2626
- Katib control plane – [install](/docs/components/katib/installation/#installing-control-plane).
2727
- Katib Python SDK with LLM Hyperparameter Optimization Support – [install](/docs/components/katib/installation/#installing-python-sdk)
2828

29-
Additionally, this API supports both non-distributed training and distributed training using **PyTorchJob**.
30-
If you want to use **distributed training**, make sure to install the **Training Operator** control plane in addition to the packages mentioned above.
29+
Additionally, this API supports both non-distributed and distributed training using PyTorchJob. If you want to enable distributed training, you must install the Training Operator control plane in addition to the required packages mentioned above.
30+
31+
Furthermore, when importing models and datasets from external platforms, you are required to define `resources_per_trial` using the `TrainerResources` object. This setup automatically utilizes PyTorchJob for training, making the installation of the Training Operator control plane **mandatory** in this case. Please ensure this setup is properly configured.
3132

3233
- Training Operator control plane – [install](/docs/components/training/installation/#installing-the-control-plane)
3334

0 commit comments

Comments
 (0)