[DOCS] preparing 2025.0 pass 4

openvinotoolkit · Jan 24, 2025 · dfc3dba · dfc3dba
1 parent 0fdc1bc
commit dfc3dba
Showing 1 changed file with 8 additions and 0 deletions.
diff --git a/docs/articles_en/openvino-workflow-generative/inference-with-genai.rst b/docs/articles_en/openvino-workflow-generative/inference-with-genai.rst
@@ -575,6 +575,14 @@ compression is done by NNCF at the model export stage. The exported model contai
 information necessary for execution, including the tokenizer/detokenizer and the generation
 config, ensuring that its results match those generated by Hugging Face.
 
+.. note::
+
+   To use meta-llama/Llama-2-7b-chat-hf model, you will need to accept license agreement.
+   You must be a registered user in 🤗 Hugging Face Hub. Please visit HuggingFace model card,
+   carefully read terms of usage and click accept button. You will need to use an access token
+   for the code below to run. For more information on access tokens, refer to this section of
+   the documentation. Refer to this document to learn how to login to Hugging Face Hub.
+
 The `LLMPipeline` is the main object to setup the model for text generation. You can provide the
 converted model to this object, specify the device for inference, and provide additional
 parameters.