Skip to content

Commit

Permalink
[DOCS] preparing 2025.0 pass 4
Browse files Browse the repository at this point in the history
  • Loading branch information
kblaszczak-intel committed Jan 24, 2025
1 parent 0fdc1bc commit dfc3dba
Showing 1 changed file with 8 additions and 0 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -575,6 +575,14 @@ compression is done by NNCF at the model export stage. The exported model contai
information necessary for execution, including the tokenizer/detokenizer and the generation
config, ensuring that its results match those generated by Hugging Face.

.. note::

To use meta-llama/Llama-2-7b-chat-hf model, you will need to accept license agreement.
You must be a registered user in 🤗 Hugging Face Hub. Please visit HuggingFace model card,
carefully read terms of usage and click accept button. You will need to use an access token
for the code below to run. For more information on access tokens, refer to this section of
the documentation. Refer to this document to learn how to login to Hugging Face Hub.

The `LLMPipeline` is the main object to setup the model for text generation. You can provide the
converted model to this object, specify the device for inference, and provide additional
parameters.
Expand Down

0 comments on commit dfc3dba

Please sign in to comment.