How to use an online LLM API instead of a locally loaded vLLM #1307

Closed · Answered by cpfiffer
devillaws asked this question in Q&A

Please review this doc. vLLM is OpenAI-compatible, meaning you can just use the openai Python library and point base_url at whatever your inference server is.

import openai
from pydantic import BaseModel


class Testing(BaseModel):
    """
    A class representing a testing schema.
    """
    name: str
    age: int

# Point the client at any OpenAI-compatible endpoint instead of api.openai.com
openai_client = openai.OpenAI(
    base_url="http://0.0.0.0:1234/v1",
    api_key="dopeness"
)

# Make a request to the local LM Studio server
response = openai_client.beta.chat.completions.parse(
    model="hugging-quants/Llama-3.2-1B-Instruct-Q8_0-GGUF",
    messages=[
        {"role": "system", "content": "You are like so good at whatever you do."},
        {"role": "user", "content": "My name is Cameron and …

Answer selected by cpfiffer