This repository provides detailed instructions and steps to successfully run OpenWebUI and Ollama on Intel platforms.
- CPU: Intel® Core™ Ultra 7 processors
- GPU: Intel® Arc™ graphics
- RAM: 16GB
- DISK: 128GB
Install the latest Ubuntu* 22.04 LTS Desktop. Refer to the Ubuntu Desktop installation tutorial if needed.
Docker and Docker Compose should be set up before running the commands below. Refer to the Docker documentation to set up Docker.
- Refer to the GPU driver installation guide to set up the GPU drivers.
Please ensure that the following ports are available before running the applications.

| Apps | Port |
|---|---|
| Open WebUI | 80 |
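Before starting the stack, you can check that nothing is already listening on port 80. A minimal sketch using `ss`, which is available by default on Ubuntu 22.04:

```shell
#!/bin/sh
# Print any process already bound to TCP port 80; if the grep finds
# nothing, the port is free for Open WebUI.
ss -tlnp | grep ':80 ' || echo "port 80 is free"
```

If the port is busy, either stop the conflicting service or change the published port in docker-compose.yml.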
You can offload model inference to a specific device by modifying the environment variable settings in the docker-compose.yml file.

| Workload | Environment Variable | Supported Device |
|---|---|---|
| LLM | - | GPU |
| STT | STT_DEVICE | CPU, GPU, NPU |
| TTS | TTS_DEVICE | CPU |
Example Configuration:

- To offload the STT encoder workload to NPU, use the following configuration:

```yaml
stt_service:
  ...
  environment:
    ...
    - STT_DEVICE=NPU
    ...
```
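Once the stack is running, you can confirm the variable actually reached the container. A sketch, assuming the compose service is named `stt_service` as in the fragment above:

```shell
# Print the device setting as seen inside the STT container.
docker compose exec stt_service printenv STT_DEVICE   # expect: NPU
```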
```bash
docker compose build
export RENDER_GROUP_ID=$(getent group render | cut -d: -f3)
docker compose up -d
```
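The `RENDER_GROUP_ID` export resolves the numeric GID of the host's `render` group, which governs access to the GPU device nodes (`/dev/dri/renderD*`); how the value is consumed depends on docker-compose.yml. The parsing itself can be checked against any `getent`-style record:

```shell
#!/bin/sh
# `getent group render` prints e.g. "render:x:110:alice"; the GID is
# field 3 of the colon-separated record, extracted with cut.
sample='render:x:110:alice'
gid=$(printf '%s\n' "$sample" | cut -d: -f3)
echo "$gid"   # prints 110 for the sample line above
```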
- Navigate to: http://localhost:80
- Open the Admin Panel from the top left corner.
- Click on **Settings**.
- Replace the OpenAI API link:
  - Click on **Connections**.
  - Replace the OpenAI API link with `http://ollama:11434/v1` and provide any API key.
  - Click on **Verify Connection** to ensure the server connection is verified.
  - Click the **Save** button to save the changes.
- Replace the TTS and STT API links:
  - Click on **Audio**.
  - For **Speech-to-Text Engine**, change from `whisper (local)` to `OpenAI`.
  - Replace the OpenAI API link with `http://stt_service:5996/v1` and provide any API key.
  - For **Text-to-Speech Engine**, change from `Web API` to `OpenAI`.
  - Replace the OpenAI API link with `http://tts_service:5995/v1` and provide any API key.
  - Leave the STT Model, TTS Voice, and TTS Model fields empty (the default TTS voice will be EN-US).
  - Click **Save** to save the changes.
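The STT and TTS URLs above resolve only on the Docker network, so a reachability check has to run from inside a container. A hypothetical smoke test; the service names come from docker-compose.yml, the `/v1/audio/speech` path follows the OpenAI audio API that the services emulate, and it assumes `curl` is available in the `stt_service` image:

```shell
# Ask the TTS service to synthesize a short phrase; a non-empty file
# written to /tmp/speech.wav suggests the endpoint is alive.
payload='{"model":"tts-1","input":"hello","voice":""}'
docker compose exec stt_service curl -s http://tts_service:5995/v1/audio/speech \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer any-key" \
  -d "$payload" \
  -o /tmp/speech.wav
```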
- Click on **New Chat**.
- You may download the model from Ollama.com by entering the model name and selecting **Pull <model_name> from Ollama.com**.
- Click on **Arena Model** and select the target model (e.g., `qwen2.5:latest`).
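As an alternative to pulling through the UI, the model can be pulled with the Ollama CLI inside its container. A sketch, assuming the compose service is named `ollama`:

```shell
# Pull the model, then list what is installed.
docker compose exec ollama ollama pull qwen2.5:latest
docker compose exec ollama ollama list
```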
- LLM Model: Start interacting with the chat using the selected model.
- TTS Pipeline: Click on the **Read Aloud** icon to trigger the TTS API.
- STT Pipeline: Click on the **Record Voice** icon to start recording, and click again to stop. The generated text will appear in the input field.
- Linux: Export the environment variable `OLLAMA_NUM_GPU` before starting the services to offload to the `CPU` device:

```bash
# Default: GPU
export OLLAMA_NUM_GPU=999

# Runs on CPU
export OLLAMA_NUM_GPU=0
```
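Note that `docker compose` forwards a host variable into a container only if the compose file references it. If the offload setting does not take effect, check that docker-compose.yml passes the variable through, along the lines of this sketch (the service name `ollama` and the default of 999 are assumptions):

```yaml
ollama:
  environment:
    # Forward the host's setting; fall back to GPU (999) when unset.
    - OLLAMA_NUM_GPU=${OLLAMA_NUM_GPU:-999}
```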
Automatic speech recognition functionality is not supported in Firefox. Please use Chrome for validated performance.