
🐛 [Bug]: DLLAMA_BLAS_VENDOR=OpenBLAS build with pip is not enabling OpenBlas #977

Closed · 2 tasks done

amgowda-oci opened this issue Dec 13, 2023 · 3 comments

Bug description

I installed OpenBLAS and tried pip install as documented:

    CMAKE_ARGS="-DLLAMA_BLAS=ON -DLLAMA_BLAS_VENDOR=OpenBLAS" pip install -v llama-cpp-python

but the llama.cpp that gets built does not have BLAS enabled; somewhere, when pip builds llama-cpp-python, these flags are not being passed down to llama.cpp:

serge-serge-1 | AVX = 0 | AVX2 = 0 | AVX512 = 0 | AVX512_VBMI = 0 | AVX512_VNNI = 0 | FMA = 0 | NEON = 1 | ARM_FMA = 1 | F16C = 0 | FP16_VA = 0 | WASM_SIMD = 0 | BLAS = 0 | SSE3 = 0 | SSSE3 = 0 | VSX = 0 |
serge-serge-1 | INFO: 172.18.0.2:45058 - "POST /chat/?temperature=0.1&top_k=50&max_length=2048&top_p=0.95&context_window=256&gpu_layers=0&repeat_last_n=64&model=Mistral-7B&n_threads=40&repeat_penalty=1.3&init_prompt=Below+is+an+instruction+that+describes+a+task.+Write+a+response+that+appropriately+completes+the+request. HTTP/1.1" 200 OK
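One thing worth ruling out (a sketch, not verified against this setup): pip may reuse a cached or previously built wheel, in which case CMAKE_ARGS never reaches a compile step at all. Forcing a rebuild would look something like:

```shell
# Sketch: force pip to rebuild llama-cpp-python so the CMake flags can
# take effect. --no-cache-dir avoids reusing a previously built wheel;
# --force-reinstall replaces the currently installed copy.
cmake_args='-DLLAMA_BLAS=ON -DLLAMA_BLAS_VENDOR=OpenBLAS'
install_cmd="CMAKE_ARGS=\"$cmake_args\" pip install -v --force-reinstall --no-cache-dir llama-cpp-python"

# Print the command; run it with: eval "$install_cmd"
echo "$install_cmd"
```

The verbose (`-v`) output should then show a CMake configure step mentioning BLAS; if no compile happens at all, pip resolved a wheel and the flags were silently ignored.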

Steps to reproduce

  1. Install libopenblas-dev on your Linux OS/Docker container
  2. Run: CMAKE_ARGS="-DLLAMA_BLAS=ON -DLLAMA_BLAS_VENDOR=OpenBLAS" pip install -v llama-cpp-python
  3. Download any model
  4. Run the ./api or llama-cpp-python
  5. You will see in the Serge logs that llama.cpp loads the ML model without OpenBLAS enabled

(screenshot of the resulting log omitted)
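Step 5 can be checked mechanically by grepping the llama.cpp system-info banner (the line quoted in the bug description) for the BLAS flag; a small sketch, using an abridged copy of the banner from this report:

```shell
# Sketch: return success if a llama.cpp system-info banner reports BLAS = 1.
has_blas() {
    echo "$1" | grep -Eq 'BLAS *= *1'
}

# Abridged banner copied from the log in this report (note BLAS = 0).
banner='AVX = 0 | AVX2 = 0 | NEON = 1 | ARM_FMA = 1 | BLAS = 0 | VSX = 0 |'

if has_blas "$banner"; then
    echo "OpenBLAS enabled"
else
    echo "OpenBLAS NOT enabled"
fi
```

Running this against the banner above prints "OpenBLAS NOT enabled", matching the symptom reported here.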

Environment Information

# ---------------------------------------
# Base image for node
# ---------------------------------------
FROM node:20-bookworm-slim as node_base

# ---------------------------------------
# Base image for redis
# ---------------------------------------
FROM redis:7-bookworm as redis

# ---------------------------------------
# Dev environment
# ---------------------------------------
FROM python:3.11-slim-bookworm as dev

# Set ENV
WORKDIR /usr/src/app
ENV TZ=Etc/UTC
ENV NODE_ENV='development'

# Install dependencies
RUN apt-get update \
    && apt-get install -y --no-install-recommends dumb-init \
    && apt-get install -y libopenblas-dev \
    && pip install --upgrade pip

# Copy database, source code, and scripts
COPY --from=redis /usr/local/bin/redis-server /usr/local/bin/redis-server
COPY --from=redis /usr/local/bin/redis-cli /usr/local/bin/redis-cli
COPY --from=node_base /usr/local /usr/local
COPY scripts/dev.sh /usr/src/app/dev.sh
COPY scripts/serge.env /usr/src/app/serge.env
COPY vendor/requirements.txt /usr/src/app/requirements.txt
COPY ./web/package.json ./web/package-lock.json ./

RUN npm ci \
    && chmod 755 /usr/src/app/dev.sh \
    && chmod 755 /usr/local/bin/redis-server \
    && chmod 755 /usr/local/bin/redis-cli \
    && mkdir -p /etc/redis \
    && mkdir -p /data/db \
    && mkdir -p /usr/src/app/weights \
    && echo "appendonly yes" >> /etc/redis/redis.conf \
    && echo "dir /data/db/" >> /etc/redis/redis.conf

EXPOSE 8008
EXPOSE 9124
ENTRYPOINT ["/usr/bin/dumb-init", "--"]
CMD ["/bin/bash", "-c", "/usr/src/app/dev.sh"]

DEV.SH

#!/bin/bash

set -x
source serge.env

# Get CPU Architecture

cpu_arch=$(uname -m)

# Build with OpenBLAS=1
blasconfig='CMAKE_ARGS="-DLLAMA_BLAS=ON -DLLAMA_BLAS_VENDOR=OpenBLAS"'

# Function to detect CPU features
detect_cpu_features() {
    cpu_info=$(lscpu)
    if echo "$cpu_info" | grep -q "avx512"; then
        echo "AVX512"
    elif echo "$cpu_info" | grep -q "avx2"; then
        echo "AVX2"
    elif echo "$cpu_info" | grep -q "avx"; then
        echo "AVX"
    else
        echo "basic"
    fi
}

# Check if the CPU architecture is aarch64/arm64
if [ "$cpu_arch" = "aarch64" ]; then
    pip_command="$blasconfig pip install -v llama-cpp-python==$LLAMA_PYTHON_VERSION --only-binary=:all: --extra-index-url=https://gaby.github.io/arm64-wheels/"
else
    # Use @jllllll provided wheels
    cpu_feature=$(detect_cpu_features)
    pip_command="$blasconfig pip install -v llama-cpp-python==$LLAMA_PYTHON_VERSION --only-binary=:all: --extra-index-url=https://jllllll.github.io/llama-cpp-python-cuBLAS-wheels/$cpu_feature/cpu"
fi

echo "Recommended install command for llama-cpp-python: $pip_command"

# Install python vendor dependencies
pip install -r /usr/src/app/requirements.txt || {
    echo 'Failed to install python dependencies from requirements.txt'
    exit 1
}

# Install python dependencies
pip install -e ./api || {
    echo 'Failed to install python dependencies'
    exit 1
}

# Install python bindings
eval "$pip_command" || {
    echo 'Failed to install llama-cpp-python'
    exit 1
}

# Start Redis instance
redis-server /etc/redis/redis.conf &

# Start the web server
cd /usr/src/app/web || exit 1
npm run dev -- --host 0.0.0.0 --port 8008 &

# Start the API
cd /usr/src/app/api || exit 1
uvicorn src.serge.main:api_app --reload --host 0.0.0.0 --port 9124 --root-path /api/ || {
    echo 'Failed to start main app'
    exit 1
}
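One observation about the script above (an assumption worth verifying, not a confirmed diagnosis): both pip commands pass `--only-binary=:all:`, which tells pip to install a prebuilt wheel and never compile from source, so CMAKE_ARGS cannot influence the resulting binary regardless of how it is quoted. A source build would drop that flag, along these lines:

```shell
# Sketch (assumption): drop --only-binary=:all: so pip compiles llama.cpp
# from source, letting CMAKE_ARGS reach CMake. $LLAMA_PYTHON_VERSION is the
# version pin normally sourced from serge.env in dev.sh above.
LLAMA_PYTHON_VERSION="${LLAMA_PYTHON_VERSION:-}"
blasconfig='CMAKE_ARGS="-DLLAMA_BLAS=ON -DLLAMA_BLAS_VENDOR=OpenBLAS"'
pip_command="$blasconfig pip install -v --no-cache-dir llama-cpp-python==$LLAMA_PYTHON_VERSION"

# As in dev.sh, the env-var prefix only works because the command is run
# through eval: eval "$pip_command"
echo "$pip_command"
```

The trade-off is a much slower install (a full C/C++ compile) and a build-toolchain requirement in the image, which is presumably why the script prefers wheels.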

Screenshots

No response

Relevant log output

No response

Confirmations

  • I'm running the latest version of the main branch.
  • I checked existing issues to see if this has already been described.
gaby (Member) commented Dec 13, 2023

@amgowda-oci Serge doesn't support GPU yet. Work is being done in #944

gaby closed this as completed Dec 13, 2023
amgowda-oci (Author)

@gaby this is using OpenBLAS, which has no GPU dependency

gaby (Member) commented Dec 13, 2023

@amgowda-oci I don't know then. Ask the llama-cpp-python team; we are just installing it with pip. It sounds like a problem on their side.
