-
Notifications
You must be signed in to change notification settings - Fork 7.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feat/fireworks integration #2089
base: main
Are you sure you want to change the base?
Changes from 6 commits
f1ad995
a38675b
2d5865e
c5c1f9b
241637f
4d22546
519c48b
9c3590e
cecec30
80f15a1
b807e50
6a46060
03e8809
0ff7a06
b2ffe5b
16d1f60
b8cb49a
2052ff4
5334dda
c846b3f
62985df
c4be3f8
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,54 @@ | ||
FROM python:3.11.6-slim-bookworm as base | ||
|
||
# Install poetry | ||
RUN pip install pipx | ||
RUN python3 -m pipx ensurepath | ||
RUN pipx install poetry==1.8.3 | ||
ENV PATH="/root/.local/bin:$PATH" | ||
ENV PATH=".venv/bin/:$PATH" | ||
|
||
RUN apt update && apt install -y \ | ||
build-essential | ||
|
||
# https://python-poetry.org/docs/configuration/#virtualenvsin-project | ||
ENV POETRY_VIRTUALENVS_IN_PROJECT=true | ||
|
||
FROM base as dependencies | ||
WORKDIR /home/worker/app | ||
COPY pyproject.toml poetry.lock ./ | ||
|
||
ARG POETRY_EXTRAS="ui llms-fireworks embeddings-fireworks vector-stores-qdrant embeddings-openai" | ||
RUN poetry install --no-root --extras "${POETRY_EXTRAS}" | ||
|
||
FROM base as app | ||
ENV PYTHONUNBUFFERED=1 | ||
ENV PORT=8080 | ||
ENV APP_ENV=prod | ||
ENV PYTHONPATH="$PYTHONPATH:/home/worker/app/private_gpt/" | ||
EXPOSE 8080 | ||
|
||
# Prepare a non-root user | ||
# More info about how to configure UIDs and GIDs in Docker: | ||
# https://github.com/systemd/systemd/blob/main/docs/UIDS-GIDS.md | ||
|
||
# Define the User ID (UID) for the non-root user | ||
# UID 100 is chosen to avoid conflicts with existing system users | ||
ARG UID=100 | ||
|
||
# Define the Group ID (GID) for the non-root user | ||
# GID 65534 is often used for the 'nogroup' or 'nobody' group | ||
ARG GID=65534 | ||
|
||
RUN adduser --system --gid ${GID} --uid ${UID} --home /home/worker worker | ||
WORKDIR /home/worker/app | ||
|
||
RUN chown worker /home/worker/app | ||
RUN mkdir local_data && chown worker local_data | ||
RUN mkdir models && chown worker models | ||
COPY --chown=worker --from=dependencies /home/worker/app/.venv/ .venv | ||
COPY --chown=worker private_gpt/ private_gpt | ||
COPY --chown=worker *.yaml . | ||
COPY --chown=worker scripts/ scripts | ||
|
||
USER worker | ||
ENTRYPOINT python -m private_gpt |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,13 +1,12 @@ | ||
services: | ||
|
||
#----------------------------------- | ||
#---- Private-GPT services --------- | ||
#----------------------------------- | ||
|
||
# Private-GPT service for the Ollama CPU and GPU modes | ||
# This service builds from an external Dockerfile and runs the Ollama mode. | ||
private-gpt-ollama: | ||
image: ${PGPT_IMAGE:-zylonai/private-gpt}:${PGPT_TAG:-0.6.2}-ollama # x-release-please-version | ||
image: ${PGPT_IMAGE:-zylonai/private-gpt}:${PGPT_TAG:-0.6.2}-ollama # x-release-please-version | ||
build: | ||
context: . | ||
dockerfile: Dockerfile.ollama | ||
|
@@ -80,7 +79,7 @@ services: | |
ollama-cpu: | ||
image: ollama/ollama:latest | ||
volumes: | ||
- ./models:/root/.ollama | ||
- ./local_data:/root/.ollama | ||
profiles: | ||
- "" | ||
- ollama-cpu | ||
|
@@ -98,4 +97,22 @@ services: | |
count: 1 | ||
capabilities: [gpu] | ||
profiles: | ||
- ollama-cuda | ||
- ollama-cuda | ||
|
||
# fireworks service | ||
private-gpt-fireworks: | ||
build: | ||
context: . | ||
dockerfile: Dockerfile.fireworks | ||
volumes: | ||
- ./local_data/:/home/worker/app/local_data | ||
ports: | ||
- "3001:8080" | ||
environment: | ||
PORT: 8080 | ||
PGPT_PROFILES: fireworks | ||
FIREWORKS_API_KEY: ${FIREWORKS_API_KEY} | ||
env_file: | ||
- .env | ||
profiles: | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Same as Dockerfile There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. here i have used how other docker files and docker-compose are written There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I would say the same. It is not the core of PGPT, therefore, we should not give this kind of support. Our goal is to give a 100% private solution, in this area, our two main providers are Ollama and Llama-CPP. Of course, this PR gives more value to other people with the same problems, but it doesn't make sense to maintain on docker-compose and Dockerfile. |
||
- fireworks |
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I'm not sure how the new dependencies are installed, but it bumps all dependencies. Please revert the current There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. i had bumped it while i was having error when testing at github |
Large diffs are not rendered by default.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,13 @@ | ||
server: | ||
env_name: ${APP_ENV:fireworks} | ||
|
||
llm: | ||
mode: fireworks | ||
|
||
embedding: | ||
mode: fireworks | ||
|
||
fireworks: | ||
api_key: ${FIREWORKS_API_KEY:} | ||
model: "accounts/fireworks/models/llama-v3p1-70b-instruct" | ||
#poetry install --extras "ui llms-fireworks embeddings-fireworks vector-stores-qdrant embeddings-openai" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I would say that the Docker file is useless in this case. It's a better idea if someone wants to use fireworks, make the necessary modifications.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yes the docker file is only for running it even if the make modification docker file is usefull cause of dependency errors
python version mismatch etc
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't like this premise.
Adding another Docker/Docker-compose profile, will mean more code to maintain, when it is not the optimal PGPT user case. If fireworks was a fully local-setup environment, probably, it would be nice.