This is a simple RAG (retrieval-augmented generation) service that runs everything locally, using Vespa or OpenSearch as the vector store and an Ollama model for generation.
NOTE: The OpenSearch implementation is still a work in progress and is not yet ready to use.
The default setup reads epub books from the books directory for the RAG. Copy your favorite epub books in there, with filenames ending in .epub.
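A quick way to stage books is a sketch like the one below; SRC is a placeholder for wherever your epub files actually live, not a path this repo defines:

```shell
# Sketch: stage .epub files into the books/ directory the ingester reads.
# SRC is a placeholder: point it at wherever your epub files actually live.
SRC="${SRC:-$HOME/Downloads}"
mkdir -p books
# Only filenames ending in .epub are picked up.
find "$SRC" -maxdepth 1 -name '*.epub' -exec cp {} books/ \; 2>/dev/null || true
ls books
```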
Install ollama
The default configuration uses the mistral:7b model; pull it with:
ollama pull mistral:7b
To list your local ollama models:
ollama list
# For more details on the models do:
curl -s localhost:11434/api/tags | jq .
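To pull just the model names out of the /api/tags response, a jq filter like this works; a canned response is used here so the snippet runs without a live server, so replace the echo with `curl -s localhost:11434/api/tags`:

```shell
# Extract the model names from an /api/tags-style response.
# The echo stands in for: curl -s localhost:11434/api/tags
echo '{"models":[{"name":"mistral:7b"},{"name":"llama3:8b"}]}' \
  | jq -r '.models[].name'
```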
You need to start a Vespa version 8 cluster:
docker run --detach \
--name vespa \
--hostname vespa-tutorial \
--publish 8080:8080 \
--publish 19071:19071 \
--publish 19092:19092 \
--publish 19050:19050 \
vespaengine/vespa:8
Note: publishing port 19050 is not strictly necessary, but it serves a useful status page for the Vespa cluster once your Vespa doc types are in place.
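Before deploying, you can probe the config server's health endpoint on port 19071 (published above); this is a sketch that prints "up" or "down" rather than blocking, so it is safe to run even while the container is still starting:

```shell
# Sketch: check whether the Vespa config server (deploy port 19071) is ready.
vespa_config_up() {
  curl -sf --max-time 2 http://localhost:19071/state/v1/health >/dev/null 2>&1 \
    && echo up || echo down
}
vespa_config_up
```

Deploy only once it reports up; `vespa deploy --wait` will also wait on its own.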
Install the vespa-cli if needed:
brew install vespa-cli
Run from the root of this repo:
vespa deploy --wait 300 vespa
If you used the above docker command to expose the 19050 port then you can monitor the Cluster status on this page: http://127.0.0.1:19050/clustercontroller-status/v1/llm
To kill (and delete all data from) the Vespa cluster:
docker rm -f vespa
# Delete all books
curl -X DELETE \
"http://localhost:8080/document/v1/embeddings/books/docid?selection=true&cluster=llm"
# Delete all news
curl -X DELETE \
"http://localhost:8080/document/v1/embeddings/news/docid?selection=true&cluster=llm"
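The two delete-all calls above differ only in the document type, so a small helper can build the URL; the "embeddings" namespace and "llm" cluster name are taken from the commands above:

```shell
# Sketch: build the document/v1 delete-all URL for a given doc type
# in the "embeddings" namespace and "llm" content cluster.
delete_all_url() {
  echo "http://localhost:8080/document/v1/embeddings/$1/docid?selection=true&cluster=llm"
}
delete_all_url books
# To actually delete: curl -X DELETE "$(delete_all_url books)"
```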
Follow the instructions to set up a single-node OpenSearch server with Docker.
Using docker-compose:
cd opensearch
wget https://raw.githubusercontent.com/opensearch-project/documentation-website/2.12/assets/examples/docker-compose.yml
# Setup your admin password
echo "OPENSEARCH_INITIAL_ADMIN_PASSWORD=$OPENSEARCH_INITIAL_ADMIN_PASSWORD" > .env
# Start the containers as detached daemons:
docker-compose up -d
Check that OpenSearch is up and running:
curl -ku "admin:$OPENSEARCH_INITIAL_ADMIN_PASSWORD" https://localhost:9200
# If the docker containers do not start then check the server logs:
docker logs opensearch-node1
Things that might go wrong above:
- The admin password is not strong enough: set the OPENSEARCH_INITIAL_ADMIN_PASSWORD environment variable to a strong password, as OpenSearch will not start otherwise.
- The host's sysctl limits are not set.
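On the sysctl side, OpenSearch in Docker needs vm.max_map_count of at least 262144 on the host; this sketch checks the current value (raise it with `sudo sysctl -w vm.max_map_count=262144`):

```shell
# Sketch: verify the host's vm.max_map_count meets OpenSearch's minimum.
current="$(sysctl -n vm.max_map_count 2>/dev/null || echo 0)"
if [ "$current" -ge 262144 ]; then
  echo "vm.max_map_count ok: $current"
else
  echo "vm.max_map_count too low: $current"
fi
```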
Open http://localhost:5601 and log in as admin with the OPENSEARCH_INITIAL_ADMIN_PASSWORD you set above.
Make sure the configuration is set to the vector store and model you want to use, then build:
mvn clean compile package
# Populate the Vector store
./target/langchain4j-local-rag-sample-0.0.1-assembly/bin/rag-sample-create-embeddings.sh
# Chat
./target/langchain4j-local-rag-sample-0.0.1-assembly/bin/rag-sample-cli.sh
# Start GRPC server
./target/langchain4j-local-rag-sample-0.0.1-assembly/bin/rag-sample-grpc-service.sh
# Call the service
grpcurl --plaintext -d '{"question": "What is the Foundation?"}' 127.0.0.1:4242 ragsample.RagSample.Ask
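With the server running, grpcurl can also introspect the service; this sketch assumes gRPC server reflection is enabled (otherwise point grpcurl at the .proto file with -proto) and degrades to a hint when the server is down:

```shell
# Sketch: list the services the gRPC server exposes, if it is reachable.
if msg="$(grpcurl --plaintext 127.0.0.1:4242 list 2>/dev/null)"; then
  echo "$msg"
else
  msg="server not running; start rag-sample-grpc-service.sh first"
  echo "$msg"
fi
```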
Some alternative prompt templates for the configuration:
prompt.template = """You are a helpful assistant, conversing with a user about the subjects contained in a set of documents.
Use the information from the DOCUMENTS section to provide accurate answers. If unsure or if the answer
isn't found in the DOCUMENTS section, simply state that you don't know the answer.
QUESTION:
{{userMessage}}
DOCUMENTS:
{{contents}}
"""
prompt.template = """Context information is below.
---------------------
{{contents}}
---------------------
Given the context information above and no prior knowledge, provide answers based on the below query.
{{userMessage}}
"""