


UPDATE (02-07-2025): This repository has been merged into RAGSST and will no longer be supported.


Ask your documents!

Implement a Retrieval Augmented Generation (RAG) system to query and retrieve information from your local documents efficiently.

Hands-on Workshop.

Local RAG

Gain practical experience with embeddings, vector databases, and local Large Language Models (LLMs).

Repository: https://github.com/aihpi/workshop-local-rag

[Flowchart: overview of the local RAG pipeline]
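The flow boils down to three steps: embed your documents into a vector store, retrieve the chunks most similar to a question, and have a local LLM answer using those chunks as context. The sketch below illustrates the idea; it is not the workshop's actual code, and it assumes the chromadb package is installed and an Ollama server with llama3.2 is running locally. The sample documents are placeholders.

import requests
import chromadb

# In-memory vector store; Chroma embeds documents with its default embedding model.
client = chromadb.Client()
collection = client.create_collection(name="docs")
collection.add(
    ids=["0", "1"],
    documents=[
        "Ollama runs large language models locally.",             # placeholder chunks:
        "Embeddings map text to vectors for similarity search.",  # use your own documents
    ],
)

question = "Where can I run an LLM locally?"

# Retrieval: fetch the chunk(s) most similar to the question.
results = collection.query(query_texts=[question], n_results=1)
context = "\n".join(results["documents"][0])

# Generation: ask the local model, grounded in the retrieved context.
prompt = f"Answer using only this context:\n{context}\n\nQuestion: {question}"
answer = requests.post(
    "http://localhost:11434/api/generate",  # Ollama's default endpoint
    json={"model": "llama3.2", "prompt": prompt, "stream": False},
)
print(answer.json()["response"])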


Getting Started

Download or clone the repository

Installation

Install it step by step (or check Auto Installation for a single command)

Create and activate a virtual environment

$ python3 -m venv myvenv
$ source myvenv/bin/activate

Install dependencies

$ pip3 install -r requirements.txt

Ollama

Install Ollama to run large language models locally.

$ curl -fsSL https://ollama.ai/install.sh | sh

Or follow the installation instructions for your operating system: Install Ollama

Choose and download an LLM. For example:

$ ollama pull llama3.2
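To check that the model works, you can chat with it directly in the terminal (type /bye to exit):

$ ollama run llama3.2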

Auto Installation

Alternatively, in a bash shell, run the provided installation script:

$ bin/install.sh

Usage

(myvenv)$ python3 local-rag-gui.py

Then open the link shown in the terminal in your browser to use the graphical user interface.

Or run the following for the command-line version:

(myvenv)$ python3 local-rag-cli.py

If the LLM server is not running, start it in a separate terminal with:

$ ollama serve
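To check whether the server is already up, query its default port:

$ curl http://localhost:11434

If it is running, it answers with "Ollama is running".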

Additional Input Parameters on the Frontend

  • Top k: Ranks the output tokens in descending order of probability, keeps the first k tokens as a new distribution, and samples the output from it. Higher values yield more diverse answers; lower values yield more conservative answers. (Range: [0, 10]. Default: 5)

  • Top p: Works together with Top k, but instead of keeping a fixed number of tokens, it keeps just enough tokens to cover the given cumulative probability. A higher value produces more varied text; a lower value leads to more focused and conservative answers. (Range: [0.1, 1]. Default: 0.9)

  • Temp: Controls the “randomness” of the answers by scaling the probability distribution of the output tokens. Increasing the temperature makes the model answer more creatively. (Range: [0.1, 1]. Default: 0.5)
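The sketch below shows how these three parameters interact during sampling. It is an illustration, not the frontend's actual code: temperature rescales the distribution, top-k keeps the k most probable tokens, and top-p then keeps the smallest prefix of those whose cumulative probability reaches p.

import numpy as np

def sample_token(probs, top_k=5, top_p=0.9, temperature=0.5):
    """Toy top-k / top-p / temperature sampling from a token distribution."""
    probs = np.asarray(probs, dtype=float)

    # Temperature: rescale the distribution (lower = sharper, more conservative).
    logits = np.log(probs) / temperature
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()

    # Top k: keep only the k most probable tokens.
    order = np.argsort(probs)[::-1][:top_k]

    # Top p: keep the smallest prefix covering cumulative probability top_p.
    cutoff = np.searchsorted(np.cumsum(probs[order]), top_p) + 1
    kept = order[:cutoff]

    # Renormalize and sample from the surviving tokens.
    return np.random.choice(kept, p=probs[kept] / probs[kept].sum())

# Toy 6-token vocabulary: prints the index of the sampled token.
print(sample_token([0.4, 0.25, 0.15, 0.1, 0.07, 0.03]))

In Ollama itself, the same knobs are exposed as the top_k, top_p, and temperature request options.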


Development

Before committing, format the code with black as follows in the project folder:

$ black -t py311 -S -l 99 .

You can install Black with:

$ python3 -m pip install black

License

GPLv3
