A hands-on workshop exploring how to work with text embeddings for search and retrieval, using modern Python tools and libraries.
Companion to the talk "How to teach new things to your AI".
This workshop teaches the fundamentals of working with text embeddings through a practical Jupyter notebook that guides participants through:
- Text extraction from PDFs
- Semantic text chunking
- Creating and working with embeddings
- Vector similarity search
- Reranking search results
- Building a simple RAG (Retrieval Augmented Generation) system
## Prerequisites

- Python 3.12
- Basic familiarity with Python and Jupyter notebooks
- Understanding of basic NLP concepts
- A text editor (VS Code recommended)
## Setup

- Install Python 3.12 using a version manager of your choice.
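  For example, assuming a recent uv is already installed, one way to get Python 3.12 is:

  ```bash
  uv python install 3.12
  ```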
- Clone this repository and navigate to the project directory:

  ```bash
  git clone [repository-url]
  cd [repository-name]
  ```
- Create and activate a virtual environment:

  ```bash
  uv venv
  source .venv/bin/activate  # On Unix/macOS
  # or
  .venv\Scripts\activate     # On Windows
  ```
- Install dependencies:

  ```bash
  uv pip install -r requirements.txt
  ```
- Launch Jupyter Notebook:

  ```bash
  jupyter notebook
  ```

- Open `embeddings.ipynb` and follow along with the tutorial.
## What You'll Learn

- How to extract and process text from PDF documents
- Techniques for semantic text chunking
- Creating and working with text embeddings
- Implementing vector similarity search using DuckDB
- Using rerankers to improve search results
- Building a simple question-answering system

The short sketches below give a feel for each of these steps; the notebook walks through them in full.
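As a taste of the first two steps, here is a minimal sketch of pulling text out of a PDF with PyMuPDF and cutting it into chunks. The file name `example.pdf` is a placeholder, and the simple character-budget chunker below stands in for the semantic chunking the notebook actually covers.

```python
import fitz  # PyMuPDF


def extract_text(pdf_path: str) -> str:
    """Concatenate the plain text of every page in the PDF."""
    with fitz.open(pdf_path) as doc:
        return "\n".join(page.get_text() for page in doc)


def chunk_text(text: str, max_chars: int = 1000) -> list[str]:
    """Greedy paragraph packing: start a new chunk once max_chars is reached."""
    chunks, current = [], ""
    for paragraph in text.split("\n\n"):
        if current and len(current) + len(paragraph) > max_chars:
            chunks.append(current.strip())
            current = ""
        current += paragraph + "\n\n"
    if current.strip():
        chunks.append(current.strip())
    return chunks


chunks = chunk_text(extract_text("example.pdf"))  # "example.pdf" is a placeholder path
```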
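Embedding the chunks and searching them in DuckDB might look roughly like this. It assumes the `all-MiniLM-L6-v2` sentence-transformers model and a DuckDB release recent enough to ship `list_cosine_similarity`; the table layout and model choice are illustrative, not necessarily the notebook's exact setup.

```python
import duckdb
from sentence_transformers import SentenceTransformer

# Placeholder chunks; in the notebook these come from the PDF extraction/chunking step.
chunks = [
    "Embeddings map text to points in a vector space.",
    "DuckDB is an in-process analytical database.",
    "Reranking rescoring improves retrieval quality.",
]

model = SentenceTransformer("all-MiniLM-L6-v2")  # assumed model name
vectors = model.encode(chunks, normalize_embeddings=True)

con = duckdb.connect()  # in-memory database
con.execute("CREATE TABLE docs (id INTEGER, text VARCHAR, embedding FLOAT[])")
con.executemany(
    "INSERT INTO docs VALUES (?, ?, ?)",
    [(i, text, vec.tolist()) for i, (text, vec) in enumerate(zip(chunks, vectors))],
)

query = "What does an embedding do?"
query_vec = model.encode(query, normalize_embeddings=True).tolist()

# Rank stored chunks by cosine similarity to the query embedding.
hits = con.execute(
    """
    SELECT text, list_cosine_similarity(embedding, ?::FLOAT[]) AS score
    FROM docs
    ORDER BY score DESC
    LIMIT 3
    """,
    [query_vec],
).fetchall()
print(hits)
```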
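Reranking can be sketched with a sentence-transformers `CrossEncoder`: the vector search retrieves candidates cheaply, and the cross-encoder then rescores each query-candidate pair. The `cross-encoder/ms-marco-MiniLM-L-6-v2` checkpoint is a common choice, not necessarily the one used in the workshop.

```python
from sentence_transformers import CrossEncoder

# Placeholder query and candidates; normally these are the top hits from the vector search.
query = "What does an embedding do?"
candidates = [
    "Embeddings map text to points in a vector space.",
    "DuckDB is an in-process analytical database.",
    "Reranking rescoring improves retrieval quality.",
]

reranker = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")  # assumed checkpoint
scores = reranker.predict([(query, text) for text in candidates])

# Re-order candidates by cross-encoder score, highest first.
reranked = [
    text
    for _, text in sorted(zip(scores, candidates), key=lambda pair: pair[0], reverse=True)
]
print(reranked)
```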
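Finally, a bare-bones RAG step just packs the best chunks into a prompt and hands it to an LLM. The prompt template and the `ask_llm` call below are placeholders for the notebook's own generation step.

```python
def build_prompt(question: str, context_chunks: list[str]) -> str:
    """Assemble a grounded prompt from the retrieved (and reranked) chunks."""
    context = "\n\n".join(context_chunks)
    return (
        "Answer the question using only the context below. "
        "If the answer is not in the context, say you don't know.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )


# Placeholder inputs; in practice the chunks are the top reranked search results.
prompt = build_prompt(
    "What does an embedding do?",
    ["Embeddings map text to points in a vector space."],
)
# answer = ask_llm(prompt)  # hypothetical call to whatever LLM the notebook uses
```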
## Resources

- [MTEB Leaderboard](https://huggingface.co/spaces/mteb/leaderboard) - Compare embedding models
- [Sentence Transformers Documentation](https://www.sbert.net/)
- [DuckDB Documentation](https://duckdb.org/docs/)
- [PyMuPDF Documentation](https://pymupdf.readthedocs.io/)