
End-to-End RAG Pipeline Using Cortex and Streamlit

Precheck (If you have access to Snowflake Notebooks)

  1. Create a stage to store the PDFs (the demo uses a stage called RAG); a sketch of the SQL follows this list.
  2. Load the PDFs into the stage via the UI.
  3. Go to Projects » Notebooks and upload 4_rag_sf_notebook.ipynb.
  4. In the Packages drop-down, add the packages listed in the environment.yml file.
  5. Run the notebook!
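
If you prefer to create the stage from a notebook cell instead of the UI, here is a minimal sketch assuming an active Snowpark session; the DIRECTORY and ENCRYPTION settings are assumptions (they let Python UDFs read staged files via scoped URLs), not necessarily the repo's exact DDL.

    from snowflake.snowpark.context import get_active_session

    session = get_active_session()  # Snowflake Notebooks provide an active session

    # Create the stage the demo uses; the directory table and server-side
    # encryption are assumed so UDFs can later read the staged PDFs.
    session.sql("""
        CREATE STAGE IF NOT EXISTS RAG
            DIRECTORY = (ENABLE = TRUE)
            ENCRYPTION = (TYPE = 'SNOWFLAKE_SSE')
    """).collect()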

Precheck (If running from Visual Studio Code)

  1. Use the environment.yml file to set up your Python environment for the demo:

    • Examples in the terminal:
      • conda env create -f environment.yml
      • micromamba create -f environment.yml -y
  2. Create a .env file and populate it with your account details (a session-creation sketch follows the example):

    SNOWFLAKE_ACCOUNT = abc123
    SNOWFLAKE_USER = username
    SNOWFLAKE_PASSWORD = your_password
    SNOWFLAKE_ROLE = your_role
    SNOWFLAKE_WAREHOUSE = warehouse_name
    SNOWFLAKE_DATABASE = database_name
    SNOWFLAKE_SCHEMA = schema_name
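
A minimal sketch of turning that .env file into a Snowpark session, assuming python-dotenv is installed; the notebooks may construct their session differently.

    import os

    from dotenv import load_dotenv
    from snowflake.snowpark import Session

    load_dotenv()  # pull the .env values into os.environ

    # Build a Snowpark session from the environment variables defined above.
    session = Session.builder.configs({
        "account": os.environ["SNOWFLAKE_ACCOUNT"],
        "user": os.environ["SNOWFLAKE_USER"],
        "password": os.environ["SNOWFLAKE_PASSWORD"],
        "role": os.environ["SNOWFLAKE_ROLE"],
        "warehouse": os.environ["SNOWFLAKE_WAREHOUSE"],
        "database": os.environ["SNOWFLAKE_DATABASE"],
        "schema": os.environ["SNOWFLAKE_SCHEMA"],
    }).create()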
    

Step 1: Run notebook 1_rag.ipynb

This notebook will:

  • Create a stage for your unstructured documents (PDFs in this case).
  • Create a UDF named readpdf that reads a PDF in as raw text.
  • Create a UDF that chunks the text using LangChain.
  • Create a vector store, using Cortex to generate embeddings from the chunks. (The two UDF bodies are sketched below.)
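
A minimal sketch of the two UDF bodies, assuming PyPDF2 for PDF parsing and LangChain's RecursiveCharacterTextSplitter for chunking; the notebook's exact libraries, chunk sizes, and registration code may differ.

    from io import BytesIO

    from langchain.text_splitter import RecursiveCharacterTextSplitter
    from PyPDF2 import PdfReader
    from snowflake.snowpark.files import SnowflakeFile

    def readpdf(file_path: str) -> str:
        """Read a staged PDF (via a scoped file URL) and return its raw text."""
        with SnowflakeFile.open(file_path, "rb") as f:
            reader = PdfReader(BytesIO(f.read()))
        return "\n".join(page.extract_text() or "" for page in reader.pages)

    def chunk_text(text: str) -> list:
        """Split the raw text into overlapping chunks; the sizes are assumptions."""
        splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100)
        return splitter.split_text(text)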

Step 2: Run 2_rag.sql

  • Show the vector store.
  • Create a table that will track all inputs and outputs from the Streamlit app.
  • Showcase how to query the most relevant chunks from the vector store.
  • Showcase how to use Cortex LLMs to answer questions from the relevant chunks (both queries are sketched below).
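
A minimal sketch of the retrieval and generation steps run through Snowpark; the table and column names (CHUNKS, CHUNK, CHUNK_VEC) and the model names (e5-base-v2, mistral-7b) are assumptions and may differ from 2_rag.sql.

    question = "What % of Snowflake customers process unstructured data?"

    # Retrieve the three chunks closest to the question embedding.
    top_chunks = session.sql(
        """
        SELECT CHUNK
        FROM CHUNKS
        ORDER BY VECTOR_COSINE_SIMILARITY(
            CHUNK_VEC,
            SNOWFLAKE.CORTEX.EMBED_TEXT_768('e5-base-v2', ?)
        ) DESC
        LIMIT 3
        """,
        params=[question],
    ).collect()

    # Ask a Cortex LLM to answer using only the retrieved context.
    context = " ".join(row["CHUNK"] for row in top_chunks)
    answer = session.sql(
        "SELECT SNOWFLAKE.CORTEX.COMPLETE('mistral-7b', ?)",
        params=[f"Answer using only this context: {context}\nQuestion: {question}"],
    ).collect()[0][0]
    print(answer)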

Step 3: Streamlit App Integration

Copy the Streamlit app code into Streamlit in Snowflake (SiS), ask the question "What % of Snowflake customers process unstructured data?", and watch it in action! A minimal sketch of the app follows.
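
This sketch folds the Step 2 retrieval into a single query and logs each exchange, assuming a tracking table named RESPONSES; all object and model names here are illustrative, not the repo's exact code.

    import streamlit as st
    from snowflake.snowpark.context import get_active_session

    session = get_active_session()  # SiS supplies an active session automatically

    st.title("RAG over your PDFs")
    question = st.text_input(
        "Ask a question",
        "What % of Snowflake customers process unstructured data?",
    )

    if question:
        answer = session.sql(
            """
            SELECT SNOWFLAKE.CORTEX.COMPLETE(
                'mistral-7b',
                'Answer using only this context: ' || (
                    SELECT LISTAGG(CHUNK, ' ')
                    FROM (
                        SELECT CHUNK FROM CHUNKS
                        ORDER BY VECTOR_COSINE_SIMILARITY(
                            CHUNK_VEC,
                            SNOWFLAKE.CORTEX.EMBED_TEXT_768('e5-base-v2', ?)
                        ) DESC
                        LIMIT 3
                    )
                ) || ' Question: ' || ?
            )
            """,
            params=[question, question],
        ).collect()[0][0]
        st.write(answer)
        # Log the exchange to the tracking table created in 2_rag.sql (assumed schema).
        session.sql(
            "INSERT INTO RESPONSES (QUESTION, ANSWER) VALUES (?, ?)",
            params=[question, answer],
        ).collect()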
