EnterpriseData-with-LLM

Generative AI is opening up many exciting use cases across industries. Creating real business value from it, however, requires integrating Large Language Models (LLMs) with the enterprise knowledge base. LLMs are trained on publicly available internet data, not on proprietary enterprise-specific knowledge, so they may hallucinate and give incorrect answers to enterprise-specific questions.

In this repository, I present a way to quickly (within 1–2 hours) and securely integrate your enterprise data (Confluence pages, Salesforce data, CRM data, relational databases, manuals, etc.) with Large Language Models (LLMs). It is a full end-to-end solution: no model training, fine-tuning, or extensive deployment is needed, and you do not need specific AI/ML experience or extensive developer knowledge to deploy it. The answers are grounded in your organisation's specific knowledge, which helps avoid factuality issues such as hallucinations and out-of-context responses.

This solution enables business use cases such as:

Improving customer experience: intelligent chatbots that answer from enterprise data, such as order status or account balance.
Increasing internal employee productivity: generating enterprise-specific proposals, marketing material, manuals, job descriptions, etc.
Internal search engine: searching code repositories, internal documents, etc.

If you want to read more about the architecture choices made here, see the Medium blog post: https://medium.com/@Sachin.Kulkarni.NL/generative-ai-with-enterprise-data-3c81a8bffaf2

Architecture of solution:
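
The app's file name (Kendra-RAG-StreamlitApp.py) suggests a retrieval-augmented generation flow: passages are retrieved from the Kendra index and handed to the LLM together with the question. The sketch below only illustrates that grounding step. The function name, the prompt wording, and the character budget are hypothetical and are not taken from this repository.

```python
from typing import List


def build_grounded_prompt(question: str, passages: List[str], max_chars: int = 4000) -> str:
    """Assemble a prompt that asks the model to answer only from the given passages.

    Hypothetical helper for illustration; the real app may build its prompt differently.
    """
    context_parts = []
    used = 0
    for i, passage in enumerate(passages, start=1):
        entry = f"[{i}] {passage.strip()}"
        if used + len(entry) > max_chars:
            break  # stop before the context grows past the character budget
        context_parts.append(entry)
        used += len(entry)

    context = "\n".join(context_parts) if context_parts else "(no passages retrieved)"
    return (
        "Answer the question using only the context below. "
        "If the context does not contain the answer, say you do not know.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question.strip()}\nAnswer:"
    )
```

Numbering the passages makes it possible to ask the model to cite which passage supported its answer, and the explicit "say you do not know" instruction is one common way to discourage answers that go beyond the retrieved context.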

Here are the steps to implement this solution:

1. Set up your AWS SageMaker Studio environment and clone the repository

Log in to your AWS account, select a region (e.g. Ireland (eu-west-1)), and navigate to the Amazon SageMaker management console. Click the Studio link on the left, then click the Open Studio link.
This launches Amazon SageMaker Studio in a new browser window or tab. In Studio, click File in the top menu, then open a terminal. In the terminal tab, type: git clone https://github.com/sachink2010/EnterpriseData-with-LLM

2. Set up your Kendra index using the AWS Console:

Upload the files in the Bank Financial Statements folder to your S3 bucket
Create a Kendra index
Add the S3 bucket as a data source and set the sync schedule (for example, periodic) based on your needs
Follow the steps shown in the src/CreateKendraIndex folder
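
Once the data source has synced, it can help to confirm that the index returns results before starting the app. The following is a minimal sketch, assuming boto3 is installed and your credentials can call the Kendra Query API. The helper name top_excerpts is hypothetical and not part of this repository, and the client is passed in so the function can be tried with a stub.

```python
from typing import Any, Dict, List


def top_excerpts(kendra_client: Any, index_id: str, question: str, limit: int = 3) -> List[Dict[str, str]]:
    """Query a Kendra index and return title/excerpt pairs for the first few results."""
    response = kendra_client.query(IndexId=index_id, QueryText=question)
    results = []
    for item in response.get("ResultItems", [])[:limit]:
        results.append(
            {
                # Missing fields fall back to an empty string
                "title": item.get("DocumentTitle", {}).get("Text", ""),
                "excerpt": item.get("DocumentExcerpt", {}).get("Text", ""),
            }
        )
    return results


# Example usage (needs boto3 and AWS credentials, so it is left commented out):
# import boto3
# client = boto3.client("kendra", region_name="eu-west-1")
# for hit in top_excerpts(client, "<your-index-id>", "What was the net income?"):
#     print(hit["title"], "->", hit["excerpt"])
```

If the list comes back empty, the data source sync may not have finished yet, or the index ID or region may not match the index you created.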

3. Run your Streamlit app in the SageMaker Studio terminal and start using the app

Set the kendra_index_id variable in the Kendra-RAG-StreamlitApp.py file to match the index you created in step 2.
In the SageMaker terminal window, type: streamlit run ./EnterpriseData-with-LLM/src/streamlit/Kendra-RAG-StreamlitApp.py --server.port 6006
You will see a message in your terminal session that the Streamlit app is running

Copy your SageMaker domain ID, the region where SageMaker Studio is running, and the Streamlit port from the previous step
In a new tab of the same browser window, open: https://<domain-id>.studio.<region>.sagemaker.aws/jupyter/default/proxy/<port>/ For example: https://d-1n8b7wqrjeyx.studio.eu-west-1.sagemaker.aws/jupyter/default/proxy/6006/
Start using the app
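
The proxy URL from the step above can be assembled from its three parts. This is a small sketch that follows the pattern of the example URL in this README. The function name studio_proxy_url is hypothetical, and the port check is an added assumption rather than something SageMaker requires.

```python
def studio_proxy_url(domain_id: str, region: str, port: int) -> str:
    """Build the SageMaker Studio proxy URL for an app listening on the given port."""
    if not domain_id or not region:
        raise ValueError("domain_id and region must be non-empty")
    if not 1 <= port <= 65535:
        raise ValueError("port must be between 1 and 65535")
    return f"https://{domain_id}.studio.{region}.sagemaker.aws/jupyter/default/proxy/{port}/"


# Values taken from the example in this README
print(studio_proxy_url("d-1n8b7wqrjeyx", "eu-west-1", 6006))
```

Keep the trailing slash, as in the README's example URL.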
