Backend of the Information Detrevial
- Python 3.8 or higher
- pip
-
Clone the repository Run:
git clone [email protected]:Information-Detrieval/project.git
-
Change directory to the project root directory
cd project
-
Create a virtual environment
- On Windows, run:
python -m venv env
- On Unix or MacOS, run:
python3 -m venv .venv
- This will create a virtual environment named
env
- On Windows, run:
-
Activate the virtual environment
- On Windows, run:
.env\Scripts\activate
- On Unix or MacOS, run:
source .venv/bin/activate
- On Windows, run:
-
Install the required packages
- Run:
pip install -r requirements.txt
- Run:
-
project Structure:
- Website_data folder - consists of the extracted content from websites in form of
- html
- json {"title":"", "url": "", "text": ""}
- txt
- pickle
- Website_data folder - consists of the extracted content from websites in form of
-
To run the code with NEW Vectors and NEW data
- Delete the storage folder
- Delete the pkl/documents.pkl file
-
Run
python backend.py
. You might see a loading bar in the console that shows that the vectors have been upserted. -
Goto
chrome://extensions/
in your browser and enable developer mode. Click onLoad unpacked
and select theextension
repo (in the same Github org as this repo). -
Open a new tab and click on the extension icon. You should see the extension popup alongwith the chat UI.