Skip to content

Curiosity recommender system for freedom of thoughts

Notifications You must be signed in to change notification settings

texonom/curiosity

Repository files navigation

Curiosity

Our civilization is built on curiosity. Curiosity recommender system's object is suggesting perfect list after reading documents.

Processing

  1. Notion.so raw data generation
  2. Nosion.so raw data to markdown

1~2 processings are done by texonom/notion-node

  1. Markdown to Huggingface dataset
git clone https://github.com/texonom/texonom-md
python hf_upload.py chroma
  1. Extracted dataset to embedding

Run chroma server

pm2 start conf/chroma.json

Run embedding server

volume=data
model=thenlper/gte-small
docker run -d --name tei --gpus all -p 8080:80 -v $volume:/data --pull always ghcr.io/huggingface/text-embeddings-inference:0.3.0 --model-id $model
python index_to.py pgvector
  1. Use embedding for recommendation

Plan

  • from dictionary dataset without id duplicating (prefer recent one)
  • dataset tagging with date

About

Curiosity recommender system for freedom of thoughts

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages