Skip to content

Commit

Permalink
feat: Add DBpedia OpenAI embedding dataset with 100k vectors (#150)
Browse files Browse the repository at this point in the history
  • Loading branch information
KShivendu authored Jun 3, 2024
1 parent 9598214 commit e026701
Show file tree
Hide file tree
Showing 2 changed files with 9 additions and 1 deletion.
2 changes: 1 addition & 1 deletion .github/workflows/continuous-benchmark.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -26,7 +26,7 @@ jobs:
DATASET_TO_ENGINE["laion-small-clip"]="qdrant-continuous-benchmark"
DATASET_TO_ENGINE["msmarco-sparse-100K"]="qdrant-sparse-vector"
DATASET_TO_ENGINE["h-and-m-2048-angular-filters"]="qdrant-continuous-benchmark"
DATASET_TO_ENGINE["dbpedia-openai-1M-1536-angular"]="qdrant-bq-continuous-benchmark"
DATASET_TO_ENGINE["dbpedia-openai-100K-1536-angular"]="qdrant-bq-continuous-benchmark"
for dataset in "${!DATASET_TO_ENGINE[@]}"; do
export ENGINE_NAME=${DATASET_TO_ENGINE[$dataset]}
Expand Down
8 changes: 8 additions & 0 deletions datasets/datasets.json
Original file line number Diff line number Diff line change
Expand Up @@ -66,6 +66,14 @@
"path": "dbpedia-openai-1M-1536-angular/dbpedia_openai_1M",
"link": "https://storage.googleapis.com/ann-filtered-benchmark/datasets/dbpedia_openai_1M.tgz"
},
{
"name": "dbpedia-openai-100K-1536-angular",
"vector_size": 1536,
"distance": "cosine",
"type": "tar",
"path": "dbpedia-openai-100K-1536-angular/dbpedia_openai_100K",
"link": "https://storage.googleapis.com/ann-filtered-benchmark/datasets/dbpedia_openai_100K.tgz"
},
{
"name": "msmarco-sparse-100K",
"type": "sparse",
Expand Down

0 comments on commit e026701

Please sign in to comment.