New feature: Report query latencies and index size #5

kwang2049 · 2023-02-11T23:11:01Z

This PR adds a new feature: The query latency details will be tracked and reported; the index size will be also reported.

Changes:

modified: examples/inference/distilsplade_max/beir_scifact/all_in_one.sh: Check out this example for the final demo report!
modified: sparse_retrieval/inference/aio.py: Add new arguments to search.run and evaluate.run;
modified: sparse_retrieval/inference/search.py:
- Add new argument output_latency to specify the output latency file.
- Add a LatencyReporter;
- Use the LatencyReporter to record each searcher.search/batch_search and report the latency details into a file called latency.tsv (under the same path of run.tsv);
  - Each line of latency.tsv is f"{qid}\t{word_length}\t{latency}\t{batch_size}\n"

modified: sparse_retrieval/inference/evaluate.py:

Add new argument latency_path and index_path as the paths to the stats source;
Add new argument bins to specify the binning query latencies wrt. how many word-length bins.

Summarize and report latency details:

  latency_info = {
    "latency": {
        "latency_avg": np.mean(latencies),
        "query_word_length_avg": np.mean(word_lengths),
        "binned": {
            "word_length_bins": word_length_bins.tolist(),
            "freqs": freqs.tolist(),
            "latencies_avg": binned_latencies_avg,
            "latencies_std": binned_latencies_std
        },
        "batch_size": np.mean(batch_sizes),
        "processor": get_processor_name()
    }
}

Report index size in MB.

modified: sparse_retrieval/inference/utils.py: Some new util functions to support the new features.

kwang2049 · 2023-02-11T23:26:03Z

This PR goes after the successor PR #4. Please first deal with #4 and then come back to this

This reverts commit f0c4600.

Now report query latencies and index size

39e9ec3

kwang2049 changed the title ~~Feature query latency and index size~~ New feature: Report query latencies and index size Feb 11, 2023

kwang2049 changed the base branch from main to BUG_ckpt_name_accepts_list_only February 11, 2023 23:11

kwang2049 requested a review from thakur-nandan February 11, 2023 23:25

kwang2049 added 6 commits February 12, 2023 00:47

correct variable name

f0c4600

Revert "correct variable name"

85d3515

This reverts commit f0c4600.

correct variable name

e0e9f21

new line at end

2ba2859

new line at end

bd219be

added overall std

12fece1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New feature: Report query latencies and index size #5

New feature: Report query latencies and index size #5

kwang2049 commented Feb 11, 2023 •

edited

Loading

kwang2049 commented Feb 11, 2023

New feature: Report query latencies and index size #5

Are you sure you want to change the base?

New feature: Report query latencies and index size #5

Conversation

kwang2049 commented Feb 11, 2023 • edited Loading

kwang2049 commented Feb 11, 2023

kwang2049 commented Feb 11, 2023 •

edited

Loading