Skip to content

docs(perf): P5 Phase-2 design spec — quality + infra blind-spots (LOC-70)#74

Merged
dev07060 merged 2 commits into
mainfrom
docs/loc-70-p5-spec
Jun 1, 2026
Merged

docs(perf): P5 Phase-2 design spec — quality + infra blind-spots (LOC-70)#74
dev07060 merged 2 commits into
mainfrom
docs/loc-70-p5-spec

Conversation

@dev07060

@dev07060 dev07060 commented Jun 1, 2026

Copy link
Copy Markdown
Owner

P5 Phase-2 design spec (LOC-70)

docs/perf/ondevice-query-profiler/DESIGN-P5.md — repo-side spec for the four blind-spot targets beyond the Phase-1 latency baseline, each grounded in code (file:line), with an on-device recipe + the query_metrics-pattern hook + a pass/fail verdict:

  1. e2e hybrid recall (quality, 1순위) — f32-exact ground truth vs production i8-HNSW+BM25-RRF; no Rust change (Dart-side SQLite brute-force).
  2. activate 247ms decomposition — BM25-rebuild vs HNSW-load+Box::leak; Rust ActivateTimings.
  3. concurrency jitter + ranking integrity — global RwLock singleton; data_generation does not guard the active-collection swap. ⚠️ FRB serializes FFI per-isolate → the integrity guard is the higher-value deliverable.
  4. SQLite I/O scaling — filtered exact-scan I/O dominates the 0.76ms; find the inversion point vs embed (27ms) → per-collection doc cap.

Verified premise correction: in the shipped build (vector_quant_i8), the HNSW graph is built on dequantize_i8_to_f32 vectors (source_rag.rs:886-899) — not original f32 — so recall measures graph-approximation and i8 distortion together.

Doc-only; independent of #72/#73. Implementation gated on P3/P4 merge. Spec mirrors Linear LOC-70. 머지는 본인.

…-70)

Repo-side spec mirroring the LOC-70 ticket: 4 grounded targets (e2e hybrid
recall, activate decomposition, concurrency jitter + ranking integrity, SQLite
I/O scaling). Records the verified premise that the shipped HNSW is built on
i8-dequantized vectors (source_rag.rs:886-899), so recall measures graph
approximation + i8 distortion together. Each target carries code anchors, an
on-device recipe, the query_metrics-pattern hook, and the pass/fail verdict.
Implementation gated on P3/P4 merge.
@linear-code

linear-code Bot commented Jun 1, 2026

Copy link
Copy Markdown

LOC-70

@dev07060 dev07060 merged commit b574df6 into main Jun 1, 2026
6 checks passed
@dev07060 dev07060 deleted the docs/loc-70-p5-spec branch June 1, 2026 08:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant