NeuralCache 🧠⚡

Adaptive reranker for Retrieval-Augmented Generation (RAG)

NeuralCache is a lightweight reranker for RAG pipelines that actually remembers what helped. It blends dense semantic similarity with a narrative memory of past wins and stigmergic pheromones that reward helpful passages while decaying stale ones—then spices in MMR diversity and ε-greedy exploration. The result: more relevant context for your LLM without rebuilding your stack.

NeuralCache is an opinionated, stateful reranking layer designed to increase practical usefulness of RAG retrieval results by remembering what historically mattered, decaying stale signals, maintaining diversity, and optimizing compute via intelligent gating. The repository is production-minded (CI, packaging, adapters, metrics) yet approachable with minimal dependencies out of the box. Its architecture cleanly separates scoring components, adapters, and API surfaces, making it a solid foundation for iterative improvement and integration into existing LLM pipelines.

This repository open-sources the NeuralCache reranker. The broader “Cognitive Tetrad” engine remains proprietary IP and is not included here.

⚡ 60-second quickstart

# 1. Install
pip install neuralcache

# 2. Launch the API (Ctrl+C to stop)
uvicorn neuralcache.api.server:app --port 8080 --reload

# 3. Hit the reranker
curl -s -X POST http://127.0.0.1:8080/rerank \
  -H "Content-Type: application/json" \
  -d '{
    "query":"What is stigmergy?",
    "documents":[
      {"id":"a","text":"Stigmergy is indirect coordination via shared context."},
      {"id":"b","text":"Vector DBs store embeddings for retrieval."}
    ],
    "top_k":2
  }' | python -m json.tool

Prefer a single command? 👇

pip install neuralcache && \
uvicorn neuralcache.api.server:app --port 8080 --reload & \
server_pid=$! && sleep 3 && \
curl -s -X POST http://127.0.0.1:8080/rerank -H "Content-Type: application/json" \
     -d '{"query":"What is stigmergy?","documents":[{"id":"a","text":"Stigmergy is indirect coordination."},{"id":"b","text":"Vector DBs store embeddings."}],"top_k":2}' | python -m json.tool && \
kill $server_pid

Need batch reranking or Prometheus metrics?

pip install neuralcache[ops]
uvicorn neuralcache.api.server_plus:app --port 8081 --reload

Batch endpoint: POST http://127.0.0.1:8081/rerank/batch
Metrics scrape: GET http://127.0.0.1:8081/metrics (requires the prometheus-client dependency supplied by the ops extra)
Legacy routes remain available under /v1/...

Why teams choose NeuralCache

Drop-in reranker for any retriever that can send JSON. Works with Pinecone, Weaviate, Qdrant, Chroma—or your own Postgres table.
Narrative memory (EMA) keeps track of passages that consistently helped users, biasing future reranks toward them.
Stigmergic pheromones reward useful documents but decay over time, preventing filter bubbles.
MMR + ε-greedy introduces diversity without tanking relevance.
Zero external dependencies by default. Uses a hashing trick for embeddings so you can see results instantly, but slots in any vector model when you’re ready.
Adapters included. LangChain and LlamaIndex adapters ship in neuralcache.adapters; install them on demand with pip install "neuralcache[adapters]".
CLI + REST API + FastAPI docs give you multiple ways to integrate and debug.
Plus API adds /rerank/batch and Prometheus-ready /metrics endpoints when you run uvicorn neuralcache.api.server_plus:app (install the neuralcache[ops] extra for dependencies).
SQLite persistence out of the box. neuralcache.storage.sqlite_state.SQLiteState keeps narrative + pheromone state durable across workers without JSON file juggling.
Cognitive gating right-sizes the rerank set on the fly, trimming obvious non-starters to save downstream tokens without losing recall.
Transparent scoring spec documented in docs/SCORING_MODEL.md for auditability and reproducible benchmarks.

Use cases

Customer support copilots → surface articles with the exact resolution steps.
Internal knowledge bases → highlight documents that past agents actually referenced.
Vertical SaaS (legal/health/finance) → pair compliance-ready snippets with LLM summaries.
Evaluation harnesses → measure and tune Context-Use@K uplift before going live.

How it works

Signal	What it captures	Why it matters
Dense similarity	Cosine distance over embeddings (hash-based fallback out of the box)	Makes sure obviously relevant passages rank high.
Narrative EMA	Exponential moving average of successful context windows	Remembers story arcs across multi-turn conversations.
Stigmergic pheromones	Exposure-aware reinforcement with decay	Rewards docs that helped recently while fading stale ones.
MMR diversity	Maximal Marginal Relevance	Reduces redundancy and surfaces complementary evidence.
ε-greedy exploration	Occasional exploration of long-tail docs	Keeps fresh signals flowing so the model doesn’t get stuck.

All of this is orchestrated by neuralcache.rerank.Reranker, configurable through Settings or environment variables (NEURALCACHE_*).

Cognitive gating

NeuralCache now ships with an entropy-aware gating layer that decides how many candidates to score for each query. The gate looks at the dense similarity distribution, estimates uncertainty with a softmax entropy probe, and then uses a logistic curve to select a candidate budget between your configured min/max bounds.

Modes: off (never trims), auto (entropy-driven; default), on (always apply gating using provided thresholds).
Overrides: Pass a gating_overrides dict on /rerank or /rerank/batch calls to tweak mode, min/max candidates, threshold, or temperature per request.
Observability: Enable return_debug=true to receive gating telemetry (mode, uncertainty, chosen candidate count, masked ids) alongside the rerank results.

Gating plugs in before narrative, pheromone, and MMR scoring—so downstream memories and pheromones still receive consistent updates even when the candidate pool shrinks.

Multi-tenancy & namespaces

NeuralCache now supports lightweight logical isolation using a namespace header:

X-NeuralCache-Namespace: tenantA

If omitted, the default namespace is used. Narrative + pheromone feedback effects do not bleed across namespaces. See MULTITENANCY.md for deeper design notes.

Setting	Purpose	Default
`NEURALCACHE_NAMESPACE_HEADER`	Header key to read namespace	`X-NeuralCache-Namespace`
`NEURALCACHE_DEFAULT_NAMESPACE`	Fallback namespace when header missing	`default`
`NEURALCACHE_NAMESPACE_PATTERN`	Validation regex (400 on mismatch)	`^[a-zA-Z0-9_.-]{1,64}$`
`NEURALCACHE_MAX_NAMESPACES`	Optional cap on total in-memory namespaces (including default); LRU evicts oldest non-default when exceeded	unset
`NEURALCACHE_NAMESPACE_EVICTION_POLICY`	Eviction strategy (currently only `lru`)	`lru`
`NEURALCACHE_METRICS_NAMESPACE_LABEL`	If `true`, adds `namespace` label to rerank metrics families	`false`
`NEURALCACHE_NAMESPACED_PERSISTENCE`	If `true`, per-namespace narrative + pheromone JSON files are used	`false`
`NEURALCACHE_NARRATIVE_STORE_TEMPLATE`	Template for per-namespace narrative file	`narrative.{namespace}.json`
`NEURALCACHE_PHEROMONE_STORE_TEMPLATE`	Template for per-namespace pheromone file	`pheromones.{namespace}.json`

Invalid namespaces return a standardized error envelope:

{
  "error": {
    "code": "BAD_REQUEST",
    "message": "Invalid namespace",
    "detail": null
  }
}

Standardized error envelopes

All errors (including validation) resolve to a stable shape documented in docs/ERROR_ENVELOPES.md:

{
  "error": {
    "code": "VALIDATION_ERROR",
    "message": "Validation failed",
    "detail": [ { "loc": ["body","query"], "msg": "Field required" } ]
  }
}

Common codes: BAD_REQUEST, UNAUTHORIZED, NOT_FOUND, ENTITY_TOO_LARGE, VALIDATION_ERROR, RATE_LIMITED, INTERNAL_ERROR.

Privacy & data handling

A concise operator playbook for data classification, retention, and namespace isolation is available in PRIVACY.md. Before production, review both PRIVACY.md and SECURITY.md and set appropriate retention and auth settings.

Configuration essentials

Env var	Purpose	Default
`NEURALCACHE_WEIGHT_DENSE`	Weight on dense similarity	`1.0`
`NEURALCACHE_WEIGHT_NARRATIVE`	Weight on narrative memory	`0.6`
`NEURALCACHE_WEIGHT_PHEROMONE`	Weight on pheromone signal	`0.3`
`NEURALCACHE_MAX_DOCUMENTS`	Safety cap on rerank set size	`128`
`NEURALCACHE_MAX_TEXT_LENGTH`	Hard limit on document length (characters)	`8192`
`NEURALCACHE_STORAGE_DIR`	Where SQLite + JSON state is stored	`storage/`
`NEURALCACHE_STORAGE_PERSISTENCE_ENABLED`	Disable to keep narrative + pheromones in-memory only	`true`
`NEURALCACHE_STORAGE_RETENTION_DAYS`	Days before old state is purged on boot (supports SQLite + JSON)	unset
`NEURALCACHE_STORAGE_RETENTION_SWEEP_INTERVAL_S`	Interval (seconds) for background retention sweeper (0 disables)	`0`
`NEURALCACHE_STORAGE_RETENTION_SWEEP_ON_START`	Run a purge cycle synchronously at startup when true	`true`
`NEURALCACHE_GATING_MODE`	Cognitive gate mode (`off`, `auto`, `on`)	`auto`
`NEURALCACHE_GATING_THRESHOLD`	Uncertainty threshold for trimming	`0.45`
`NEURALCACHE_GATING_MIN_CANDIDATES`	Lower bound for rerank candidates	`8`
`NEURALCACHE_GATING_MAX_CANDIDATES`	Upper bound for rerank candidates	`48`
`NEURALCACHE_GATING_TEMPERATURE`	Softmax temperature when estimating entropy	`1.0`
`NEURALCACHE_DETERMINISTIC`	Force deterministic reranks (seed RNG, disable exploration)	`false`
`NEURALCACHE_DETERMINISTIC_SEED`	Seed used when deterministic mode is enabled	`1337`
`NEURALCACHE_EPSILON`	Override ε-greedy exploration rate (0-1). Ignored when deterministic.	unset
`NEURALCACHE_MMR_LAMBDA_DEFAULT`	Default MMR lambda when request omits/nulls `mmr_lambda`	`0.5`
`NEURALCACHE_NAMESPACE_HEADER`	Header key to read namespace	`X-NeuralCache-Namespace`
`NEURALCACHE_DEFAULT_NAMESPACE`	Fallback namespace when header missing	`default`
`NEURALCACHE_NAMESPACE_PATTERN`	Validation regex (400 on mismatch)	`^[a-zA-Z0-9_.-]{1,64}$`

Adjust everything via .env, environment variables, or direct Settings(...) instantiation. NEURALCACHE_EPSILON (when set) takes precedence over epsilon_greedy setting unless deterministic mode is active. NEURALCACHE_MMR_LAMBDA_DEFAULT supplies fallback diversity weighting when omitted.

Persistence happens automatically using SQLite (or JSON fallback) so narrative and pheromone stores survive restarts. Point NEURALCACHE_STORAGE_DIR at shared storage for multi-worker deployments, or import SQLiteState directly if you need to wire the persistence layer into an existing app container. Under the hood the SQLite state:

enables WAL mode with synchronous=NORMAL so multiple workers can read while a writer appends.
tracks a metadata row with the current schema version (SQLiteState.schema_version()), raising if a newer schema is encountered so upgrades can run explicit migrations before boot.
stores pheromone exposures and timestamps so retention/evaporation policies can prune long-lived records.

Evaluation: prove the uplift

We ship scripts/eval_context_use.py to measure Context-Use@K on any JSONL dataset (query, docs, answer). It can compare a baseline retriever with a NeuralCache-powered candidate. Install the neuralcache[ops] extra to pull in the requests dependency used by the script and Prometheus exporters in one go.

Want to stress-test gating specifically? Run scripts/eval_gating.py to generate a synthetic A/B comparison between the entropy-driven gate and a control configuration. The script logs summaries to stdout and writes a CSV artifact you can pull into spreadsheets or dashboards.

python scripts/eval_context_use.py \
  --api http://localhost:8080 \
  --data data/sample_rag.jsonl \
  --out reports/neuralcache_eval.csv \
  --top-k 5

# Optional: compare against another API host
python scripts/eval_context_use.py \
  --api http://localhost:8000 --data data/sample_rag.jsonl \
  --compare-api http://localhost:8080 --out reports/compare.csv

Example output (toy dataset):

Eval complete in 4.82s | Baseline Context-Use@5: 9/20 | NeuralCache: 13/20

Use the generated CSV to inspect which queries improved, regressions, and latency statistics.

Sample datasets

We ship a small, neutral illustrative dataset at data/sample_eval.jsonl (5 queries) covering:

Stigmergy concept recall
MMR rationale
ε-greedy exploration purpose
Pheromone decay motivation
Narrative memory function

Each line contains:

{"query": "...", "docs": [{"id": "d1", "text": "..."}, ...], "answer": "..."}

Run a smoke eval against a locally running API:

python scripts/eval_context_use.py \
  --api http://127.0.0.1:8080 \
  --data data/sample_eval.jsonl \
  --out reports/sample_eval.csv \
  --top-k 3

Inspect reports/sample_eval.csv for per-query hits. Extend by appending more JSONL lines that follow the same schema; avoid sensitive data—this file is published.

Project layout

neuralcache/
├─ assets/                # Logos, diagrams, and other static media
├─ examples/              # Quickstart notebooks and scripts
├─ scripts/               # Evaluation + operational tooling
├─ src/neuralcache/
│  ├─ api/                # FastAPI app exposing REST endpoints
│  ├─ adapters/           # LangChain + LlamaIndex integrations
│  ├─ metrics/            # Context-Use@K helpers & Prometheus hooks
│  ├─ gating.py           # Cognitive gating heuristics
│  ├─ narrative.py        # Narrative memory tracker
│  ├─ pheromone.py        # Pheromone store with decay/exposure logic
│  ├─ rerank.py           # Core reranking orchestrator
│  └─ config.py           # Pydantic Settings (env + .env aware)
├─ tests/                 # Pytest suite (unit + adapter sanity)
└─ .github/workflows/     # CI, lint, release, docker, code scanning

Metrics & observability

/metrics exposes Prometheus counters for request volume, success rate, and Context-Use@K proxy. Install the neuralcache[ops] extra (bundles prometheus-client) and run the Plus API for an out-of-the-box scrape target.
Structured logging (via rich + standard logging) shows rerank decisions with scores.
Extend telemetry by dropping in OpenTelemetry exporters or shipping events to your own observability stack.

Roadmap

✅ SQLite persistence (drop-in)
✅ Batch /rerank endpoint
✅ LangChain + LlamaIndex adapters
✅ Namespace eviction (LRU)
✅ Namespaced persistence (optional JSON templates)
✅ Metrics namespace labeling (opt-in)
☐ Semantic Context-Use@K metric
☐ Prometheus/OpenTelemetry exporters
☐ Optional Rust / Numba core for hot loops

Have ideas? Open an issue or grab a ticket.

Contributing & community

pip install -e .[dev,test]
pre-commit install
ruff check && mypy && pytest --cov=neuralcache --cov-report=term-missing

Look for good first issues.
Add test coverage for user-visible changes.
Coverage gate currently enforces >=89%. We'll continue to ratchet this upward as core adaptive components gain additional tests (latest uplift added namespace isolation, eviction, namespaced persistence, metrics namespace labeling, narrative purge stale, CR empty candidate fallback, encoder unknown-backend warning, rate limiting & API auth envelopes, batch gating debug, malformed envelopes, retention sweeper, pheromone purge, gating overrides, epsilon override, and narrative resize/skip branches).

Namespace eviction

Set NEURALCACHE_MAX_NAMESPACES to constrain memory growth in multi-tenant scenarios (edge cases where thousands of low-traffic tenants appear). When the cap is reached, the least recently used non-default namespace is evicted (policy lru). The default namespace is never evicted. Access updates recency automatically.

Metrics namespace labeling

Opt-in via NEURALCACHE_METRICS_NAMESPACE_LABEL=true to export parallel Prometheus metrics with a namespace label. Useful for per-tenant latency SLOs and request volume dashboards. When disabled, metrics remain cardinality-safe for large tenant counts.

Namespaced persistence

Enable NEURALCACHE_NAMESPACED_PERSISTENCE=true to write per-namespace narrative + pheromone JSON stores using the templates:

NEURALCACHE_NARRATIVE_STORE_TEMPLATE=narrative.{namespace}.json
NEURALCACHE_PHEROMONE_STORE_TEMPLATE=pheromones.{namespace}.json

This allows selective archival or scrubbing of a single tenant’s adaptive state. SQLite mode continues to provide shared durable state; the namespaced JSON layer is most useful when running the lightweight default (non-SQLite) persistence path or when you want filesystem-level isolation.

PRs with docs, demos, and eval improvements are extra appreciated.

Optionally, join the discussion in #neuralcache on Discord (coming soon—watch this space).

Upgrading

0.3.2

Release 0.3.2 introduces multi-tenant operational features. All changes are backward compatible; existing deployments that do nothing will behave exactly as before.

Key additions:

Namespace cap & eviction: set NEURALCACHE_MAX_NAMESPACES (with policy NEURALCACHE_NAMESPACE_EVICTION_POLICY=lru) to bound memory; default is unlimited.
Namespaced persistence: opt-in with NEURALCACHE_NAMESPACED_PERSISTENCE=true to emit per-namespace JSON state files (templates overrideable with NEURALCACHE_NARRATIVE_STORE_TEMPLATE / NEURALCACHE_PHEROMONE_STORE_TEMPLATE).
Metrics namespace labeling: enable NEURALCACHE_METRICS_NAMESPACE_LABEL=true to expose parallel Prometheus metric families with a namespace label. Leave false to avoid high-cardinality metrics.
Version constant bumped to 0.3.2 (neuralcache.__version__).

No breaking schema migrations were required. SQLite schema version unchanged. If you previously relied on the absence of eviction, simply leave NEURALCACHE_MAX_NAMESPACES unset (or remove it) and behavior matches 0.3.1.

Upgrading checklist

Bump dependency: pip install --upgrade neuralcache.
(Optional) Export per-tenant metrics: set NEURALCACHE_METRICS_NAMESPACE_LABEL=true (assess Prometheus cardinality first).
(Optional) Constrain namespace memory: set NEURALCACHE_MAX_NAMESPACES=<cap>.
(Optional) Enable namespaced JSON persistence: NEURALCACHE_NAMESPACED_PERSISTENCE=true (ensure filesystem ACLs align with privacy expectations).
Restart your API workers; confirm /metrics and rerank endpoints behave as expected.

Future versions will continue to maintain stability for existing Settings fields; newly added fields default to safe inactive behavior unless explicitly enabled.

License

Apache-2.0. The NeuralCache reranker is open source; the broader Cognitive Tetrad engine remains proprietary.

Automation details

Need to replicate our CI? Expand the sections below for workflow templates.

.github/workflows/ci.yml — lint, type-check, test

name: CI

on:
  pull_request:
  push:
    branches: [ main ]

jobs:
  ci:
    runs-on: ubuntu-latest
    strategy:
      matrix:
        python-version: ["3.11", "3.12"]
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: ${{ matrix.python-version }}
      - uses: actions/cache@v4
        with:
          path: ~/.cache/pip
          key: pip-${{ runner.os }}-${{ matrix.python-version }}-${{ hashFiles('pyproject.toml') }}
          restore-keys: pip-${{ runner.os }}-${{ matrix.python-version }}-
      - name: Install
        run: |
          python -m pip install --upgrade pip
          pip install -e .[dev,test]
      - name: Ruff (lint + format check)
        run: ruff check .
      - name: Type-check (mypy)
        run: mypy src
      - name: Pytest
        run: pytest -q --maxfail=1 --disable-warnings --cov=neuralcache --cov-report=xml
      - name: Upload coverage artifact
        uses: actions/upload-artifact@v4
        with:
          name: coverage-xml
          path: coverage.xml

.github/workflows/lint.yml — pre-commit

name: Lint

on:
  pull_request:
  push:
    branches: [ main ]

jobs:
  precommit:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: "3.11"
      - name: Install
        run: |
          python -m pip install --upgrade pip
          pip install -e .[dev]
      - name: Run pre-commit
        run: pre-commit run --all-files

.github/workflows/tests.yml — scheduled coverage

name: Tests

on:
  workflow_dispatch:
  schedule:
    - cron: "0 7 * * *"  # daily @ 07:00 UTC

jobs:
  tests:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: "3.11"
      - name: Install
        run: |
          python -m pip install --upgrade pip
          pip install -e .[test]
      - name: Pytest
        run: pytest -q --maxfail=1 --disable-warnings --cov=neuralcache --cov-report=xml

.github/workflows/release.yml — PyPI publish

name: Release

on:
  push:
    tags:
      - "v*.*.*"

jobs:
  pypi:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: "3.11"
      - name: Build sdist & wheel
        run: |
          python -m pip install --upgrade pip build
          python -m build
      - name: Publish to PyPI
        uses: pypa/gh-action-pypi-publish@release/v1
        with:
          password: ${{ secrets.PYPI_API_TOKEN }}

.github/workflows/docker.yml — GHCR images

name: Docker

on:
  push:
    branches: [ main ]
    tags:
      - "v*.*.*"

jobs:
  docker:
    runs-on: ubuntu-latest
    permissions:
      contents: read
      packages: write
    steps:
      - uses: actions/checkout@v4
      - name: Login to GHCR
        uses: docker/login-action@v3
        with:
          registry: ghcr.io
          username: ${{ github.actor }}
          password: ${{ secrets.GITHUB_TOKEN }}
      - name: Extract version
        id: meta
        run: |
          REF="${GITHUB_REF##*/}"
          if [[ "$GITHUB_REF" == refs/tags/* ]]; then
            echo "tag=$REF" >> $GITHUB_OUTPUT
          else
            echo "tag=latest" >> $GITHUB_OUTPUT
          fi
      - name: Build & push
        uses: docker/build-push-action@v6
        with:
          context: .
          push: true
          tags: |
            ghcr.io/${{ github.repository_owner }}/neuralcache:${{ steps.meta.outputs.tag }}
            ghcr.io/${{ github.repository_owner }}/neuralcache:latest

.github/dependabot.yml

version: 2
updates:
  - package-ecosystem: "pip"
    directory: "/"
    schedule:
      interval: "weekly"
    open-pull-requests-limit: 5
  - package-ecosystem: "github-actions"
    directory: "/"
    schedule:
      interval: "weekly"

Support the project

If NeuralCache saves you time, consider starring the repo or sharing a demo with the community. Contributions, bug reports, and evaluation results are the best way to help the project grow.

Debug envelope fields

Each /rerank response may include a debug object (structure stable across patch releases). For standardized error envelope format see docs/ERROR_ENVELOPES.md.

Field	Description
`gating`	Cognitive gating decision telemetry (mode, uncertainty, counts)
`deterministic`	True when deterministic mode is active (exploration disabled)
`epsilon_used`	Effective epsilon after env override & deterministic suppression
`mmr_lambda_used`	Final MMR lambda applied (request value clamped or default)

Use this for audit logs or offline evaluation dashboards. Avoid parsing internal sub-keys of gating beyond those documented—future versions may extend it.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

NeuralCache 🧠⚡

⚡ 60-second quickstart

Need batch reranking or Prometheus metrics?

Why teams choose NeuralCache

Use cases

How it works

Cognitive gating

Multi-tenancy & namespaces

Standardized error envelopes

Privacy & data handling

Configuration essentials

Evaluation: prove the uplift

Sample datasets

Project layout

Metrics & observability

Roadmap

Contributing & community

Namespace eviction

Metrics namespace labeling

Namespaced persistence

Upgrading

0.3.2

Upgrading checklist

License

Automation details

Support the project

Debug envelope fields

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
.github		.github
assets		assets
data		data
docs		docs
documents		documents
examples		examples
scripts		scripts
src/neuralcache		src/neuralcache
tests		tests
.editorconfig		.editorconfig
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CHANGELOG.md		CHANGELOG.md
DOCS_INDEX.md		DOCS_INDEX.md
Dockerfile		Dockerfile
LICENSE		LICENSE
MAINTAINERS.md		MAINTAINERS.md
MULTITENANCY.md		MULTITENANCY.md
Makefile		Makefile
PRIVACY.md		PRIVACY.md
README.md		README.md
README_NEXT.md		README_NEXT.md
SECURITY.md		SECURITY.md
neuralcache_scaffold.code-workspace		neuralcache_scaffold.code-workspace
pyproject.toml		pyproject.toml

License

Maverick0351a/neuralcache

Folders and files

Latest commit

History

Repository files navigation

NeuralCache 🧠⚡

⚡ 60-second quickstart

Need batch reranking or Prometheus metrics?

Why teams choose NeuralCache

Use cases

How it works

Cognitive gating

Multi-tenancy & namespaces

Standardized error envelopes

Privacy & data handling

Configuration essentials

Evaluation: prove the uplift

Sample datasets

Project layout

Metrics & observability

Roadmap

Contributing & community

Namespace eviction

Metrics namespace labeling

Namespaced persistence

Upgrading

0.3.2

Upgrading checklist

License

Automation details

Support the project

Debug envelope fields

About

Topics

Resources

License

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Languages

Packages