docs(fundamentals): add Architecture and Database pages #21508

Open: bloxster wants to merge 2 commits into release/3.4 from docs/arch-db-r34-2026-05-29

@bloxster
Collaborator

Summary

  • Adds two new Fundamentals pages: Architecture (staged sync, modular processes, flat-DB on MDBX, immutable snapshots, embedded CL) and Database (MDBX engine, datadir layout, snapshot files, real mainnet sizing numbers).
  • Includes a Mermaid "At-a-glance" diagram on the Architecture page.
  • Adds matching entries to llms.txt / llms-full.txt (root + docs/site/static).

This is the release/3.4 counterpart of #21500 (the same change against main). It was originally bundled with #21451 / #21494 and has been split out into a standalone PR for clarity.

Test plan

  • docs-site / build check passes
  • Verify /fundamentals/architecture and /fundamentals/database render with the Mermaid diagram on the deployed Docusaurus preview

@bloxster bloxster marked this pull request as draft May 29, 2026 10:56
Bloxster and others added 2 commits May 29, 2026 15:20
Closes two of the largest content gaps identified in the
[Improving-public-docs](https://github.com/erigontech/erigon-documents/blob/master/public-docs/Improving-public-docs.md)
spec: the high-level Architecture overview and the deep-dive
Database / datadir page.

New pages:
- `/fundamentals/architecture` (sidebar_position 3)
  - At-a-glance ASCII component diagram
  - Staged Sync pipeline + Erigon 3 consolidation notes
  - Modular processes (Sentry/Downloader/Execution/RPC Daemon/TxPool/Caplin)
  - Storage model split (chaindata vs snapshots)
  - Embedded Caplin consensus layer
  - Flat KV state model + RPC-latency rationale
  - Prune-modes vs sync-modes clarification

- `/fundamentals/database` (sidebar_position 15)
  - Datadir directory tree with role per folder
  - MDBX engine properties (no compaction, mmap reads, single-writer)
  - Immutable .seg snapshots + BitTorrent distribution
  - Per-transaction history granularity
  - Real Nov-2024 mainnet + bor-mainnet sizing numbers
  - Why chaindata/ stays small (and is recoverable from snapshots)
  - Tuning flags (--batchSize, --db.size.limit, --db.read.concurrency)
  - Safe-to-delete subdirectories table

Cross-links: both pages link to each other plus existing pages
(Modules, Caplin, Optimizing Storage, Hardware Requirements, Sync Modes).
No content duplication with Optimizing Storage — that page covers
multi-disk tiering recipes, these pages cover concepts.

Sources: erigontech/erigon README "Datadir structure", "Erigon3 datadir
size", "Erigon3 changes from Erigon2", "Modularity", "More Efficient
State Storage" sections.

llms.txt and llms-full.txt (both static/ and root copies) regenerated.

Build: `pnpm run build` passes with onBrokenLinks/onBrokenAnchors: 'throw'.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
…nnet sizing

- Architecture page: ASCII component diagram → Mermaid flowchart
- Database page: remove Polygon (bor-mainnet) disk-size block; we no
  longer maintain that chain
- Add @docusaurus/theme-mermaid@^3.10.0 and enable mermaid in config
- Regenerate llms-full.txt

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
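
For reviewers unfamiliar with the Docusaurus side of this change, the following is a minimal sketch of what enabling Mermaid and strict link checking typically looks like in a Docusaurus 3 config. It is not copied from this PR's diff: the `title`, `url`, and `baseUrl` values are placeholders assumed for illustration, and the real `docusaurus.config` in the repository contains many more fields. Only the `markdown.mermaid` flag, the `@docusaurus/theme-mermaid` theme entry, and the `onBrokenLinks` / `onBrokenAnchors` settings correspond to what the commit messages describe.

```typescript
// Sketch of the relevant fields of a Docusaurus 3 config object.
// In a real project this object would be the default export of
// docusaurus.config.ts; it is kept as a plain constant here so the
// snippet stays self-contained.
const config = {
  // Placeholder site metadata (assumed, not taken from the PR).
  title: 'Docs site (placeholder)',
  url: 'https://example.invalid',
  baseUrl: '/',

  // Fail the build on broken links or anchors, as stated in the
  // first commit message.
  onBrokenLinks: 'throw',
  onBrokenAnchors: 'throw',

  // Let Markdown code fences tagged `mermaid` render as diagrams.
  markdown: {
    mermaid: true,
  },

  // Theme package added in the second commit
  // (@docusaurus/theme-mermaid@^3.10.0 in package.json).
  themes: ['@docusaurus/theme-mermaid'],
};
```

With these settings, a fenced block tagged `mermaid` in a page such as the Architecture page is rendered as a diagram, and a dangling cross-link between the two new pages would fail the build rather than ship silently.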
@bloxster bloxster marked this pull request as ready for review May 29, 2026 13:22
@bloxster bloxster force-pushed the docs/arch-db-r34-2026-05-29 branch from b4a0472 to de8e596 Compare May 29, 2026 13:22
@bloxster bloxster added the docs label May 29, 2026