CUDA header declarations for Layer Normalization (LayerNorm) forward and backward passes by Eamon2009 · Pull Request #66 · Eamon2009/Quadtrix.cpp

Eamon2009 · 2026-06-01T17:19:25Z

Summary

Introduces the CUDA header declarations for Layer Normalization (LayerNorm) forward and backward passes within the quadtrix::cuda namespace. This defines the interfaces required for managing feature normalization and its corresponding gradient tracking during backpropagation.

Key Additions

layernorm_forward: Normalizes the incoming input tensor using learnable gamma (scale) and beta (shift) weights. It writes the result to output while preserving intermediate mean and rstd (reciprocal standard deviation) caches for exact gradient tracking.
layernorm_backward: Computes the backward pass gradients for the inputs (grad_input) as well as the parameter weights (grad_gamma and grad_beta) based on the incoming grad_output.

* docs: report [run_20260530_165216] (~791 tok/s) Includes metrics for generalization gap, throughput (~791 tok/s), and gradient norms. Parameters: 6.68M | lr: 1e-3 | batch: 16 | steps: 6000 - Achieved best validation loss of 4.1319 at step 3900 * docs:report [run_20260530_165216](~791 tok/s) (#61) Includes metrics for generalization gap, throughput (~791 tok/s), and gradient norms. Parameters: 6.68M | lr: 1e-3 | batch: 16 | steps: 6000 - Achieved best validation loss of 4.1319 at step 3900 Co-authored-by: Max <eamon5174@gmail.com> * feat(cuda): add attention forward and backward kernel declarations Introduces the header declarations for `attention_forward` and `attention_backward` operations inside the `quadtrix::cuda` namespace. Configured with support for custom CUDA streams and head partitioning. --------- Co-authored-by: Max <eamon5174@gmail.com>

- Defines `DType` and `DeviceKind` enums supporting standard types (F32, F16, BF16, I32, U8). - Implements `dtype_name` and `dtype_size` metadata helper functions. - Adds an explicit `Status` struct for non-throwing error propagation alongside `checked_mul` for safe allocation size computation. - Introduces `check_cuda` and `abort_on_cuda` error macros and handling mechanisms, exposed via the `QUADTRIX_CUDA_CHECK` macro.

- Introduces the `GeluMode` enum to toggle between `Exact` and `Approximate` mathematical variants. - Declares the `gelu_forward` and `gelu_backward` kernel entrypoints. - Configures both signatures with optional stream execution and a default mode of `GeluMode::Approximate`.

Eamon2009 · 2026-06-01T17:21:02Z

/run-checks

github-actions · 2026-06-01T17:22:17Z

✅ All checks passed!

codeenthusiasm23 · 2026-06-01T17:25:57Z

/run-checks

github-actions · 2026-06-01T17:26:09Z

@codeenthusiasm23 Only maintainers can trigger checks.

github-actions · 2026-06-01T17:26:17Z

❌ Some checks failed — see Actions for details.

Eamon2009 and others added 6 commits June 1, 2026 01:00

feat(cuda): add checkpoint metadata struct and stub functions

4aac832

feat(cuda): add TokenBatchView struct and DataLoader stub class

7c94958

feat(cuda): add gradient norm calculation and clipping interfaces

28117dc

Eamon2009 requested a review from codeaddict-119 June 1, 2026 17:19

Eamon2009 assigned Eamon2009 and codeaddict-119 Jun 1, 2026

Eamon2009 added the cuda label Jun 1, 2026

codeenthusiasm23 approved these changes Jun 1, 2026

View reviewed changes

codeaddict-119 requested a review from codeenthusiasm23 June 1, 2026 17:23

codeaddict-119 approved these changes Jun 1, 2026

View reviewed changes

codeenthusiasm23 approved these changes Jun 1, 2026

View reviewed changes

Eamon2009 merged commit aef3e1e into codeaddict-master Jun 1, 2026
26 of 27 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CUDA header declarations for Layer Normalization (LayerNorm) forward and backward passes#66

CUDA header declarations for Layer Normalization (LayerNorm) forward and backward passes#66
Eamon2009 merged 6 commits into
codeaddict-masterfrom
master

Eamon2009 commented Jun 1, 2026

Uh oh!

Eamon2009 commented Jun 1, 2026

Uh oh!

github-actions Bot commented Jun 1, 2026

Uh oh!

codeenthusiasm23 commented Jun 1, 2026

Uh oh!

github-actions Bot commented Jun 1, 2026

Uh oh!

github-actions Bot commented Jun 1, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Eamon2009 commented Jun 1, 2026

Summary

Uh oh!

Eamon2009 commented Jun 1, 2026

Uh oh!

github-actions Bot commented Jun 1, 2026

Uh oh!

codeenthusiasm23 commented Jun 1, 2026

Uh oh!

github-actions Bot commented Jun 1, 2026

Uh oh!

github-actions Bot commented Jun 1, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants