This repository has been archived by the owner on Aug 7, 2024. It is now read-only.

bring back torch.autograd.Function for float8 matmul #341

Closed
vkuzo wants to merge 1 commit

Conversation

@vkuzo (Contributor) commented on Jul 26, 2024

Stack from ghstack (oldest at bottom):

Summary:

This is a redo of #316.

With upcoming support of scaling granularities other than tensorwise,
we need a good way to control which gemm kernel to call and how to scale
the input tensors in the forward and backward passes. A `torch.autograd.Function`
override is the cleanest way to do that, and in 2024 this now works with
`torch.compile`.
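As a rough illustration of that pattern (not the code in this PR): the sketch below defines a custom `torch.autograd.Function` whose forward and backward each choose their own scaling before calling a matmul. The names (`Float8MatmulSketch`, `_to_float8`), the tensorwise max-based scaling, and the float32 upcast standing in for a real fp8 gemm kernel (e.g. a scaled-mm call) are all illustrative assumptions.

```python
import torch

def _to_float8(t: torch.Tensor, dtype=torch.float8_e4m3fn):
    # Tensorwise scaling for illustration; other granularities (rowwise,
    # blockwise) would compute a different-shaped scale here.
    scale = torch.finfo(dtype).max / t.abs().max().clamp(min=1e-12)
    return (t * scale).to(dtype), scale

class Float8MatmulSketch(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x, w):
        ctx.save_for_backward(x, w)
        x_fp8, sx = _to_float8(x)
        w_fp8, sw = _to_float8(w)
        # A real implementation would dispatch to an fp8 gemm kernel here;
        # upcasting to float32 keeps this sketch self-contained and runnable.
        out = x_fp8.to(torch.float32) @ w_fp8.to(torch.float32).t()
        return out / (sx * sw)

    @staticmethod
    def backward(ctx, grad_out):
        x, w = ctx.saved_tensors
        # grad_input and grad_weight can use a different dtype / scaling
        # (e.g. e5m2 for gradients); this override is where that is controlled.
        g_fp8, sg = _to_float8(grad_out, dtype=torch.float8_e5m2)
        w_fp8, sw = _to_float8(w)
        x_fp8, sx = _to_float8(x)
        grad_x = (g_fp8.to(torch.float32) @ w_fp8.to(torch.float32)) / (sg * sw)
        grad_w = (g_fp8.to(torch.float32).t() @ x_fp8.to(torch.float32)) / (sg * sx)
        return grad_x, grad_w

# Usage: works eagerly and, per the summary above, composes with torch.compile.
x = torch.randn(16, 32, requires_grad=True)
w = torch.randn(64, 32, requires_grad=True)
y = Float8MatmulSketch.apply(x, w)
y.sum().backward()
```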

Test Plan:

```
./test/test_everything.sh
```

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: D60291396

@facebook-github-bot added the CLA Signed label on Jul 26, 2024
@vkuzo (Contributor, Author) commented on Jul 26, 2024

@vkuzo has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@vkuzo (Contributor, Author) commented on Jul 26, 2024

Started a new PR due to a ghstack error.

@vkuzo closed this on Jul 26, 2024