MX: hook up mxfp8 and mxfp4 CUTLASS kernels to MXLinear #1713

vkuzo · 2025-02-14T00:02:15Z

Summary:

add a kernel choice setting to MXLinearConfig to choose between
emulated gemm and CUTLASS gemm
respect the setting in the torch.mm op override
numerical tests to match emulated vs real e2e
activations/weights/grads

Test Plan:

pytest test/prototype/mx_formats/ -s -x

Reviewers:

Subscribers:

Tasks:

Tags:

[ghstack-poisoned]

vkuzo · 2025-02-14T00:02:16Z

Stack from ghstack (oldest at bottom):

-> MX: hook up mxfp8 and mxfp4 CUTLASS kernels to MXLinear #1713

pytorch-bot · 2025-02-14T00:02:18Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1713

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Summary: 1. add a kernel choice setting to `MXLinearConfig` to choose between emulated gemm and CUTLASS gemm 2. respect the setting in the torch.mm op override 3. numerical tests to match emulated vs real e2e activations/weights/grads Test Plan: ``` pytest test/prototype/mx_formats/ -s -x ``` Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 1d12e235c90ad3f46335654d62d3376efc035d84 ghstack-comment-id: 2657958104 Pull Request resolved: #1713

[ghstack-poisoned]

Summary: 1. add a kernel choice setting to `MXLinearConfig` to choose between emulated gemm and CUTLASS gemm 2. respect the setting in the torch.mm op override 3. numerical tests to match emulated vs real e2e activations/weights/grads Test Plan: ``` pytest test/prototype/mx_formats/ -s -x ``` Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 9234dba200094da88c2d13bc20e198aa7ffb5af6 ghstack-comment-id: 2657958104 Pull Request resolved: #1713

torchao/prototype/mx_formats/README.md

[ghstack-poisoned]

Summary: 1. add a kernel choice setting to `MXLinearConfig` to choose between emulated gemm and CUTLASS gemm 2. respect the setting in the torch.mm op override 3. numerical tests to match emulated vs real e2e activations/weights/grads Test Plan: ``` pytest test/prototype/mx_formats/ -s -x ``` Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 55f72a0e0d18898c2f02fb2d88c537b382ed5a67 ghstack-comment-id: 2657958104 Pull Request resolved: #1713

[ghstack-poisoned]

Summary: 1. add a kernel choice setting to `MXLinearConfig` to choose between emulated gemm and CUTLASS gemm 2. respect the setting in the torch.mm op override 3. numerical tests to match emulated vs real e2e activations/weights/grads Test Plan: ``` pytest test/prototype/mx_formats/ -s -x ``` Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 55f72a0e0d18898c2f02fb2d88c537b382ed5a67 ghstack-comment-id: 2657958104 Pull Request resolved: #1713

vkuzo added 11 commits January 29, 2025 20:32

Update

c834520

[ghstack-poisoned]

Update

85da297

[ghstack-poisoned]

Update

0d3d6f9

[ghstack-poisoned]

Update

82b543d

[ghstack-poisoned]

Update

0220b19

[ghstack-poisoned]

Update

e4b5ded

[ghstack-poisoned]

Update

7c1166e

[ghstack-poisoned]

Update

8819b28

[ghstack-poisoned]

Update

1064e83

[ghstack-poisoned]

Update

2439930

[ghstack-poisoned]

Update

e596007

[ghstack-poisoned]

vkuzo mentioned this pull request Feb 14, 2025

mx formats: create MXLinearConfig #1688

Merged

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 14, 2025

vkuzo mentioned this pull request Feb 14, 2025

MX: move block_size and elem_dtype into MXLinearConfig #1689

Merged

vkuzo added the topic: improvement Use this tag if this PR is an improvement (doesn't fit into any of the other categories) label Feb 14, 2025

Update

12add12

[ghstack-poisoned]

vkuzo requested review from drisspg and danielvegamyhre February 14, 2025 17:15

drisspg approved these changes Feb 14, 2025

View reviewed changes

drisspg reviewed Feb 14, 2025

View reviewed changes

torchao/prototype/mx_formats/README.md Show resolved Hide resolved

vkuzo added 2 commits February 14, 2025 15:46

Update

d49b604

[ghstack-poisoned]

Update

fc78cf8

[ghstack-poisoned]

Update

ef02134

[ghstack-poisoned]

vkuzo changed the base branch from gh/vkuzo/26/head to main February 14, 2025 23:47

vkuzo merged commit 8fc49fe into main Feb 14, 2025
16 of 26 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MX: hook up mxfp8 and mxfp4 CUTLASS kernels to MXLinear #1713

MX: hook up mxfp8 and mxfp4 CUTLASS kernels to MXLinear #1713

vkuzo commented Feb 14, 2025

vkuzo commented Feb 14, 2025 •

edited

Loading

pytorch-bot bot commented Feb 14, 2025 •

edited

Loading

MX: hook up mxfp8 and mxfp4 CUTLASS kernels to MXLinear #1713

MX: hook up mxfp8 and mxfp4 CUTLASS kernels to MXLinear #1713

Conversation

vkuzo commented Feb 14, 2025

vkuzo commented Feb 14, 2025 • edited Loading

pytorch-bot bot commented Feb 14, 2025 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1713

vkuzo commented Feb 14, 2025 •

edited

Loading

pytorch-bot bot commented Feb 14, 2025 •

edited

Loading