MX: move block_size and elem_dtype into MXLinearConfig #1689
Conversation
Stack from ghstack (oldest at bottom):
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1689

Note: Links to docs will display an error until the docs builds have been completed. This comment was automatically generated by Dr. CI and updates every 15 minutes.
ghstack-source-id: 92913315b65e8b19a55bab3a51c54ed56c1bb4f4
ghstack-comment-id: 2649054194
Pull Request resolved: #1689
```python
# TODO(future PR): refactor to make this cleaner
elem_dtype_weight_override: Optional[Any] = None
elem_dtype_grad_output_override: Optional[Any] = None

# If True, uses a custom triton kernel for fp4 dequantize
use_fp4_custom_triton_dequant_kernel: bool = False
```
Do you think we will want to keep this public?
Unlikely, but IMO we can punt on that until later.
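For context on the thread above, here is a minimal sketch of how these fields could sit together on `MXLinearConfig` after this PR. The two override fields and the fp4 kernel flag come from the diff above; the defaults for `block_size` and `elem_dtype` are assumptions (32 is the block size in the OCP MX spec, and `torch.float8_e4m3fn` is one plausible element dtype), not taken from the PR.

```python
from dataclasses import dataclass
from typing import Any, Optional

import torch


@dataclass
class MXLinearConfig:
    # number of elements that share one scale (assumed default: MX spec block size)
    block_size: int = 32

    # low precision element dtype used for input, weight, and grad_output
    # (assumed default; the actual default may differ)
    elem_dtype: Any = torch.float8_e4m3fn

    # per-tensor overrides, from the diff above
    # TODO(future PR): refactor to make this cleaner
    elem_dtype_weight_override: Optional[Any] = None
    elem_dtype_grad_output_override: Optional[Any] = None

    # If True, uses a custom triton kernel for fp4 dequantize
    use_fp4_custom_triton_dequant_kernel: bool = False
```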
Summary:
Moves `block_size` and `elem_dtype` into `MXLinearConfig` and updates all callsites.

Before
After
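The original "After" snippet was likewise lost. Under the same assumptions, the post-change callsite might look like the sketch below: the knobs move onto a single `MXLinearConfig` object that is passed through instead.

```python
import torch
import torch.nn as nn

# assumed import paths; the config module location is an assumption
from torchao.prototype.mx_formats.config import MXLinearConfig
from torchao.prototype.mx_formats.mx_linear import swap_linear_with_mx_linear

m = nn.Sequential(nn.Linear(32, 64), nn.Linear(64, 32))
# after: the knobs live on one config object, so new options can be added
# without touching every callsite's signature
config = MXLinearConfig(elem_dtype=torch.float8_e4m3fn, block_size=32)
swap_linear_with_mx_linear(m, config=config)
```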
Test Plan: CI