Refactor quantizer: Only replace with per-tensor variants #14974

DrJessop · 2025-10-09T23:59:26Z

Summary:
In our previous flow, we would replace ops with default variants, have a special fusion pass which constructs singleton tensors for a variety of fused quantized ops, and then we would call a replace ops to turn them into per-tensor-variants.

I confirmed this was for legacy reasons, so a cleanup was much due.

This diff also fixes any ref implementations during the refactor.

Reviewed By: zonglinpeng

Differential Revision: D83873738

pytorch-bot · 2025-10-09T23:59:29Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14974

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 7 New Failures, 3 Pending, 2 Unrelated Failures

As of commit f84f831 with merge base 3591604 ():

NEW FAILURES - The following jobs have failed:

pull / test-qnn-wheel-packages-linux (3.10) / linux-job (gh)
RuntimeError: Command docker exec -t 297301d1134feab0072b736c043d0f312df73d9b10c19c863255b94c306e6591 /exec failed with exit code 1
pull / test-qnn-wheel-packages-linux (3.11) / linux-job (gh)
RuntimeError: Command docker exec -t 1143182cdeee6c4f973722ec91196a98ac38a534e9836db7368ed8e3647c9bb3 /exec failed with exit code 1
pull / test-qnn-wheel-packages-linux (3.12) / linux-job (gh)
RuntimeError: Command docker exec -t c268dab6438c90b6bcc7a69e1630184e34466ca5ecd82a1816d0f88b995f340c /exec failed with exit code 1
pull / test-samsung-models-linux / linux-job (gh)
RuntimeError: Command docker exec -t 629b8a0d8e0933879bc8a281ee14cf4e4cf3a13cc34d6eba84d666683673e9dc /exec failed with exit code 1
pull / unittest / macos / macos-job (gh)
export/tests/test_target_recipes.py::TestTargetRecipes::test_linear_model
pull / unittest-editable / macos / macos-job (gh)
export/tests/test_target_recipes.py::TestTargetRecipes::test_mv3_model
Test CUDA Builds / test-voxtral-cuda-e2e / linux-job (gh)
RuntimeError: Command docker exec -t ca04ed5fa003d36bb31d101b0215b5573373e4c61a3c56bd63bb29810b850b5e /exec failed with exit code 2

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / unittest / windows / windows-job (gh) (trunk failure)
Process completed with exit code 1.
pull / unittest-editable / windows / windows-job (gh) (trunk failure)
Process completed with exit code 1.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

meta-codesync · 2025-10-09T23:59:34Z

@DrJessop has exported this pull request. If you are a Meta employee, you can view the originating Diff in D83873738.

github-actions · 2025-10-10T00:00:13Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

) Summary: In our previous flow, we would replace ops with default variants, have a special fusion pass which constructs singleton tensors for a variety of fused quantized ops, and then we would call a replace ops to turn them into per-tensor-variants. I confirmed this was for legacy reasons, so a cleanup was much due. This diff also fixes any ref implementations during the refactor. Reviewed By: zonglinpeng Differential Revision: D83873738

Summary: Matmul was relying on linear infra which didn't support batched second argument. This adds support. Differential Revision: D84279595

) Summary: In our previous flow, we would replace ops with default variants, have a special fusion pass which constructs singleton tensors for a variety of fused quantized ops, and then we would call a replace ops to turn them into per-tensor-variants. I confirmed this was for legacy reasons, so a cleanup was much due. This diff also fixes any ref implementations during the refactor. Reviewed By: zonglinpeng Differential Revision: D83873738

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 9, 2025

meta-codesync bot added fb-exported meta-exported labels Oct 9, 2025

DrJessop force-pushed the export-D83873738 branch from c434d1e to 3e65bea Compare October 10, 2025 00:08

DrJessop force-pushed the export-D83873738 branch from 3e65bea to 33a47e1 Compare October 10, 2025 16:40

DrJessop force-pushed the export-D83873738 branch from 33a47e1 to 43e31be Compare October 10, 2025 16:44

Andrew Grebenisan added 2 commits October 10, 2025 09:58

Support for batched matmul (pytorch#14956)

98e8b2d

Summary: Matmul was relying on linear infra which didn't support batched second argument. This adds support. Differential Revision: D84279595

DrJessop force-pushed the export-D83873738 branch from 43e31be to f84f831 Compare October 10, 2025 17:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Refactor quantizer: Only replace with per-tensor variants #14974

Refactor quantizer: Only replace with per-tensor variants #14974

DrJessop commented Oct 9, 2025

Uh oh!

pytorch-bot bot commented Oct 9, 2025 •

edited

Loading

Uh oh!

meta-codesync bot commented Oct 9, 2025

Uh oh!

github-actions bot commented Oct 10, 2025

Uh oh!

Uh oh!

Refactor quantizer: Only replace with per-tensor variants #14974

Are you sure you want to change the base?

Refactor quantizer: Only replace with per-tensor variants #14974

Conversation

DrJessop commented Oct 9, 2025

Uh oh!

pytorch-bot bot commented Oct 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14974

❌ 7 New Failures, 3 Pending, 2 Unrelated Failures

Uh oh!

meta-codesync bot commented Oct 9, 2025

Uh oh!

github-actions bot commented Oct 10, 2025

This PR needs a release notes: label

Uh oh!

Uh oh!

pytorch-bot bot commented Oct 9, 2025 •

edited

Loading

This PR needs a `release notes:` label