
SAC: fix recompute tag propagation for ops with list[tensor] inputs #152195


Closed

bdhirsh wants to merge 5 commits

Conversation

@bdhirsh (Contributor) commented Apr 25, 2025

There's an "are we compiling" check in SAC, which we rely on to know when to propagate recompute tags during tracing.

This check was a bit brittle, and missed cases where input ops accept lists of tensors. I updated it to check whether a `FunctionalTensorMode` is active, which should be a 100% reliable way to know if AOTDispatcher is in the middle of running.

There is a long-standing followup here around unifying `torch.compiler.is_compiling()` to work in all cases. We should probably just update it to always check whether FakeMode/FunctionalMode are active, and use that here. That has a bit of BC risk, though, so I opted for the more local fix to SAC.
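
For illustration, here is a minimal sketch of the kind of mode-based check described above. It is not the exact diff in this PR: the helper name `_is_under_aot_dispatch` is made up, while `torch._C._get_dispatch_mode` and `torch._C._TorchDispatchModeKey.FUNCTIONAL` are existing internal PyTorch APIs for querying active torch-dispatch modes:

```python
import torch

def _is_under_aot_dispatch() -> bool:
    """Hypothetical helper (illustration only, not the PR's exact code).

    AOTDispatcher runs functionalization through a FunctionalTensorMode
    torch-dispatch mode. Checking whether that mode is currently active
    avoids sniffing individual tensor arguments, which is the approach
    that broke for ops taking list[Tensor] inputs.
    """
    return (
        torch._C._get_dispatch_mode(torch._C._TorchDispatchModeKey.FUNCTIONAL)
        is not None
    )
```

In plain eager mode this returns False; while AOTDispatcher is tracing (e.g. under `torch.compile`), its functionalization pass keeps a `FunctionalTensorMode` active, so the check returns True regardless of the op's input schema.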

Stack from ghstack (oldest at bottom):

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @chenyang78 @kadeng @chauhang @amjames


pytorch-bot bot commented Apr 25, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/152195

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEV

There is 1 currently active SEV. If your PR is affected, please view it below:

✅ You can merge normally! (1 Unrelated Failure)

As of commit 4c09af6 with merge base a4a7716:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

bdhirsh added 2 commits April 25, 2025 10:44
…r] inputs"

There's an "are we compiling" check in SAC, which we rely on to know when to propagate recompute tags during tracing.

This check was a bit brittle, and missed cases where input ops accept list of tensors - I updated it to check if a `FunctionalTensorMode` is active, which should be a 100% reliable way to know if AOTDispatcher is in the middle of running.

There is a long-standing followup here around unifying `torch.compiler.is_compiling()` to work in all cases. We should probably just update it to always check if FakeMode/FunctionalMode are active and use it there. This has a bit of BC risk though so I opted for the more local fix to SAC.




cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames

[ghstack-poisoned]
…r] inputs"

There's an "are we compiling" check in SAC, which we rely on to know when to propagate recompute tags during tracing.

This check was a bit brittle, and missed cases where input ops accept list of tensors - I updated it to check if a `FunctionalTensorMode` is active, which should be a 100% reliable way to know if AOTDispatcher is in the middle of running.

There is a long-standing followup here around unifying `torch.compiler.is_compiling()` to work in all cases. We should probably just update it to always check if FakeMode/FunctionalMode are active and use it there. This has a bit of BC risk though so I opted for the more local fix to SAC.




cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames

[ghstack-poisoned]
@albanD albanD removed their request for review April 29, 2025 20:56
…r] inputs"

There's an "are we compiling" check in SAC, which we rely on to know when to propagate recompute tags during tracing.

This check was a bit brittle, and missed cases where input ops accept list of tensors - I updated it to check if a `FunctionalTensorMode` is active, which should be a 100% reliable way to know if AOTDispatcher is in the middle of running.

There is a long-standing followup here around unifying `torch.compiler.is_compiling()` to work in all cases. We should probably just update it to always check if FakeMode/FunctionalMode are active and use it there. This has a bit of BC risk though so I opted for the more local fix to SAC.




cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames

[ghstack-poisoned]
@bdhirsh bdhirsh added the release notes: autograd release notes category label Apr 29, 2025
@bdhirsh (Contributor, Author) commented Apr 29, 2025

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Apr 29, 2025
@pytorchmergebot (Collaborator)

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

@pytorchmergebot (Collaborator)

Merge failed

Reason: 1 job has failed: trunk / macos-py3-arm64 / test (default, 2, 3, macos-m1-stable)

Details for Dev Infra team (raised by workflow job)

…r] inputs"

There's an "are we compiling" check in SAC, which we rely on to know when to propagate recompute tags during tracing.

This check was a bit brittle, and missed cases where input ops accept list of tensors - I updated it to check if a `FunctionalTensorMode` is active, which should be a 100% reliable way to know if AOTDispatcher is in the middle of running.

There is a long-standing followup here around unifying `torch.compiler.is_compiling()` to work in all cases. We should probably just update it to always check if FakeMode/FunctionalMode are active and use it there. This has a bit of BC risk though so I opted for the more local fix to SAC.




cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames

[ghstack-poisoned]
@pytorchmergebot (Collaborator)

Starting merge as part of PR stack under #152688

pytorchmergebot pushed a commit that referenced this pull request May 5, 2025
…2688)

We never added a proper test for the fix from #134661

Pull Request resolved: #152688
Approved by: https://github.com/kwen2501
ghstack dependencies: #152195
@github-actions github-actions bot deleted the gh/bdhirsh/658/head branch June 15, 2025 02:22
Labels: ciflow/inductor, ciflow/trunk, Merged, module: dynamo, release notes: autograd