[cp] dispatch flex_attention on DTensor to cp implementation #151900
base: gh/XilunWu/136/base
Conversation
[ghstack-poisoned]
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/151900
Note: Links to docs will display an error until the docs builds have been completed.
❌ 3 New Failures, 1 Unrelated Failure as of commit 11c155d with merge base b7c7000.
NEW FAILURES - The following jobs have failed.
BROKEN TRUNK - The following job failed but was also present on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
This PR needs a
ghstack-source-id: 939ab50
Pull Request resolved: #151497

DTensor HOP call in TorchDispatchMode
ghstack-source-id: 939ab50
Pull Request resolved: #151685

[cp] Add e2e flex_attention test w/ causal masking
ghstack-source-id: 939ab50
Pull Request resolved: #151900

[cp] Context Parallel: dispatch flex_attention to CP impl in TorchDispatchMode
ghstack-source-id: 939ab50
Pull Request resolved: #151903
Looks like this PR hasn't been updated in a while, so we're going to go ahead and mark this as
ghstack-source-id: af830b8
Pull Request resolved: pytorch/pytorch#151900
Stack from ghstack (oldest at bottom):
cc @H-Huang @awgu @wanchaol @fegin @fduwjj @wz337 @wconstab @d4l3k
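As background for the PR title: `TorchDispatchMode` is the mechanism this stack uses to redirect `flex_attention` calls to the context-parallel implementation. The PR's actual CP dispatch logic is not shown here; the following is only a minimal, generic sketch of how a `TorchDispatchMode` intercepts ATen ops before they execute (the `LoggingDispatchMode` class and its `seen` attribute are illustrative names, not part of this PR):

```python
import torch
from torch.utils._python_dispatch import TorchDispatchMode


class LoggingDispatchMode(TorchDispatchMode):
    """A toy mode that records every ATen op dispatched under it.

    A CP dispatcher would instead inspect ``func`` and, for the ops it
    cares about, call its own implementation rather than ``func`` itself.
    """

    def __init__(self):
        super().__init__()
        self.seen = []

    def __torch_dispatch__(self, func, types, args=(), kwargs=None):
        # ``func`` is an OpOverload, e.g. aten.add.Tensor; record it,
        # then fall through to the default implementation.
        self.seen.append(str(func))
        return func(*args, **(kwargs or {}))


with LoggingDispatchMode() as mode:
    x = torch.ones(2, 2)
    y = x + x

# Ops executed inside the ``with`` block were observed by the mode.
print(any("add" in name for name in mode.seen))
```

The key property a CP dispatcher relies on is that the mode sees every op (including ops on plain tensors and factory functions) while it is active, so it can reroute a specific higher-order op to a different implementation without the caller changing its code.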