
[pt2] [Precompile] Store parameters in BundledAOTAutogradCacheEntry #155433


Open
jamesjwu opened this issue Jun 9, 2025 · 2 comments
Assignees: @jamesjwu
Labels: actionable, in progress, oncall: pt2, triaged (this issue has been looked at by a team member, and triaged and prioritized into an appropriate module)

Comments

jamesjwu (Contributor) commented Jun 9, 2025

🚀 The feature, motivation and pitch

AOTDispatch has an extra wrapper here that adds the flattened GraphModule parameters to the runtime arguments passed into the AOTAutograd callable:

full_args.extend(params_flat)

These need to be saved by BundledAOTAutogradCacheEntry or elsewhere for precompile.
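Conceptually, the wrapper captures the flattened parameters at compile time and splices them into the runtime argument list before calling the compiled function, which is why a precompile artifact loaded in a fresh process needs some way to recover them. Below is a minimal standalone sketch of that pattern (not the actual AOTDispatch code; `TinyModel`, `compiled_fn`, and `user_inputs` are illustrative names):

```python
# Minimal sketch of the pattern described above, not PyTorch's internals.
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(4, 4)

    def forward(self, x):
        return self.linear(x)

mod = TinyModel()

# Flattened module parameters, analogous to `params_flat` in the wrapper.
params_flat = list(mod.parameters())  # [weight, bias]

def compiled_fn(full_args):
    # Stand-in for the AOTAutograd runtime callable, which expects the lifted
    # parameters alongside the user inputs.
    weight, bias, x = full_args
    return F.linear(x, weight, bias)

user_inputs = [torch.randn(2, 4)]
full_args = []
full_args.extend(params_flat)  # the `full_args.extend(params_flat)` call above
full_args.extend(user_inputs)
out = compiled_fn(full_args)
```

On a cache hit from a BundledAOTAutogradCacheEntry in a new process, `params_flat` is not captured anywhere, so either the tensors themselves or a recipe for re-lifting them has to live in (or alongside) the entry.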

Alternatives

No response

Additional context

No response

cc @chauhang @penguinwu

@jamesjwu jamesjwu added the triaged label (this issue has been looked at by a team member, and triaged and prioritized into an appropriate module) Jun 9, 2025
@jamesjwu jamesjwu self-assigned this Jun 9, 2025
@jamesjwu jamesjwu changed the title [Precompile] Store parameters in BundledAOTAutogradCacheEntry [pt2] [Precompile] Store parameters in BundledAOTAutogradCacheEntry Jun 9, 2025
bdhirsh (Contributor) commented Jun 9, 2025

Some quick thoughts: I would have imagined that instead of saving the params in the BundledAOTAutogradCacheEntry, we would want to have some similar logic to lift them at runtime into extra graph arguments. Also, is there a repro that uses inline_inbuilt_nn_modules=True where we see the params/buffers that AOT has to lift?

(1) the params themselves can be large tensors, so won't reading/writing them directly to the cache cause problems? (longer read/write latency, also blowing up the size of the cache)

(2) The background here is that... AOTDispatcher used to lift all params/buffers into graph inputs a while ago, but after @anijain2305 landed inline_inbuilt_nn_modules=True, most params/buffers started getting lifted by dynamo. I don't actually have a good sense of which params/buffers are not handled by this case and still need to be lifted by AOT. Maybe @anijain2305 knows? If we want to avoid saving these params to the cache directly, we probably need to either have dynamo properly lift all of these params/buffers, or figure out what their dynamo source is so we can lift them properly on a cache hit.
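A hedged sketch of that alternative, assuming we persist only a lightweight attribute-path "source" per lifted parameter and resolve it against the live module on a cache hit; `ParamSource` and `resolve_params` are hypothetical names, not existing dynamo/AOTAutograd APIs:

```python
# Hypothetical illustration only: record where a lifted parameter lives on the
# eager module, and re-resolve it at load time instead of storing the tensor.
from dataclasses import dataclass
import torch
import torch.nn as nn

@dataclass(frozen=True)
class ParamSource:
    path: str  # dotted attribute path into the user's nn.Module, e.g. "linear.weight"

    def resolve(self, mod: nn.Module) -> torch.Tensor:
        obj = mod
        for attr in self.path.split("."):
            obj = getattr(obj, attr)
        return obj

def resolve_params(sources: list[ParamSource], mod: nn.Module) -> list[torch.Tensor]:
    # On a cache hit, rebuild params_flat from the live module instead of
    # reading large tensors back out of the cache entry.
    return [s.resolve(mod) for s in sources]

mod = nn.Sequential(nn.Linear(4, 4))
sources = [ParamSource("0.weight"), ParamSource("0.bias")]
params_flat = resolve_params(sources, mod)
assert all(isinstance(p, nn.Parameter) for p in params_flat)
```

This keeps large parameter tensors out of the serialized cache entry (addressing point (1)), at the cost of requiring the same module structure to be available at load time so the sources resolve.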

jamesjwu (Contributor, Author) commented Jun 9, 2025

Ah, it's possible that this is just legacy code, and we don't need to care about it as long as inline_inbuilt_nn_modules=True. I will double check!
