---
file_format: mystnb
kernelspec:
  name: python3
mystnb:
  execution_timeout: 30
  execution_show_tb: True
  merge_streams: True
---

```{code-cell}
:tags: [remove-cell]
import torch

import header_code
```

# Use `torch._dynamo.nonstrict_trace`

Summary:

- Use `torch._dynamo.nonstrict_trace` to trace a function with non-strict tracing inside of a `torch.compile`'d region. You may wish to do this when Dynamo graph breaks on something inside the function and you are sure the function is non-strict traceable.

Consider the following scenario:

```{code-cell}
def get_magic_num():
    # This explicit graph break call is meant to emulate any kind of Dynamo
    # graph break, e.g., the function is implemented in C, or uses some python
    # language feature Dynamo doesn't yet support.
    torch._dynamo.graph_break()
    return torch.tensor([42])


@torch.compile(fullgraph=True)
def func(x):
    n = get_magic_num()
    return x + n


try:
    func(torch.rand(10))
except Exception as e:
    print(e)
```

If we run the code above, we'll get an error from Dynamo, because it sees a graph break even though the user specified `fullgraph=True`.

In these situations, if a user still wants to keep `fullgraph=True`, they typically have several options:

1. The graph break is due to a language feature Dynamo doesn't yet support. In this case, the user either rewrites their code or files an issue on GitHub.
2. The graph break is due to a call to a function implemented in C. In this case, the user can try to use a custom op. The user could also try providing a polyfill (a reference implementation in Python) so that Dynamo can trace through it; see the sketch after this list.
3. Worst case, an internal compiler error. In this case, the user likely has to file an issue on GitHub.
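For option 2, recent PyTorch releases provide `torch.compiler.substitute_in_graph`, which registers a pure-Python reference implementation that Dynamo traces in place of the untraceable function. Here is a minimal sketch, adapted from that API's documented example:

```{code-cell}
import operator

# `operator.indexOf` is implemented in C, so Dynamo cannot trace into it.
# Registering a pure-Python reference implementation lets Dynamo trace the
# polyfill in its place.
@torch.compiler.substitute_in_graph(operator.indexOf)
def indexOf(a, b, /):
    for i, item in enumerate(a):
        if item is b or item == b:
            return i
    raise ValueError("sequence.index(x): x not in sequence")


print(torch.compile(operator.indexOf, fullgraph=True)([1, 2, 3, 4, 5], 3))
# Prints 2, with no graph break.
```

Note that, unlike `nonstrict_trace` below, the polyfill itself is traced by Dynamo, so it must be written in Python that Dynamo supports.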

In addition to all these options, PyTorch provides an alternative, `torch._dynamo.nonstrict_trace`, which can be used if the function call that induced the graph break satisfies certain requirements:

- The requirements of general non-strict tracing.
- The inputs and outputs must contain either basic types (e.g., `int`, `float`, `list`, `dict`, `torch.Tensor`) or user-defined types that are registered with `torch.utils._pytree` (see the sketch after this list).
- The function must be defined outside the `torch.compile`'d region.
- Any non-input value read by the function (e.g., a global tensor) is treated as a constant and is not guarded on.
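To make the pytree requirement concrete, here is a minimal sketch, where `Point` is a made-up container type registered with `torch.utils._pytree` so that its instances can cross the `nonstrict_trace` boundary:

```{code-cell}
import torch.utils._pytree as pytree


class Point:
    def __init__(self, x, y):
        self.x = x  # expected to hold a torch.Tensor
        self.y = y  # expected to hold a torch.Tensor


# Teach pytree how to flatten a Point into its tensor leaves and rebuild it.
pytree.register_pytree_node(
    Point,
    lambda p: ((p.x, p.y), None),          # flatten: (children, context)
    lambda children, _: Point(*children),  # unflatten
)


@torch._dynamo.nonstrict_trace
def norm(p):
    return (p.x * p.x + p.y * p.y).sqrt()


@torch.compile(fullgraph=True)
def f(p):
    # A Point can now be passed into the nonstrict_trace'd function.
    return norm(p) + 1


print(f(Point(torch.rand(3), torch.rand(3))))
```

Without the `register_pytree_node` call, the `Point` argument would be expected to trigger an error at the `nonstrict_trace` boundary.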

When tracing through a call to a `torch._dynamo.nonstrict_trace`'d function, `torch.compile` switches to non-strict tracing, and the FX graph will eventually contain all the relevant tensor operations that happened inside that function.
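The constants caveat from the requirements list is worth a concrete illustration. In this hedged sketch (the global name `magic` is made up), the non-input tensor is baked into the graph as a constant, so rebinding it later may not be reflected in subsequent calls:

```{code-cell}
magic = torch.tensor([1.0])


@torch._dynamo.nonstrict_trace
def add_magic(x):
    # `magic` is not an input, so it is captured as a constant.
    return x + magic


@torch.compile(fullgraph=True)
def g(x):
    return add_magic(x)


print(g(torch.ones(2)))  # tensor([2., 2.])

magic = torch.tensor([100.0])
# No guard was installed on `magic`, so this may still print
# tensor([2., 2.]) instead of recompiling with the new value.
print(g(torch.ones(2)))
```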

Going back to the `get_magic_num` example, we can use `torch._dynamo.nonstrict_trace` to eliminate the graph break:

```{code-cell}
@torch._dynamo.nonstrict_trace
def get_magic_num():
    # This explicit graph break call is meant to emulate any kind of Dynamo
    # graph break, e.g., the function is implemented in C, or uses some python
    # language feature Dynamo doesn't yet support.
    torch._dynamo.graph_break()
    return torch.tensor([42])


@torch.compile(fullgraph=True)
def func(x):
    n = get_magic_num()
    return x + n


print(func(torch.rand(10)))
# No graph break and no error.
```

Note that one can use it inside a `torch.compile`'d region as well:

```{code-cell}
def get_magic_num():
    # This explicit graph break call is meant to emulate any kind of Dynamo
    # graph break, e.g., the function is implemented in C, or uses some python
    # language feature Dynamo doesn't yet support.
    torch._dynamo.graph_break()
    return torch.tensor([42])


@torch.compile(fullgraph=True)
def func(x):
    n = torch._dynamo.nonstrict_trace(get_magic_num)()
    return x + n


print(func(torch.rand(10)))
# No graph break and no error.
```