pytorch/torch/_inductor/runtime

torch._inductor.runtime

This folder contains code needed at runtime by the output code of Inductor. The output code of Inductor will import torch and torch._inductor.runtime, but should not import from other files in torch._inductor.*. Note that this folder also includes code that is needed to perform Triton compilation but is not needed for the final runtime execution of kernels.
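The import rule above can be checked mechanically. The following is a minimal sketch, not part of Inductor: `find_disallowed_imports` is a hypothetical helper that parses a piece of Python source and reports absolute imports that reach into torch._inductor outside of torch._inductor.runtime. The module names in the sample are used only as illustrative strings.

```python
import ast

# Prefixes taken from the rule described above.
ALLOWED_PREFIX = "torch._inductor.runtime"
RESTRICTED_PREFIX = "torch._inductor"


def _is_within(module, prefix):
    """True if `module` is `prefix` itself or a submodule of it."""
    return module == prefix or module.startswith(prefix + ".")


def find_disallowed_imports(source):
    """Hypothetical checker: list imports under torch._inductor that are
    not under torch._inductor.runtime. Relative imports are ignored."""
    disallowed = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            modules = [alias.name for alias in node.names]
        elif isinstance(node, ast.ImportFrom) and node.level == 0 and node.module:
            # "from torch._inductor import x" is treated as "torch._inductor.x".
            modules = [f"{node.module}.{alias.name}" for alias in node.names]
        else:
            continue
        for module in modules:
            if _is_within(module, RESTRICTED_PREFIX) and not _is_within(
                module, ALLOWED_PREFIX
            ):
                disallowed.append(module)
    return sorted(disallowed)


sample = """
import torch
from torch._inductor.runtime import triton_heuristics
from torch._inductor import config
import torch._inductor.codecache
"""
print(find_disallowed_imports(sample))
```

A check like this only inspects import statements in the given source text; it does not follow transitive imports made by the modules themselves.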

The runtime covers the generated Triton/C++ code, which is compiled (sometimes in parallel) when the output code of Inductor is imported. It also includes the autotuning code and the heuristics that decide block sizes for generated code.
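To illustrate the general idea of autotuning a block size, here is a toy sketch in plain Python. It is not Inductor's autotuner and uses no Inductor or Triton APIs: `pick_fastest_config` and `blocked_sum` are hypothetical names, and the "kernel" is just a chunked sum over a list. The approach shown is simply to time each candidate and keep the fastest.

```python
import time


def pick_fastest_config(run, candidates, repeats=3):
    """Time `run(config)` for each candidate and return (best, timings).

    The best-of-`repeats` wall-clock time is used to reduce noise.
    """
    timings = {}
    for config in candidates:
        best = float("inf")
        for _ in range(repeats):
            start = time.perf_counter()
            run(config)
            best = min(best, time.perf_counter() - start)
        timings[config] = best
    return min(timings, key=timings.get), timings


def blocked_sum(data, block_size):
    """Stand-in workload: sum `data` one block at a time."""
    total = 0
    for start in range(0, len(data), block_size):
        total += sum(data[start:start + block_size])
    return total


data = list(range(100_000))
candidates = [16, 256, 4096]  # hypothetical block sizes
best, timings = pick_fastest_config(lambda b: blocked_sum(data, b), candidates)
print("fastest block size:", best)
```

Which candidate wins depends on the machine and the workload, which is why the choice is measured rather than fixed in advance.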

One of the original motivations for this directory split was so that the Triton compile subprocesses could access Triton and our compiler support code while mocking out most of torch, which can take seconds to import (sometimes more than a Triton compile itself). An abandoned prototype of this can be found here.