https://github.com/pytorch/pytorch/pull/164820 introduced a bug where `_StridedShard` falls through to the parent class `Shard`'s `split_tensor` method, which results in incorrect data locality. (I think @ezyang spotted this issue, but we had no test to capture it.)
Meanwhile, I noticed another bug: when we normalize a `_StridedShard` placement, it also triggers the parent class `Shard`'s `split_tensor` method, because a `Shard` object is created [here](0c14f55de6/torch/distributed/tensor/_api.py (L783)). I don't think we ever tested `distribute_tensor` with `_StridedShard` before, so I added a test that compares against the ordered shard.
I use a classmethod because the `_split_tensor` logic differs between `Shard` and `_StridedShard`. Basically, I want to shard local tensors without constructing a `Shard` object:
```python
local_tensor = _StridedShard._make_shard_tensor(dim, tensor, mesh, mesh_dim, split_factor=split_factor)
local_tensor = Shard._make_shard_tensor(dim, tensor, mesh, mesh_dim)
```
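
To illustrate why the classmethod helps, here is a minimal, self-contained sketch: `cls._split_tensor` dispatches to either the plain or the strided split logic without ever instantiating a placement object. `FakeMesh`, the simplified split bodies, and the interleaving details below are assumptions for demonstration only, not the actual DTensor implementation (which also handles padding, uneven shards, etc.).

```python
import torch


class FakeMesh:
    """Toy stand-in for DeviceMesh: one mesh dim with a fixed size and local rank."""

    def __init__(self, size: int, local_rank: int):
        self._size, self._local_rank = size, local_rank

    def size(self, mesh_dim: int) -> int:
        return self._size

    def get_local_rank(self, mesh_dim: int) -> int:
        return self._local_rank


class Shard:
    @classmethod
    def _split_tensor(cls, dim, tensor, num_chunks, **kwargs):
        # Plain contiguous chunking along `dim`.
        return list(torch.chunk(tensor, num_chunks, dim=dim))

    @classmethod
    def _make_shard_tensor(cls, dim, tensor, mesh, mesh_dim, **kwargs):
        # No Shard(...) instance needed: `cls` picks the subclass's split logic.
        shards = cls._split_tensor(dim, tensor, mesh.size(mesh_dim), **kwargs)
        return shards[mesh.get_local_rank(mesh_dim)]


class _StridedShard(Shard):
    @classmethod
    def _split_tensor(cls, dim, tensor, num_chunks, *, split_factor=1):
        # Strided layout (simplified): split into finer pieces, then give each
        # rank every num_chunks-th piece instead of one contiguous block.
        fine = list(torch.chunk(tensor, num_chunks * split_factor, dim=dim))
        return [torch.cat(fine[r::num_chunks], dim=dim) for r in range(num_chunks)]


mesh = FakeMesh(size=2, local_rank=0)
t = torch.arange(8)
print(Shard._make_shard_tensor(0, t, mesh, 0))                          # tensor([0, 1, 2, 3])
print(_StridedShard._make_shard_tensor(0, t, mesh, 0, split_factor=2))  # tensor([0, 1, 4, 5])
```

The two local shards differ, which is exactly why falling through to `Shard`'s split logic for a `_StridedShard` placement gives incorrect data locality.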
Pull Request resolved: https://github.com/pytorch/pytorch/pull/165533
Approved by: https://github.com/XilunWu