pytorch/notes at 1e42fde45eff81845f269e8185f54a19f6d87c5b - pytorch - Gitea: Git for Me

frozenleaves/pytorch

mirror of https://github.com/pytorch/pytorch.git synced 2025-10-31 12:15:03 +08:00

Files

History

Banit Agrawal f39789cdab [PyTorch Pinned Allocator] Add support of reserved pinned memory segment to avoid slow paths (#164501 )

Summary:
This diff adds the feature of allocating a large pinned memory segment upfront based on the provided config. This large segment is then used to serve all the small pinned memory requests to avoid expensive device level APIs (slow paths).

Example:

PYTORCH_CUDA_ALLOC_CONF=pinned_reserve_segment_size_mb:2048

This reserves a 2GB pinned memory segment for the process and then all incoming small requests are just served from this segment and no cudaHostAlloc/cudaHostRegister apis are being called.

Differential Revision: D83779074

Pull Request resolved: https://github.com/pytorch/pytorch/pull/164501
Approved by: https://github.com/yangw-dev

2025-10-03 18:11:27 +00:00

..

amp_examples.rst

Update document for autocast on CPU (#135299 )

2024-09-13 09:11:47 +00:00

autograd.rst

[doc] Add documentation for division by zero behavior in autograd (#155987 )

2025-06-16 19:02:12 +00:00

broadcasting.rst

Fix comment on broadcasting example to clarify dimension mismatch (#162177 )

2025-09-29 16:47:48 +00:00

cpu_threading_torchscript_inference.rst

[3/n] Remove references to TorchScript in PyTorch docs (#158315 )

2025-07-15 21:14:18 +00:00

cuda.rst

[PyTorch Pinned Allocator] Add support of reserved pinned memory segment to avoid slow paths (#164501 )

2025-10-03 18:11:27 +00:00

custom_operators.rst

Redirect the custom ops landing page :D (#139634 )

2024-11-04 22:25:15 +00:00

ddp.rst

Update DDP dynamo debug docs (#118295 )

2024-01-29 14:58:26 +00:00

extending.func.rst

Fix the example in the extending.func.rst (#109279 )

2023-09-14 17:29:39 +00:00

extending.rst

[autograd][docs] Add more details on why save_for_backward is important in extending autograd note (#153005 )

2025-05-09 16:36:57 +00:00

faq.rst

Update faq.rst so OOM section mentions checkpoint (#62709 )

2021-08-05 07:40:08 -07:00

get_start_xpu.rst

update supported OS for Intel client GPU (#161699 )

2025-09-01 05:45:09 +00:00

gradcheck.rst

[BE] fix typos in docs/ (#156080 )

2025-06-21 02:47:32 +00:00

hip.rst

[ROCm] Ck backend UX refactor (#152951 )

2025-08-08 18:40:17 +00:00

large_scale_deployments.rst

[3/n] Remove references to TorchScript in PyTorch docs (#158315 )

2025-07-15 21:14:18 +00:00

libtorch_stable_abi.md

Add ScalarType -> shim conversion, add stable::Tensor.scalar_type (#160557 )

2025-08-19 22:13:47 +00:00

mkldnn.rst

Enable TF32 as fp32 internal precision for matmul/linear/conv (#157520 )

2025-07-17 08:57:34 +00:00

modules.rst

Fix to modules.rst: indent line with activation functions (#139667 )

2024-11-08 01:12:52 +00:00

mps.rst

update mps note with more details (#78669 )

2022-06-02 20:53:19 +00:00

multiprocessing.rst

[BE] fix typos in docs/ (#156080 )

2025-06-21 02:47:32 +00:00

numerical_accuracy.rst

Update warning of TF32 (#158209 )

2025-07-16 01:28:50 +00:00

out.rst

add Out Notes (#151306 )

2025-04-24 20:25:09 +00:00

randomness.rst

[cuBLAS] update cuBLAS determinism docs, remove workspace requirement checks (#161749 )

2025-10-03 00:09:47 +00:00

serialization.rst

Delete sections referencing torchscript in serialization docs (#156648 )

2025-06-25 23:41:24 +00:00

windows.rst

Removing conda references from PyTorch Docs (#152702 )

2025-05-20 20:33:28 +00:00