pytorch

mirror of https://github.com/pytorch/pytorch.git synced 2025-10-20 12:54:11 +08:00

Files

Natalia Gimelshein 37c6087334 Add split-K control to cuBLAS reduced-precision settings (#164766 )

## Summary
- add a CuBLASReductionOption enum so the CUDA context can track reduced-precision and split-K options
- extend the Python bindings, backend helpers, and docs to accept an optional allow_splitk argument for fp16/bf16 matmul controls
- update cuBLAS/cuBLASLt call sites plus dynamo guards and tests to respect the new combinations

## Testing
- python test/test_cuda.py TestCuda.test_cublas_allow_fp16_reduced_precision_reduction_get_set -v *(fails: ModuleNotFoundError: No module named 'psutil')*

------
https://chatgpt.com/codex/tasks/task_e_68e404623178832f8a3e1d34e1e175da

Pull Request resolved: https://github.com/pytorch/pytorch/pull/164766
Approved by: https://github.com/malfet, https://github.com/albanD

2025-10-08 18:48:45 +00:00

cpp

Fix cpp build (#162774 )

2025-09-25 18:21:45 +00:00

source

Add split-K control to cuBLAS reduced-precision settings (#164766 )

2025-10-08 18:48:45 +00:00

.gitignore

.gitignore for the docs folder

2019-10-08 12:18:30 -07:00

libtorch.rst

Add ROCm documentation to libtorch (C++) reST. (#136378 )