mirror of https://github.com/pytorch/pytorch.git synced 2025-10-20 21:14:14 +08:00

Files

Sarthak Tandon 66ea76ec44 [ROCm][tunableop] Improvements to tunableop Numerical Check (#163079 )

Modified the flag PYTORCH_TUNABLEOP_NUMERICAL_CHECK, so that it accepts the numerical tolerances in the format atol_rtol as compared to the previous 0 and 1. Retains previous functionality with default values as well.

Pull Request resolved: https://github.com/pytorch/pytorch/pull/163079
Approved by: https://github.com/naromero77amd, https://github.com/jeffdaily

2025-10-15 22:26:47 +00:00

1.2 KiB

Raw Permalink Blame History

.. currentmodule:: torch.cuda.tunable

TunableOp

Overview

.. automodule:: torch.cuda.tunable

API Reference

.. autofunction:: enable

.. autofunction:: is_enabled

.. autofunction:: tuning_enable

.. autofunction:: tuning_is_enabled

.. autofunction:: record_untuned_enable

.. autofunction:: record_untuned_is_enabled

.. autofunction:: set_max_tuning_duration

.. autofunction:: get_max_tuning_duration

.. autofunction:: set_max_tuning_iterations

.. autofunction:: get_max_tuning_iterations

.. autofunction:: set_filename

.. autofunction:: get_filename

.. autofunction:: get_results

.. autofunction:: get_validators

.. autofunction:: read_file

.. autofunction:: tune_gemm_in_file

.. autofunction:: mgpu_tune_gemm_in_file

.. autofunction:: set_rotating_buffer_size

.. autofunction:: get_rotating_buffer_size

.. autofunction:: set_numerical_check_tolerances

1.2 KiB Raw Permalink Blame History

TunableOp

Overview

API Reference

1.2 KiB

Raw Permalink Blame History