wengshiy 668d414ae7 [CPU] Fix bias dtype issue for FP8 qlinear (#159125)
Fixes
`RuntimeError: self and mat2 must have the same dtype, but got BFloat16 and Float`

With bf16 autocast, the bias is converted to BFloat16, but `fp8_qlinear_onednn_ref` did not support a bf16 bias. This PR makes `fp8_qlinear_onednn_ref` handle a bf16 bias by converting its dtype.
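As a rough illustration of the idea, here is a minimal Python sketch. The actual fix lives in the C++ `fp8_qlinear_onednn_ref` path; the helper name, signature, and dequantization steps below are assumptions for illustration only:

```python
import torch

def fp8_qlinear_ref_sketch(x_fp8, x_scale, w_fp8, w_scale, bias, out_dtype):
    # Hypothetical reference path: dequantize activation and weight to
    # float32 for the reference matmul (names and layout are assumptions).
    x = x_fp8.to(torch.float32) * x_scale          # (B, in_features)
    w = w_fp8.to(torch.float32) * w_scale          # (out_features, in_features)
    if bias is not None and bias.dtype != x.dtype:
        # Under bf16 autocast the bias arrives as BFloat16 while the
        # dequantized operands are Float; align dtypes so addmm does not
        # raise "self and mat2 must have the same dtype".
        bias = bias.to(x.dtype)
    y = torch.addmm(bias, x, w.t()) if bias is not None else x @ w.t()
    return y.to(out_dtype)
```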

This case is added to the unit tests and can be reproduced with:
`python test/test_quantization.py -k test_qlinear_fp8`
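For context, a small standalone sketch (not the actual unit test) of how bf16 autocast turns a float32 bias into BFloat16, and how the resulting `addmm` dtype mismatch surfaces:

```python
import torch

bias = torch.randn(16)          # float32 bias
x = torch.randn(8, 32)
w = torch.randn(16, 32)

# Under CPU bf16 autocast, linear runs in bf16, so the bias is cast too.
with torch.autocast(device_type="cpu", dtype=torch.bfloat16):
    y = torch.nn.functional.linear(x, w, bias)
print(y.dtype)  # torch.bfloat16

# Mixing a bf16 bias with float32 operands in addmm is expected to fail
# with a dtype-mismatch RuntimeError like the one quoted above.
try:
    torch.addmm(bias.to(torch.bfloat16), x, w.t())
except RuntimeError as e:
    print(e)
```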

Pull Request resolved: https://github.com/pytorch/pytorch/pull/159125
Approved by: https://github.com/Xia-Weiwen, https://github.com/cyyever, https://github.com/CaoE
2025-07-31 01:26:45 +00:00