**Summary**

We change the schema of qlinear binary so that it is easier to enable the corresponding GEMM template:
- The extra input of the binary post-op is a tensor that must be an input node for autotuning, so we move it in front of `output_scale`, which is a scalar.
- We also move it in front of `bias`, since `bias` is an optional tensor for this fusion while `other` is required for linear binary fusion.

**Test Plan**
```
python -u -m pytest -s -v test/quantization/core/test_quantized_op.py -k qlinear
python -u -m pytest -s -v test/inductor/test_mkldnn_pattern_matcher.py -k qlinear
```

Pull Request resolved: https://github.com/pytorch/pytorch/pull/129049
Approved by: https://github.com/jgong5, https://github.com/jansel
ghstack dependencies: #128825, #129048
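The argument-order change can be sketched with a toy, unquantized stand-in. The function name and the exact parameter list below are hypothetical (the real op lives in PyTorch's onednn namespace and takes scales/zero points as well); the sketch only illustrates the ordering described above, with the required tensor `other` placed before the optional `bias` and the scalar `output_scale`:

```python
# Toy sketch of a linear + binary-add op using the NEW argument order
# described in this PR: tensor inputs (x, weight, other) come first so an
# autotuner can treat them all as input nodes; the optional bias and the
# scalar output_scale trail behind. All names here are illustrative.
def qlinear_add_sketch(x, weight, other, bias=None, output_scale=1.0):
    # x: M x K, weight: N x K, other: M x N, bias: length-N or None
    M, K, N = len(x), len(x[0]), len(weight)
    out = []
    for m in range(M):
        row = []
        for n in range(N):
            acc = sum(x[m][k] * weight[n][k] for k in range(K))  # GEMM
            if bias is not None:
                acc += bias[n]
            # binary post-op: add the extra tensor input `other`
            row.append(acc * output_scale + other[m][n])
        out.append(row)
    return out


result = qlinear_add_sketch(
    x=[[1.0, 2.0]],
    weight=[[3.0, 4.0], [5.0, 6.0]],
    other=[[10.0, 20.0]],
)
# row of the GEMM is [11, 17]; adding `other` gives [21, 37]
```

Note how the old schema (with `other` after `output_scale` and `bias`) would interleave a scalar between the tensor inputs, which is what made wiring `other` into the autotuning GEMM template awkward.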