74 Commits

Author SHA1 Message Date
3f66dc5049 third_party: ngen: avoid unnecessary copy 2025-10-15 09:08:56 -07:00
8f70dafafd third_party: ngen: auto-SWSB: fix early A@ dependency clearing bug 2025-10-14 15:38:41 -07:00
54c4350589 ngen: workaround for HW bug with cross-pipe cmod WAR hazards 2025-10-07 10:11:44 -07:00
c0c069ef2f ngen: fix SIMD2 dst stride 2025-09-12 08:07:45 -07:00
c793a7a35f ngen: workaround gcc13 bitfield assignment bug
Assignments to *VS0 and *VS1 were being incorrectly optimized away by GCC in
some scenarios.
2025-09-08 10:47:54 -07:00
eddad2d50c third_party: ngen: fix build error with gcc 8.5 2025-09-02 15:09:03 -07:00
fbea4cd863 third_party: ngen: fix isARF() 2025-08-28 16:38:02 -07:00
0d6b37c137 third_party, src: update nGEN 2025-08-27 17:52:51 -07:00
337b34867a ngen: remove conflict with windows.h interface define 2025-08-19 10:14:37 -07:00
831220021b ngen: add ELFCodeGenerator interface setter 2025-08-07 06:32:32 -07:00
6750723b59 xe: jit: ir: enable ngen interface to kernel_iface_t construction 2025-08-05 06:52:04 -07:00
c6e9be775e Cpu: update Xbyak to v7.28 (#3682) 2025-07-31 12:00:12 -04:00
871e4aac01 third_party: spdlog: upgrade version to 1.15.3 2025-07-10 11:44:44 -07:00
1afce61b82 ngen: improve 64-bit emul
In some cases, emulation state may not have any registers so 64-bit
multiplication with an immediate could fail. In particular,
07c15ce29a421a49a46e361fc88e67aff1afbaca happened to enable just such a scenario
causing failures for:

```
benchdnn --reorder --engine=gpu --allow-enum-tags-only=false --sdt=f32 --ddt=f32 --stag=ABcd32a16b --dtag=abcd 1024x256x56x56
```
2025-07-08 07:55:50 -07:00
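For context on the commit above, here is a minimal host-side sketch of how a 64-bit multiply by a 32-bit immediate can be built from 32-bit pieces. It is not nGEN code: the function name `mul64By32Emulated` is hypothetical, and the locals only stand in for the scratch registers an emulation sequence would need, which is the resource the commit message says may be missing from the emulation state.

```cpp
#include <cstdint>

// Hypothetical illustration, not the nGEN implementation.
// Multiplies a 64-bit value by a 32-bit immediate using only 32-bit halves
// and returns the low 64 bits of the product (wraparound semantics).
inline uint64_t mul64By32Emulated(uint64_t a, uint32_t imm) {
    // Split the 64-bit operand into two 32-bit halves.
    uint32_t a_lo = static_cast<uint32_t>(a);
    uint32_t a_hi = static_cast<uint32_t>(a >> 32);

    // Full 32x32 -> 64 product of the low half (needs temporary storage).
    uint64_t lo_prod = static_cast<uint64_t>(a_lo) * imm;

    // Only the low 32 bits of the high-half product affect the result.
    uint32_t hi_prod = a_hi * imm;

    // Combine: the carry out of the low product feeds the high dword.
    uint32_t res_lo = static_cast<uint32_t>(lo_prod);
    uint32_t res_hi = static_cast<uint32_t>(lo_prod >> 32) + hi_prod;
    return (static_cast<uint64_t>(res_hi) << 32) | res_lo;
}
```

The intermediate values `lo_prod` and `hi_prod` show why such a sequence needs temporaries even when one operand is an immediate.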
43cef3c738 cpu: x64: brgemm: enable mem advice support for NVL (#3513) 2025-07-07 21:19:27 -04:00
90dc538525 ngen: rename ELFCodeGenerator::interface
The windows.h header contains the following define:

```
 #define interface                   struct
```

This adversely interacts with the ELFCodeGenerator::interface function and
produces a compilation error. To avoid this interaction, the function
is renamed.
2025-07-07 13:41:35 -07:00
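The collision described in the commit above can be reproduced without windows.h. This is a minimal sketch under stated assumptions: `DemoCodeGenerator`, `kernelInterface`, and `setKernelInterface` are hypothetical names, not the real nGEN class or the names chosen in the rename, and the `#define` merely stands in for the one windows.h provides.

```cpp
#include <string>

// Stand-in for the define quoted in the commit message.
#define interface struct

// Hypothetical class for illustration; not the real ELFCodeGenerator.
class DemoCodeGenerator {
public:
    // With the macro active, a member named `interface` is rewritten to
    // `struct` by the preprocessor, so this declaration would not compile:
    //     const std::string &interface() const { return iface_; }

    // An accessor with a different name is untouched by the macro.
    const std::string &kernelInterface() const { return iface_; }
    void setKernelInterface(const std::string &s) { iface_ = s; }

private:
    std::string iface_;
};
```

Renaming the member avoids the need to `#undef interface`, which could break other Windows headers that rely on the define.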
b565e22dee ngen: improve interface for injectors 2025-07-02 11:20:04 -07:00
03344bbc29 ngen: fix correctness with ZET_ENABLE_PROGRAM_DEBUGGING 2025-07-02 11:20:04 -07:00
42f34b0378 ngen: improve emulated emul with Immediate 2025-07-02 11:20:04 -07:00
66eec0e06b third_party: level_zero: update headers to v1.13.1 2025-07-01 16:59:03 -07:00
6c2cc18b56 xbyak: allow repeated assignment of the same opmask index (#3407) 2025-06-24 16:03:00 +02:00
0bfca9daed third_party: ngen: fixup region advance for emulation 2025-05-28 15:07:06 -07:00
492e96510d cpu: x64: enable hf8 support for matmul on AVX10.2 2025-05-23 17:17:10 -04:00
1c825b46bb build: enable Wundef 2025-05-09 16:52:00 -07:00
a21297e5d3 third_party: gtest: work around Wundef hits from gtest.h
A better solution would be to create a single entry point to gtest.h from
dnnl_gtest_common.hpp using a proxy header file, but that improvement is
left for a later change.
2025-05-09 16:52:00 -07:00
7edc35c37c third_party: itt: fix undefined macro hit 2025-05-09 16:52:00 -07:00
924f90e983 third_party: ngen: enable bf8 emulated mov 2025-05-07 16:51:37 -07:00
b7136b91d5 ngen: address coverity hits 2025-05-06 22:37:21 -07:00
8a8dedc07c third_party: ngen: fix locale handling 2025-05-06 07:58:09 -07:00
2804466041 fixup: xe: FP precision casting warning fix (#3178) 2025-04-29 10:07:07 -07:00
4b8fc9b01e cpu: x64: upstream DMR support to main branch
Co-authored-by: Putta, Srinivas <srinivas.putta@intel.com>
Co-authored-by: Yair Nahum <yair.y.nahum@intel.com>
Co-authored-by: Palicki, Stefan <stefan.palicki@intel.com>
Co-authored-by: Andrey Kalinin <andrey.kalinin@intel.com>
Co-authored-by: Tomasz Czeszun <tomasz.czeszun@intel.com>
Co-authored-by: Yair Nahum <64333457+ynahum@users.noreply.github.com>
2025-04-23 22:34:55 -04:00
8605df92d6 third_party: ngen: extend emulated mov to qword types 2025-04-23 07:26:55 -07:00
f1131cb2d6 third_party: ngen: extend emul for qword src0 2025-04-23 07:26:55 -07:00
47b206c5e2 third_party: ngen: use getHardware interface 2025-04-23 07:26:55 -07:00
5f001dd373 third_party: ngen: add missing defaults 2025-04-17 11:32:28 -07:00
37bda40e31 third_party: level_zero: removed unused Python scripts 2025-04-09 14:51:26 -07:00
6d8a780a35 ngen: add cross-thread data relocations for GTPin 2025-04-09 11:14:16 -07:00
7b395787eb ngen: workaround for SYCL + GCC 12.3 compiler bug 2025-04-02 10:01:08 -07:00
9bef39e84d ngen: fix missing field initialization warning 2025-03-28 07:18:38 -07:00
eaaf1c0847 third_party: ngen: prepare for SYCL generator usage 2025-03-27 13:13:24 -07:00
931cc27d80 ngen: downstream nGEN 2025-03-25 17:12:32 -07:00
607a318d26 ngen: update PVC WAR bug workaround 2025-03-24 15:19:47 -07:00
34f6711818 third_party: spdlog: upgrade version to 1.15.1 2025-03-21 10:05:28 -07:00
b6fab3ecda ngen: add getHardware() interface 2025-03-19 13:58:17 -07:00
37202682c4 ngen: improve macro forwarding to arbitrary scopes
Additionally, includes various tweaks required to get AsmCodeGenerator
to function when forwarded.
2025-03-19 13:58:17 -07:00
7e8504f617 third_party: ngen: fixup insn restrictions 2025-03-19 09:10:57 -07:00
8c811df174 Xbyak: upgrade xbyak version to v7.23.1 (#2867) 2025-03-17 18:31:22 -04:00
055d8cb522 xe: BMG/LNL product detection 2025-03-12 14:03:35 -04:00
b86e408e15 ngen: fix various warnings due to NGEN_ASM 2025-03-04 06:04:01 -08:00
6cd1beaea3 xe: enable building with out-of-tree nGEN 2025-03-04 06:04:01 -08:00