Commit Graph

20962 Commits

Author SHA1 Message Date
cff4bb8aaa doc: add section for quantization-related APIs and refactor 2025-10-16 13:38:24 -07:00
954f7f76f2 xe: jit: fix shr usage 2025-10-16 12:47:19 -07:00
3a756b982b cpu: aarch64: skip f16 to f32 upcast for clip and clip_v2 eltwise (#4101) 2025-10-16 15:06:10 +01:00
e2aa79e849 ci: aarch64: make comparison script friendlier for local testing (#4157) 2025-10-16 13:12:46 +01:00
34caf5740b cpu: aarch64: jit_reorder: fix dispatch issue for direct copy (#4150) 2025-10-16 08:57:31 +01:00
bf6617dd1f tests: benchdnn: utils: improve --buffer-prefix messaging 2025-10-15 18:09:06 -07:00
b5773a0826 xe: sdpa: workaround for xe3 regressions for PTL 2025-10-15 17:55:25 -04:00
34c81a1ca8 xe: gemm: convert HHS kernel to FHS to fix perf regression 2025-10-15 13:53:19 -07:00
7b3f2af13c xe: jit: gemm: fix stream-k slab count calculation 2025-10-15 10:05:31 -07:00
a4874e48ea tests: sdpa: fix fallthrough 2025-10-15 09:08:56 -07:00
21adf4d977 tests: sdpa: fix typo 2025-10-15 09:08:56 -07:00
ff30885a40 xe: sdpa: add check for host scalar fetch 2025-10-15 09:08:56 -07:00
f8ef3008b1 xe: rnn: avoid potential null pointer dereference 2025-10-15 09:08:56 -07:00
201e204682 xe: conv: make conversion explicit 2025-10-15 09:08:56 -07:00
3f66dc5049 third_party: ngen: avoid unnecessary copy 2025-10-15 09:08:56 -07:00
2b06b5c1b5 xe: jit: avoid unnecessary copies 2025-10-15 09:08:56 -07:00
4060edd33c xe: gemm: assert host scalar scale is nonzero 2025-10-15 09:08:56 -07:00
f1302e4804 xe: ir: remove dead code 2025-10-15 09:08:56 -07:00
8df1253973 xe: lrn: remove dead code 2025-10-15 09:08:56 -07:00
27e186ccd9 gtest: check returned ptr on expected md init failure 2025-10-15 10:42:00 +02:00
de84f7f345 benchdnn: properly lock before checking allocations size 2025-10-15 10:42:00 +02:00
34663d344a benchdnn: fix reference check in init_memory 2025-10-15 10:42:00 +02:00
67cd11b889 cpu: x64: matmul fix wei_k_blk query 2025-10-14 23:10:32 -04:00
5ab84bf3df xe: ukernel: Add new ukernel entries for MoE support 2025-10-14 20:05:31 -04:00
5a2b8ccedb scripts: update converter with quantization mode 2025-10-15 02:01:20 +02:00
eed2287b10 doc: document quantization mode and clean up quantization page 2025-10-15 02:01:20 +02:00
5331f9e068 gtest: cover mx input and output scaling in iface tests 2025-10-15 02:01:20 +02:00
76fd406dd4 benchdnn: matmul: add support for mx quantization testing 2025-10-15 02:01:20 +02:00
a3abba35b3 cpu: add mx quantization mode support to matmul primitive 2025-10-15 02:01:20 +02:00
1934858751 common: introduce quantization mode 2025-10-15 02:01:20 +02:00
98ae9938af cpu: enable e8m0->f32 reorder 2025-10-15 02:01:20 +02:00
8f70dafafd third_party: ngen: auto-SWSB: fix early A@ dependency clearing bug 2025-10-14 15:38:41 -07:00
d5e144c943 tests: benchdnn: enable mode p on windows 2025-10-14 14:28:23 -07:00
aed0c1a497 cpu: x64: fix index for comp_ow_kw_s/f (#4133) 2025-10-14 10:40:46 -04:00
edb3380221 CPU : x64: Matmul: fp4 weights support (#3928) 2025-10-14 13:58:14 +01:00
5f36333c57 cpu: x64: matmul: add a robust check for fallback compatibility 2025-10-13 19:21:31 -04:00
7aed66dc5e cpu: x64: fix perf issue for f32 conv (#4107) 2025-10-13 17:47:57 -04:00
7b95adc9fe benchdnn: ref_post_op: eltwise: fix NAN propagation 2025-10-13 12:32:33 -07:00
413d78d68f governance: add Marek Michałowski to onednn-cpu-aarch64 (#4123)
Signed-off-by: Siddhartha Menon <siddhartha.menon@arm.com>
2025-10-13 17:35:59 +01:00
c476f32bae cpu: aarch64: matmul: fix brgemm multidims tag detection (#4100)
Signed-off-by: Siddhartha Menon <siddhartha.menon@arm.com>
2025-10-13 17:06:35 +01:00
f9086bc474 xe: dsl: fixup trivial iteration 2025-10-13 09:05:23 -07:00
15006e1167 benchdnn: graph: avoid vector copy 2025-10-13 16:15:08 +08:00
356cf37f9f graph: backend: dnnl: fix copy 2025-10-13 16:15:08 +08:00
4781eae311 graph: backend: dnnl: initialize member variable 2025-10-13 16:15:08 +08:00
07ac0a566a graph: backend: dnnl: initialize is_invert_scale_ 2025-10-13 16:15:08 +08:00
1eb43285fc benchdnn: graph: flex_rewrite: remove deadcode after the macro SAFE 2025-10-13 16:15:08 +08:00
b18ed0017f benchdnn: graph: displacer: return FAIL for more states
This should fix a nullptr dereference warning on the below line.
2025-10-13 16:15:08 +08:00
95d831e606 benchdnn: inputs: graph: replace multiply with add to pass acc check 2025-10-11 14:39:24 +08:00
105a7cd7fa tests: prepare Eigen_threadpool implementation for new Eigen versions 2025-10-10 16:09:37 -07:00
08beeba4c9 cpu: x64: matmul: refactor pick_blocked_B_layout function 2025-10-10 15:17:49 -04:00