|
cff4bb8aaa
|
doc: add section for quantization-related APIs and refactor
|
2025-10-16 13:38:24 -07:00 |
|
|
954f7f76f2
|
xe: jit: fix shr usage
|
2025-10-16 12:47:19 -07:00 |
|
|
3a756b982b
|
cpu: aarch64: skip f16 to f32 upcast for clip and clip_v2 eltwise (#4101)
|
2025-10-16 15:06:10 +01:00 |
|
|
e2aa79e849
|
ci: aarch64: make comparison script friendlier for local testing (#4157)
|
2025-10-16 13:12:46 +01:00 |
|
|
34caf5740b
|
cpu: aarch64: jit_reorder: fix dispatch issue for direct copy (#4150)
|
2025-10-16 08:57:31 +01:00 |
|
|
bf6617dd1f
|
tests: benchdnn: utils: improve --buffer-prefix messaging
|
2025-10-15 18:09:06 -07:00 |
|
|
b5773a0826
|
xe: sdpa: workaround for xe3 regressions for PTL
|
2025-10-15 17:55:25 -04:00 |
|
|
34c81a1ca8
|
xe: gemm: convert HHS kernel to FHS to fix perf regression
|
2025-10-15 13:53:19 -07:00 |
|
|
7b3f2af13c
|
xe: jit: gemm: fix stream-k slab count calculation
|
2025-10-15 10:05:31 -07:00 |
|
|
a4874e48ea
|
tests: sdpa: fix fallthrough
|
2025-10-15 09:08:56 -07:00 |
|
|
21adf4d977
|
tests: sdpa: fix typo
|
2025-10-15 09:08:56 -07:00 |
|
|
ff30885a40
|
xe: sdpa: add check for host scalar fetch
|
2025-10-15 09:08:56 -07:00 |
|
|
f8ef3008b1
|
xe: rnn: avoid potential null pointer dereference
|
2025-10-15 09:08:56 -07:00 |
|
|
201e204682
|
xe: conv: make conversion explicit
|
2025-10-15 09:08:56 -07:00 |
|
|
3f66dc5049
|
third_party: ngen: avoid unnecessary copy
|
2025-10-15 09:08:56 -07:00 |
|
|
2b06b5c1b5
|
xe: jit: avoid unnecessary copies
|
2025-10-15 09:08:56 -07:00 |
|
|
4060edd33c
|
xe: gemm: assert host scalar scale is nonzero
|
2025-10-15 09:08:56 -07:00 |
|
|
f1302e4804
|
xe: ir: remove dead code
|
2025-10-15 09:08:56 -07:00 |
|
|
8df1253973
|
xe: lrn: remove dead code
|
2025-10-15 09:08:56 -07:00 |
|
|
27e186ccd9
|
gtest: check returned ptr on expected md init failure
|
2025-10-15 10:42:00 +02:00 |
|
|
de84f7f345
|
benchdnn: properly lock before checking allocations size
|
2025-10-15 10:42:00 +02:00 |
|
|
34663d344a
|
benchdnn: fix reference check in init_memory
|
2025-10-15 10:42:00 +02:00 |
|
|
67cd11b889
|
cpu: x64: matmul fix wei_k_blk query
|
2025-10-14 23:10:32 -04:00 |
|
|
5ab84bf3df
|
xe: ukernel: Add new ukernel entries for MoE support
|
2025-10-14 20:05:31 -04:00 |
|
|
5a2b8ccedb
|
scripts: update converter with quantization mode
|
2025-10-15 02:01:20 +02:00 |
|
|
eed2287b10
|
doc: document quantization mode and clean up quantization page
|
2025-10-15 02:01:20 +02:00 |
|
|
5331f9e068
|
gtest: cover mx input and output scaling in iface tests
|
2025-10-15 02:01:20 +02:00 |
|
|
76fd406dd4
|
benchdnn: matmul: add support for mx quantization testing
|
2025-10-15 02:01:20 +02:00 |
|
|
a3abba35b3
|
cpu: add mx quantization mode support to matmul primitive
|
2025-10-15 02:01:20 +02:00 |
|
|
1934858751
|
common: introduce quantization mode
|
2025-10-15 02:01:20 +02:00 |
|
|
98ae9938af
|
cpu: enable e8m0->f32 reorder
|
2025-10-15 02:01:20 +02:00 |
|
|
8f70dafafd
|
third_party: ngen: auto-SWSB: fix early A@ dependency clearing bug
|
2025-10-14 15:38:41 -07:00 |
|
|
d5e144c943
|
tests: benchdnn: enable mode p on windows
|
2025-10-14 14:28:23 -07:00 |
|
|
aed0c1a497
|
cpu: x64: fix index for comp_ow_kw_s/f (#4133)
|
2025-10-14 10:40:46 -04:00 |
|
|
edb3380221
|
CPU : x64: Matmul: fp4 weights support (#3928)
|
2025-10-14 13:58:14 +01:00 |
|
|
5f36333c57
|
cpu: x64: matmul: add a robust check for fallback compatibility
|
2025-10-13 19:21:31 -04:00 |
|
|
7aed66dc5e
|
cpu: x64: fix perf issue for f32 conv (#4107)
|
2025-10-13 17:47:57 -04:00 |
|
|
7b95adc9fe
|
benchdnn: ref_post_op: eltwise: fix NAN propagation
|
2025-10-13 12:32:33 -07:00 |
|
|
413d78d68f
|
governance: add Marek Michałowski to onednn-cpu-aarch64 (#4123)
Signed-off-by: Siddhartha Menon <siddhartha.menon@arm.com>
|
2025-10-13 17:35:59 +01:00 |
|
|
c476f32bae
|
cpu: aarch64: matmul: fix brgemm multidims tag detection (#4100)
Signed-off-by: Siddhartha Menon <siddhartha.menon@arm.com>
|
2025-10-13 17:06:35 +01:00 |
|
|
f9086bc474
|
xe: dsl: fixup trivial iteration
|
2025-10-13 09:05:23 -07:00 |
|
|
15006e1167
|
benchdnn: graph: avoid vector copy
|
2025-10-13 16:15:08 +08:00 |
|
|
356cf37f9f
|
graph: backend: dnnl: fix copy
|
2025-10-13 16:15:08 +08:00 |
|
|
4781eae311
|
graph: backend: dnnl: initialize member variable
|
2025-10-13 16:15:08 +08:00 |
|
|
07ac0a566a
|
graph: backend: dnnl: initialize is_invert_scale_
|
2025-10-13 16:15:08 +08:00 |
|
|
1eb43285fc
|
benchdnn: graph: flex_rewrite: remove deadcode after the macro SAFE
|
2025-10-13 16:15:08 +08:00 |
|
|
b18ed0017f
|
benchdnn: graph: displacer: return FAIL for more states
This should fix a nullptr dereference warning on the below line.
|
2025-10-13 16:15:08 +08:00 |
|
|
95d831e606
|
benchdnn: inputs: graph: replace multiply with add to pass acc check
|
2025-10-11 14:39:24 +08:00 |
|
|
105a7cd7fa
|
tests: prepare Eigen_threadpool implementation for new Eigen versions
|
2025-10-10 16:09:37 -07:00 |
|
|
08beeba4c9
|
cpu: x64: matmul: refactor pick_blocked_B_layout function
|
2025-10-10 15:17:49 -04:00 |
|