oneDNN

mirror of https://github.com/uxlfoundation/oneDNN.git synced 2025-10-20 18:43:49 +08:00

Author	SHA1	Message	Date
Simonov, Alexander	85bfd3abb0	cpu: rnn: implement merged iter calculation using matmul	2025-10-17 05:32:45 -07:00
Simonov, Alexander	24379b3bf2	cpu: rnn: use merged layer execution for all sizes with matmul	2025-10-17 05:32:45 -07:00
Simonov, Alexander	24c39c357b	cpu: rnn: fix int overflows	2025-10-17 05:32:45 -07:00
Simonov, Alexander	b115e85606	cpu: rnn: use matmul wrapper for fwd prop	2025-10-17 05:32:45 -07:00
Simonov, Alexander	cda1a3ed3e	cpu: rnn: precreate matmul ctx	2025-10-17 05:32:45 -07:00
Simonov, Alexander	6397e09a99	cpu: rnn: use matmul for bwd prop	2025-10-17 05:32:45 -07:00
Simonov, Alexander	1c100e7eaa	cpu: rnn: fix memory descriptors of matmul arguments. Arguments src and weights of Matmul primitive had wrong memory descriptors. This was not detected e.g. by tests as these runtime descriptors were not used (correct memory descriptors were used during Matmul primitive descriptor creation). Local variables were renamed because parameters a_, b_, c_ can correspond to arbitrary matrices, and binding them to RNN's src and weights was misleading.	2025-10-17 05:32:44 -07:00
Simonov, Alexander	9f5a86a440	cpu: rnn: implement lstm projection using matmul	2025-10-17 05:32:44 -07:00
Simonov, Alexander	e13d8d10f8	cpu: rnn: implement merged gemm layer step using matmul. One layer case is supported.	2025-10-17 05:32:44 -07:00
Simonov, Alexander	53be46a45a	cpu: rnn: use merge_gemm_layer with matmul	2025-10-17 05:32:44 -07:00
Simonov, Alexander	53bf0a482b	cpu: rnn: use matmul for FWD_D for avx512 and fp32	2025-10-17 05:32:44 -07:00
dependabot[bot]	d2b59266e1	github: workflows: bump KyleMayes/install-llvm-action from 2.0.7 to 2.0.8 (#4121) Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-10-17 11:50:39 +01:00
dependabot[bot]	5365a85d64	github: workflows: bump github/codeql-action from 3.30.6 to 4.30.8 Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.30.6 to 4.30.8. - [Release notes](https://github.com/github/codeql-action/releases) - [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md) - [Commits](`64d10c1313...f443b600d9`) --- updated-dependencies: - dependency-name: github/codeql-action dependency-version: 4.30.8 dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com>	2025-10-16 15:40:30 -07:00
Guskov, Andrey Y	7b351ae955	src: gpu: intel: gemm: add strided batch support to group sums	2025-10-16 14:30:24 -07:00
Maria Zhukova	dc1d8c3d55	doc: update supported attributes section for matmul	2025-10-16 13:38:24 -07:00
Maria Zhukova	52077ed87c	api, doc: fix documentation warn wrt parameter name	2025-10-16 13:38:24 -07:00
Maria Zhukova	c16cf34b28	doc: fix a few misspellings	2025-10-16 13:38:24 -07:00
Zhukova, Maria	cff4bb8aaa	doc: add section for quantization-related APIs and refactor	2025-10-16 13:38:24 -07:00
Chereshnev, Eugene	954f7f76f2	xe: jit: fix shr usage	2025-10-16 12:47:19 -07:00
Andrei Hutu	3a756b982b	cpu: aarch64: skip f16 to f32 upcast for clip and clip_v2 eltwise (#4101)	2025-10-16 15:06:10 +01:00
Ryo Suzuki	e2aa79e849	ci: aarch64: make comparison script friendlier for local testing (#4157)	2025-10-16 13:12:46 +01:00
David Svantesson	34caf5740b	cpu: aarch64: jit_reorder: fix dispatch issue for direct copy (#4150)	2025-10-16 08:57:31 +01:00
Guskov, Andrey Y	bf6617dd1f	tests: benchdnn: utils: improve --buffer-prefix messaging	2025-10-15 18:09:06 -07:00
Umar Arshad	b5773a0826	xe: sdpa: workaround for xe3 regressions for PTL	2025-10-15 17:55:25 -04:00
Sergey Kazakov	34c81a1ca8	xe: gemm: convert HHS kernel to FHS to fix perf regression	2025-10-15 13:53:19 -07:00
Peter Caday	7b3f2af13c	xe: jit: gemm: fix stream-k slab count calculation	2025-10-15 10:05:31 -07:00
Kassen, Andrew	a4874e48ea	tests: sdpa: fix fallthrough	2025-10-15 09:08:56 -07:00
Kassen, Andrew	21adf4d977	tests: sdpa: fix typo	2025-10-15 09:08:56 -07:00
Kassen, Andrew	ff30885a40	xe: sdpa: add check for host scalar fetch	2025-10-15 09:08:56 -07:00
Kassen, Andrew	f8ef3008b1	xe: rnn: avoid potential null pointer dereference	2025-10-15 09:08:56 -07:00
Kassen, Andrew	201e204682	xe: conv: make conversion explicit	2025-10-15 09:08:56 -07:00
Kassen, Andrew	3f66dc5049	third_party: ngen: avoid unnecessary copy	2025-10-15 09:08:56 -07:00
Kassen, Andrew	2b06b5c1b5	xe: jit: avoid unnecessary copies	2025-10-15 09:08:56 -07:00
Kassen, Andrew	4060edd33c	xe: gemm: assert host scalar scale is nonzero	2025-10-15 09:08:56 -07:00
Kassen, Andrew	f1302e4804	xe: ir: remove dead code	2025-10-15 09:08:56 -07:00
Kassen, Andrew	8df1253973	xe: lrn: remove dead code	2025-10-15 09:08:56 -07:00
Mourad Gouicem	27e186ccd9	gtest: check returned ptr on expected md init failure	2025-10-15 10:42:00 +02:00
Mourad Gouicem	de84f7f345	benchdnn: properly lock before checking allocations size	2025-10-15 10:42:00 +02:00
Mourad Gouicem	34663d344a	benchdnn: fix reference check in init_memory	2025-10-15 10:42:00 +02:00
Denis Samoilov	67cd11b889	cpu: x64: matmul fix wei_k_blk query	2025-10-14 23:10:32 -04:00
Umar Arshad	5ab84bf3df	xe: ukernel: Add new ukernel entries for MoE support	2025-10-14 20:05:31 -04:00
Mourad Gouicem	5a2b8ccedb	scripts: update converter with quantization mode	2025-10-15 02:01:20 +02:00
Mourad Gouicem	eed2287b10	doc: document quantization mode and clean up quantization page	2025-10-15 02:01:20 +02:00
Mourad Gouicem	5331f9e068	gtest: cover mx input and output scaling in iface tests	2025-10-15 02:01:20 +02:00
Mourad Gouicem	76fd406dd4	benchdnn: matmul: add support for mx quantization testing	2025-10-15 02:01:20 +02:00
Mourad Gouicem	a3abba35b3	cpu: add mx quantization mode support to matmul primitive	2025-10-15 02:01:20 +02:00
Mourad Gouicem	1934858751	common: introduce quantization mode	2025-10-15 02:01:20 +02:00
Mourad Gouicem	98ae9938af	cpu: enable e8m0->f32 reorder	2025-10-15 02:01:20 +02:00
Peter Caday	8f70dafafd	third_party: ngen: auto-SWSB: fix early A@ dependency clearing bug	2025-10-14 15:38:41 -07:00
Kealan Barbieri	d5e144c943	tests: benchdnn: enable mode p on windows	2025-10-14 14:28:23 -07:00


20979 Commits