Script 'mail_helper' called by obssrc
Hello community,

here is the log from the commit of package clpeak for openSUSE:Factory checked in at 2026-06-17 16:20:29
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Comparing /work/SRC/openSUSE:Factory/clpeak (Old)
 and      /work/SRC/openSUSE:Factory/.clpeak.new.1981 (New)
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

Package is "clpeak"

Wed Jun 17 16:20:29 2026 rev:10 rq:1359846 version:2.0.10

Changes:
--------
--- /work/SRC/openSUSE:Factory/clpeak/clpeak.changes 2026-05-23 23:24:24.053421601 +0200
+++ /work/SRC/openSUSE:Factory/.clpeak.new.1981/clpeak.changes 2026-06-17 16:21:06.922081586 +0200
@@ -1,0 +2,48 @@
+Thu Jun 04 10:42:58 UTC 2026 - Martin Hauke <[email protected]>
+
+- Update to version 2.0.10:
+  * Add ROCm / HIP backend
+    Full AMD GPU backend using the HIP runtime. Includes the
+    complete benchmark suite as it becomes available on each GPU
+    family:
+    + Compute throughput — FP64/FP32/FP16/BF16/INT32 vector kernels
+      with vector-width ILP variants.
+    + INT8 DP4a — sudot4 instruction on CDNA3 / RDNA3-4.
+    + Raw MFMA matrix-core peak — FP16, BF16, INT8, FP8 (CDNA
+      architectures).
+    + 2:4 structured-sparse MFMA (smfmac) — FP16, BF16, FP8.
+    + FP4 matrix-core (MXFP4) — Instinct MI300+.
+    + Native WMMA peak — RDNA3/4, covering FP16, BF16,
+      FP8 E4M3/E5M2.
+    + rocBLAS — FP16, BF16, FP8 GEMM.
+    + hipBLASLt — FP8 GEMM.
+    + Memory bandwidth — global, local, host-device transfer.
+  * Add oneAPI / SYCL backend
+    Intel GPU backend compiled with icpx -fsycl. Requires the
+    Intel oneAPI Base Toolkit.
+    + Compute throughput — FP64/FP32/FP16/BF16/INT32/INT8 vector
+      kernels with vector-width sweeps.
+    + joint_matrix peak — device-queried shapes, VNNI B layout,
+      TF32; per-device capability diagnostics.
+    + oneMKL GEMM — FP16, BF16, FP32 (where supported).
+    + Memory bandwidth.
+    + Falls back to CPU/accelerator device when no GPU is visible.
+  * New Benchmarks (CUDA)
+    + NVFP4 matrix-core — mma FP4 benchmarks for Ada/Hopper.
+    + FP4 2:4 sparse mma.sp — Blackwell (SM 12.0+), with corrected
+      .scale_vec::4X selectors.
+  * CLI Changes
+    + --verbose - Print backend debug logs: kernel build logs,
+      API errors, and library exceptions that are suppressed by
+      default.
+  * Device selection: now accepts comma-separated index lists.
+  * Removed flags
+    + --cl-platform-name <str>
+      OpenCL platform name matching removed; use --cl-platform
+      <index>.
+    + --cl-device-name <str>
+      OpenCL device name matching removed; use --cl-device <index>
+    + --int4-packed-compute / --no-int4-packed-compute.
+      Emulated INT4 benchmark removed.
+
+-------------------------------------------------------------------

Old:
----
  clpeak-2.0.9.tar.xz

New:
----
  clpeak-2.0.10.tar.xz

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

Other differences:
------------------
++++++ clpeak.spec ++++++
--- /var/tmp/diff_new_pack.NYxBts/_old 2026-06-17 16:21:07.882121740 +0200
+++ /var/tmp/diff_new_pack.NYxBts/_new 2026-06-17 16:21:07.890122075 +0200
@@ -18,7 +18,7 @@
 Name: clpeak
-Version: 2.0.9
+Version: 2.0.10
 Release: 0
 Summary: Find peak OpenCL capacities like bandwidth & compute
 License: LicenseRef-SUSE-Public-Domain

++++++ _service ++++++
--- /var/tmp/diff_new_pack.NYxBts/_old 2026-06-17 16:21:07.926123581 +0200
+++ /var/tmp/diff_new_pack.NYxBts/_new 2026-06-17 16:21:07.934123915 +0200
@@ -1,7 +1,7 @@
 <services>
   <service name="tar_scm" mode="manual">
     <param name="url">https://github.com/krrishnarraj/clpeak.git</param>
-    <param name="revision">2.0.9</param>
+    <param name="revision">2.0.10</param>
     <param name="scm">git</param>
     <param name="changesgenerate">enable</param>
     <param name="filename">clpeak</param>

++++++ _servicedata ++++++
--- /var/tmp/diff_new_pack.NYxBts/_old 2026-06-17 16:21:07.958124919 +0200
+++ /var/tmp/diff_new_pack.NYxBts/_new 2026-06-17 16:21:07.962125086 +0200
@@ -1,6 +1,6 @@
 <servicedata>
 <service name="tar_scm">
   <param name="url">https://github.com/krrishnarraj/clpeak.git</param>
-  <param name="changesrevision">d6612eff43dabba92b3ba729cfb8a0f594c99693</param></service></servicedata>
+  <param name="changesrevision">627ff1753649e8aba5e03d98f3b20a6d17a5edc5</param></service></servicedata>
 (No newline at EOF)

++++++ clpeak-2.0.9.tar.xz -> clpeak-2.0.10.tar.xz ++++++
++++ 655445 lines of diff (skipped)
