Script 'mail_helper' called by obssrc Hello community, here is the log from the commit of package openblas for openSUSE:Factory checked in at 2024-05-14 13:37:27 ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Comparing /work/SRC/openSUSE:Factory/openblas (Old) and /work/SRC/openSUSE:Factory/.openblas.new.1880 (New) ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Package is "openblas" Tue May 14 13:37:27 2024 rev:61 rq:1173654 version:0.3.26 Changes: -------- --- /work/SRC/openSUSE:Factory/openblas/openblas.changes 2024-03-03 20:18:43.668542163 +0100 +++ /work/SRC/openSUSE:Factory/.openblas.new.1880/openblas.changes 2024-05-14 13:38:14.758333630 +0200 @@ -12,7 +11,0 @@ - - Improved the version of openblas.pc that is created by the - CMAKE build. - - Fixed a CMAKE-specific build problem on older versions of - MacOS. - - Worked around linking problems on old versions of MacOS. - - Corrected installation location of the lapacke_mangling - header in CMAKE builds. @@ -21 +14 @@ - - Significantly sped up ?GESV for small problem sizes by + - Significantly sped up `?GESV` for small problem sizes by @@ -25 +18 @@ - + Added new LAPACK functions for truncated QR with pivoting + + Added new LAPACK functions for truncated `QR` with pivoting @@ -29 +22 @@ - + Fixed use of uninitialized variables in ?GEDMD and + + Fixed use of uninitialized variables in `?GEDMD` and @@ -32 +25 @@ - failures) in ?BBCSD. + failures) in `?BBCSD`. @@ -36 +28,0 @@ - + Fixed counting of testsuite results by the Python script. @@ -38 +30 @@ - - Fixed computation of CASUM on SkylakeX and newer targets in + - Fixed computation of `CASUM` on SkylakeX and newer targets in @@ -41 +33 @@ - - Fixed potential undefined behaviour in the CASUM/ZASUM + - Fixed potential undefined behaviour in the `CASUM`/`ZASUM` @@ -43 +35 @@ - - worked around a problem in the pre-AVX kernels for GEMV + - worked around a problem in the pre-AVX kernels for `GEMV` @@ -45,3 +37,3 @@ - - Sped up SGEMM and DGEMM on Neoverse V1 and N1. - - Sped up ?DOT on SVE-capable targets. - - Reduced the number of targets in DYNAMIC_ARCH builds by + - Sped up `SGEMM` and `DGEMM` on Neoverse V1 and N1. + - Sped up `?DOT` on SVE-capable targets. + - Reduced the number of targets in `DYNAMIC_ARCH` builds by @@ -52,2 +43,0 @@ - - Fixed detection of old 32bit PPC targets in CMAKE-based - builds. @@ -55,4 +44,0 @@ - - Fixed CMAKE-based compilation for PPCG4 and PPC970 targets. - * LONGARCH64: - - Added and improved optimized kernels for almost all BLAS - functions. @@ -75,2 +61,2 @@ - - fixed a potential division by zero in ?ROTG - - improved the ?MATCOPY functions to accept zero-sized rows or + - fixed a potential division by zero in `?ROTG` + - improved the `?MATCOPY` functions to accept zero-sized rows or @@ -81 +66,0 @@ - - fixed compilation with the Cray CCE Compiler suite @@ -84,4 +68,0 @@ - - worked around OPENMP builds with LLVM14's libomp hanging on - FreeBSD - - improved the Makefiles to require less option duplication on - "make install" @@ -93,2 +73,0 @@ - - fixed compile-time autodetection of AMD Ryzen3 and Ryzen4 - cpus @@ -96,2 +75,2 @@ - in DYNAMIC_ARCH - - added AVX512 optimizations for ?ASUM on Sapphire Rapids and + in `DYNAMIC_ARCH` + - added AVX512 optimizations for `?ASUM` on Intel Sapphire Rapids and @@ -100 +78,0 @@ - - fixed building on Apple with homebrew gcc @@ -103 +81,106 @@ - - increased the default buffer size for recent ARM server cpus + - increased the default buffer size for recent arm server cpus + * POWER: + - added support for `DYNAMIC_ARCH` builds with clang + - fixed union declaration in the `BFLOAT16` test case +- Changes in version 0.3.24 + * General: + - Declared the arguments of `cblas_xerbla` as `const` + (in accordance with the reference implementation + and others, the previous discrepancy appears to have dated + back to GotoBLAS) + - fixed the implementation of `?GEMMT` that was added in 0.3.23 + - made cpu-specific `SWITCH_RATIO` parameters for GEMM + available to `DYNAMIC_ARCH` builds + - fixed missing `SSYCONVF` function in the shared library + - fixed parallel build logic used with gmake + - fixed several issues with the handling of runtime limits on + the number of OPENMP threads + - corrected the error code returned by `SGEADD`/`DGEADD` when + LDA is too small + - corrected the error code returned by `IMATCOPY` when LDB + is too small + - updated `?NRM2` to support negative increment values (as + introduced in release 3.10.0 of the Reference BLAS) + - updated `?ROTG` to use the safe scaling algorithm introduced + in release 3.10.0 of the Reference BLAS + - fixed OpenMP builds with CLANG for the case where libomp is + not in a standard location + - fixed a potential overwrite of unrelated memory during + thread initialisation on startup + - fixed a potential integer overflow in the multithreading + threshold for `?SYMM`/`?SYRK` + - fixed build of the LAPACKE interfaces for the LAPACK 3.11.0 + `?TRSYL` functions added in 0.3.22 + - applied additions and corrections from the development + branch of Reference-LAPACK: + - fixed actual arguments passed to a number of LAPACK + functions (from Reference-LAPACK PR 885) + - fixed workspace query results in LAPACK `?SYTRF`/`?TRECV3` + (from Reference-LAPACK PR 883) + - fixed derivation of the UPLO parameter in `LAPACKE_?larfb` + (from Reference-LAPACK PR 878) + - fixed a crash in LAPACK `?GELSDD` on `NRHS=0` (from + Reference-LAPACK PR 876) + - added new LAPACK utility functions `CRSCL` and `ZRSCL` + (from Reference-LAPACK PR 839) + - corrected the order of eigenvalues for 2x2 matrices in + `?STEMR` (Reference-LAPACK PR 867) + - removed spurious reference to OpenMP variables outside + OpenMP contexts (Reference-LAPACK PR 860) + - updated file comments on use of `LAMBDA` variable in + LAPACK (Reference-LAPACK PR 852) + - fixed documentation of LAPACK `SLASD0`/`DLASD0` + (Reference-LAPACK PR 855) + - fixed confusing use of "minor" in LAPACK documentation + (Reference-LAPACK PR 849) + - added new LAPACK functions ?GEDMD for dynamic mode + decomposition (Reference-LAPACK PR 736) + - fixed potential stack overflows in the `EIG` part of the + LAPACK testsuite (Reference-LAPACK PR 854) + - applied small improvements to the variants of + Cholesky and QR functions (Reference-LAPACK PR 847) + - removed unused variables from LAPACK `?BDSQR` + (Reference-LAPACK PR 832) + - fixed a potential crash on allocation failure in LAPACKE + `SGEESX`/`DGEESX` (Reference-LAPACK PR 836) + - added a quick return from `SLARUV`/`DLARUV` for N < 1 + (Reference-LAPACK PR 837) + - updated function descriptions in LAPACK `?GEGS`/`?GEGV` + (Reference-LAPACK PR 831) + - improved algorithm description in `?GELSY` + (Reference-LAPACK PR 833) + - fixed scaling in LAPACK `STGSNA`/`DTGSNA` + (Reference-LAPACK PR 830) + - fixed crash in `LAPACKE_?geqrt` with row-major data + (Reference-LAPACK PR 768) + - added LAPACKE interfaces for `C/ZUNHR_COL` and + `S/DORHR_COL` (Reference-LAPACK PR 827) + - added error exit tests for `SYSV`/`SYTD2`/`GEHD2` to + the testsuite (Reference-LAPACK PR 795) + - fixed typos in LAPACK source and comments + (Reference-LAPACK PRs 809,811,812,814,820) + - adopt refactored `?GEBAL` implementation + (Reference-LAPACK PR 808) + * x86_64: + - added cpu model autodetection for Intel Alder Lake N + - added activation of the AMX tile to the Sapphire Rapids + `SBGEMM` kernel + - worked around miscompilations of GEMV/SYMV kernels by + gcc's tree-vectorizer + - fixed runtime detection of Cooperlake and Sapphire Rapids + in `DYNAMIC_ARCH` + - fixed feature-based cputype fallback in `DYNAMIC_ARCH` + - corrected `ZAXPY` result on old pre-AVX hardware for the + `INCX=0` case + - fixed a potential use of uninitialized variables in ZTRSM + * ARMV8: + - implemented SWITCH_RATIO parameter for improved GEMM + performance on Neoverse + - activated SVE SGEMM and DGEMM kernels for Neoverse V1 + - improved performance of the SVE CGEMM and ZGEMM kernels + on Neoverse V1 + - improved kernel selection for the ARMV8SVE target and added + it to `DYNAMIC_ARCH` + - fixed runtime check for SVE availability in `DYNAMIC_ARCH` + builds to take OS or container restrictions into account + - fixed a potential use of uninitialized variables in ZTRSM @@ -105,8 +188,55 @@ - - fixed building with the IBM xlf 16.1.1 compiler - - fixed building with IBM XL C - - added support for DYNAMIC_ARCH builds with clang - - fixed union declaration in the BFLOAT16 test case - - enable optimizations for the AIX assembler on POWER10 - * LOONGARCH64: - - added an optimized SGEMV kernel - - added an optimized DTRSM kernel + - fixed compiler warnings in the POWER10 SBGEMM kernel +- Changes in version 0.3.23 + * General: + - fixed a serious regression in `GETRF`/`GETF2` and + `ZGETRF`/`ZGETF2` where subnormal but nonzero data elements + triggered the singularity flag + - fixed a long-standing bug in `CSPR`/`ZSPR` in single-threaded + operation + - for cases where elements of the X vector are real numbers (or + complex with only the real part zero) + * x86_64: + - added further CPUID values for Intel Raptor Lake +- Changes in version 0.3.22 + * General: + - Updated the included LAPACK to Reference-LAPACK release 3.11.0 + plus post-release corrections and improvements + - Added a threshold for multithreading in `SYMM`, `SYMV` and + `SYR2K` + - Increased the threshold for multithreading in `SYRK` + - OpenBLAS no longer decreases the global `OMP_NUM_THREADS` + when it exceeds the maximum thread count the library was + compiled for. + - fixed `?GETF2` potentially returning `NaN` with tiny matrix + elements + - fixed `openblas_set_num_threads` to work in `USE_OPENMP` + builds. + - fixed cpu core counting in `USE_OPENMP` builds returning the + number of OMP "places" rather than cores + - fixed stride calculation in the optimized small-matrix path of + complex `SYR` + - fixed building of Reference-LAPACK with recent gfortran + - added new environment variable `OPENBLAS_DEFAULT_NUM_THREADS` + - added a GEMV-based implementation of `GEMMT` + * x86_64: + - added autodetection of Intel Raptor Lake cpu models + - added SSCAL microkernels for Haswell and newer targets + - improved the performance of the Haswell DSCAL microkernel + - added CSCAL and ZSCAL microkernels for SkylakeX targets + - fixed detection of gfortran and Cray CCE compilers + - fixed runtime selection of COOPERLAKE in `DYNAMIC_ARCH` builds + - worked around gcc/llvm using risky FMA operations in + CSCAL/ZSCAL + * ARMV8: + - fixed cross-compilation to CortexA53 with CMAKE + - fixed compilation with CMAKE and "Arm Compiler for Linux 22.1" + - added cpu autodetection for Cortex X3 and A715 + - fixed conditional compilation of SVE-capable targets in + `DYNAMIC_ARCH` + - sped up SVE kernels by removing unnecessary prefetches + - improved the GEMM performance of Neoverse V1 + - added SVE kernels for SDOT and DDOT + - added an SBGEMM kernel for Neoverse N2 + - improved cpu-specific compiler option selection for + Neoverse cpus + - added support for setting `CONSISTENT_FPCSR` ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Other differences: ------------------ ++++++ openblas.spec ++++++ --- /var/tmp/diff_new_pack.3KGsFz/_old 2024-05-14 13:38:16.234387455 +0200 +++ /var/tmp/diff_new_pack.3KGsFz/_new 2024-05-14 13:38:16.234387455 +0200 @@ -434,7 +434,7 @@ %{?dynamic_list} \ %{!?with_hpc:%{?libnamesuffix} FC=gfortran CC=gcc%{?cc_v:-%{cc_v}} %{?cc_v:CEXTRALIB=""}} \ %{?ldflags_tests:LDFLAGS_TESTS=%{ldflags_tests}} \ - %{?with_hpc:%{?cc_v:CC=gcc-%{cc_v} CEXTRALIB=""}} + %{?with_hpc:%{?cc_v:CC=gcc-%{cc_v} FC=gfortran-%{cc_v} CEXTRALIB=""}} %install %if %{with hpc}