5910 Commits (57ed58cefec3ca6669afc156cc90ffb49dba6593)
 

Author SHA1 Message Date
  Xianyi Zhang 57ed58cefe Refs #2587 Add small matrix optimization reference kernel for c/zgemm. 5 years ago
  Xianyi Zhang 17d32a4a82 Change a1b0 gemm to b0 gemm. 5 years ago
  Xianyi Zhang 59cb5de46b Refs #2587 Fix typos. 5 years ago
  Xianyi Zhang 4271cfcc6f Fix gemm interface bug for small matrix. 5 years ago
  Xianyi Zhang be3349405d Add alpha=1.0 beta=0.0 for small gemm. 5 years ago
  Xianyi Zhang 0a2077901c Add small marix optimization kernel interface. 5 years ago
  Martin Kroeker e6d6d3ee43
Merge pull request #3331 from gxw-loongson/develop 4 years ago
  gxw 0b8f7c8c10 Add cmake support for LOONGARCH64 4 years ago
  Martin Kroeker e0e88f9edc
Merge pull request #3329 from martin-frbg/issue3272 4 years ago
  Martin Kroeker 5dc6aa74f0
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3 4 years ago
  Martin Kroeker e78fbe4654
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3 4 years ago
  Martin Kroeker b4f4ed378b
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3 4 years ago
  Martin Kroeker cbc41973fd
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3 4 years ago
  gxw 34207bdf5b Fixed typos about LOONGARCH64 4 years ago
  Martin Kroeker 1b6db3dbba
Merge pull request #3327 from h-vetinari/lapack597_redux 4 years ago
  Martin Kroeker f681553c6a
Merge pull request #3326 from wattoc/develop 4 years ago
  Martin Kroeker afadeeba2a
Merge pull request #3325 from gxw-loongson/develop 4 years ago
  Isuru Fernando 02d4a49761 Also make sure the `1` is INTEGER*4 for OMP_SET_NUM_THREADS 4 years ago
  Craig Watson 4d7dfe4845 Include Haiku in processor count checks 4 years ago
  gxw af0a69f355 Add support for LOONGARCH64 4 years ago
  Martin Kroeker 5a2fe5bfb9
Merge pull request #3323 from martin-frbg/issue3322 4 years ago
  Martin Kroeker 342d3e8b5c
Merge pull request #3314 from martin-frbg/lapack597 4 years ago
  Martin Kroeker efbd7c7840
GCC did not support -mtune for ARM64 before 5.1 4 years ago
  Martin Kroeker 3a7955cd93
Merge pull request #3320 from martin-frbg/issue3318 4 years ago
  Martin Kroeker 47ba85f314
Fix regex to match kernels suffixed with cpuname too 4 years ago
  Martin Kroeker 30f23be0f9
Rework setting of -mfma to only apply it where necessary 4 years ago
  Martin Kroeker 49bbf330ca
Empirical workaround for numpy SVD NaN problem from issue 3318 4 years ago
  Martin Kroeker 38d5b4b124
Update version to 0.3.17.dev 4 years ago
  Martin Kroeker 6e3fbe8ac5
Update version to 0.3.17.dev 4 years ago
  Martin Kroeker 86273392e5
Merge pull request #3317 from xianyi/release-0.3.0 4 years ago
  Martin Kroeker d909f9f3d4
Update version to 0.3.17 4 years ago
  Martin Kroeker 12d3d94e2e
Merge pull request #3316 from xianyi/develop 4 years ago
  Martin Kroeker f349be3bdb
Merge branch 'release-0.3.0' into develop 4 years ago
  Martin Kroeker 4777eb678f
Update version to 0.3.17 4 years ago
  Martin Kroeker 415876d117
Merge pull request #3315 from martin-frbg/changelog0317 4 years ago
  Martin Kroeker da8435dc36
Update Changelog for 0.3.17 4 years ago
  Martin Kroeker 4c7065f3ee
Merge pull request #3313 from martin-frbg/3266-2 4 years ago
  Martin Kroeker f62bfaafe8
Merge pull request #3312 from martin-frbg/revert_3260 4 years ago
  Martin Kroeker d947116390
Merge pull request #3311 from martin-frbg/issue3309 4 years ago
  Martin Kroeker f176ff90af
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp 4 years ago
  Martin Kroeker f4d4abd423
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp 4 years ago
  Martin Kroeker 2b9443b7e7
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp 4 years ago
  Martin Kroeker fe0e66564e
Declare N_THREADS as *4 for compatibility of INTERFACE64 builds with LLVM libomp 4 years ago
  Martin Kroeker a6351e32f0
Remove BLASLONG casts from SPARC entries 4 years ago
  Martin Kroeker 5b4b385ecf
Temporarily disable the SkylakeX sgemv_t microkernel due to LAPACK testsuite failures 4 years ago
  Martin Kroeker 1dea57ab25
Revert PR #3250 (shortcut without buffer allocation) as it is unsafe on some x86_64 4 years ago
  Martin Kroeker 54ffe280df
Merge pull request #3310 from jeromerobert/develop 4 years ago
  Jerome Robert 029d1e16b9 Avoid redefinition of _GNU_SOURCE 4 years ago
  Martin Kroeker ea8e208029
Merge pull request #3306 from jonaszhou1/develop 4 years ago
  JonasZhou 0fca36c8c3 Add cpu detection support for Zhaoxin processors 4 years ago