1681 Commits (57ed58cefec3ca6669afc156cc90ffb49dba6593)

Author SHA1 Message Date
  gxw 4b548857d6 Add msa support for loongson 4 years ago
  Martin Kroeker 7f11e33e8d
Merge pull request #3025 from TiredNotTear/develop 4 years ago
  Martin Kroeker 53e0837809
Merge pull request #3022 from jinboson/develop 4 years ago
  Hao Chen ad38bd0e89 Fix failed cgemv and zgemv test case after using msa optimization 4 years ago
  Hao Chen 47b639cc9b Fix failed sswap and dswap case by using msa optimization 4 years ago
  Martin Kroeker b660008c7e
Work around DOT and SWAP test failures 4 years ago
  Martin Kroeker f8346603cf
Fix compilation with SolarisStudio 4 years ago
  Jin Bo 65de6f5957 Fix test errors reported by cblas_cgemm & cblas_ctrmm 4 years ago
  Gordon Fossum 213c0e7abb Added special unrolled vectorized versions of "Solve" for specific sizes, 4 years ago
  Martin Kroeker 441c08c9ff
Merge pull request #3016 from xiegengxin/complex-asum 4 years ago
  Gengxin Xie 0cb7a403b2 fix error declare function blas_level1_thread_with_return_value 4 years ago
  Gengxin Xie b766c1e9bb Improve the performance of zasum and casum with AVX512 intrinsic 4 years ago
  Rajalakshmi Srinivasaraghavan 7d46e31de1 POWER10: Optimize dgemv_n 4 years ago
  Martin Kroeker f1bf040b25
Merge pull request #2988 from xiegengxin/smp-asum 4 years ago
  Xianyi Zhang 7037849498 Merge branch 'develop' into risc-v 4 years ago
  Martin Kroeker 7e9cb39a25
Merge pull request #2981 from Qiyu8/fix-sum 4 years ago
  Gengxin Xie d6e7e05bb3 Improve the performance of dasum and sasum when SMP is defined 4 years ago
  Qiyu8 ae0b1dea19 modify system.cmake to enable fma flag 4 years ago
  Qiyu8 e0dac6b53b fix the CI failure of target specific option mismatch 4 years ago
  Qiyu8 e5c2ceb675 fix the CI failure of lack the head 4 years ago
  Qiyu8 a87e537b8c modify macro 4 years ago
  Qiyu8 5bc0a7583f only FMA3 and vector larger than 128 have positive effects. 4 years ago
  Qiyu8 8c0b206d4c Optimize the performance of rot by using universal intrinsics 4 years ago
  Qiyu8 c4c591ac5a fix sum optimize issues 4 years ago
  Xianyi Zhang fc35b72ae1 Refs #2899 4 years ago
  Xianyi Zhang 913cc9a4ca Merge branch 'develop' into risc-v 4 years ago
  Martin Kroeker ff16329cb7
Merge pull request #2972 from xiegengxin/rot-intrinsic 4 years ago
  Martin Kroeker 110c7a6de0
Merge pull request #2979 from RajalakshmiSR/dot_power10 4 years ago
  Rajalakshmi Srinivasaraghavan 6e364981a8 Optimize sdot/ddot for POWER10 4 years ago
  Martin Kroeker b976a0bf40
Remove previous workaround for compiler flags related to cpu capabilities in x86_64 DYNAMIC_ARCH builds 4 years ago
  Martin Kroeker ff74319ea5
Merge pull request #2977 from martin-frbg/issue2976 4 years ago
  Martin Kroeker 28d2dfe2b3
Fix macro name used in ifdef 4 years ago
  Gengxin Xie 725ffbf041 fix typo 4 years ago
  Gengxin Xie d9ba49165a Improve the performance of rot by using AVX512 and AVX2 intrinsic 5 years ago
  Rajalakshmi Srinivasaraghavan dd7a9cc5bf POWER10: Change dgemm unroll factors 4 years ago
  Rajalakshmi Srinivasaraghavan b435491885 Optimize caxpy for POWER10 4 years ago
  Chen, Guobing a7b1f9b1bb Implementation of BF16 based gemv 5 years ago
  Martin Kroeker 67f39ad813
Merge pull request #2939 from thrasibule/Makefile_cleanup 5 years ago
  Rajalakshmi Srinivasaraghavan c24ba8b1dd Optimize saxpy for POWER10 5 years ago
  Martin Kroeker 6f9460f0f6
Merge pull request #2937 from martin-frbg/pwr-buffersz 5 years ago
  Guillaume Horel 1917a4e7b8 reuse variables defined in Makefile.system 5 years ago
  Martin Kroeker 34c3c407ef
label always_inline function as inline to silence a gcc warning 5 years ago
  Martin Kroeker 2e48d560ba
Fix compiler version check 5 years ago
  Rajalakshmi Srinivasaraghavan ad745c0bae Optimize scopy/ccopy for POWER10 5 years ago
  İsmail Dönmez 4a1d00f589
Fix build with -Werror=return-type 5 years ago
  Bart Oldeman b073d759d0 x86_64: clobber all xmm registers after vzeroupper 5 years ago
  Martin Kroeker dc6e44c3f8
Merge pull request #2916 from martin-frbg/issue2911 5 years ago
  Martin Kroeker a61c086408
Fix spurious trailing whitespace in comment 5 years ago
  Bart Oldeman 03e781b766 sgemm_direct_skylakex: fix 75eeb26 regression. 5 years ago
  Martin Kroeker f1a4071d8c
Clean up STACKSIZE redefinition 5 years ago