244 Commits (2d0b2334259d41c2003b51a07580dbd25cfe267c)

Author SHA1 Message Date
  Martin Kroeker 8e6d93359d
Merge pull request #4196 from TiborGY/obsolete_inlines 2 years ago
  Ian McInerney 79c15db348 Fix power10 gcc intrinsic check 2 years ago
  TGY b5ba95a6c0 Modernize obsolete inline order 2 years ago
  Martin Kroeker 54d3246fc6
Allow negative INCX (API change from version 3.10 of the reference implementation) 2 years ago
  Manjul Mohan 58b88aa5f0 POWER10: Fix compiler warnings 2 years ago
  Martin Kroeker 1688c7da43
change line endings from CRLF to LF 2 years ago
  Martin Kroeker 6c118b7977
Fix DNRM2 returning INF instead of zero due to intermediate overflow 3 years ago
  Martin Kroeker c43ec53bdd
Merge pull request #3690 from RajalakshmiSR/cdotp10 3 years ago
  Rajalakshmi Srinivasaraghavan a612e78a97 POWER: Fix complex dot function failures 3 years ago
  Rajalakshmi Srinivasaraghavan 432fd99445 POWER10: dgemv builtin rename 3 years ago
  VFerrari cac634fce3
POWER10: Fix multithreading check when USE_THREAD=0 3 years ago
  Martin Kroeker 9283c7c0b5
Merge pull request #3655 from RajalakshmiSR/zgemmasmp10 3 years ago
  Rajalakshmi Srinivasaraghavan f191bc652b POWER10: Fix ZGEMM testcase failures 3 years ago
  Rajalakshmi Srinivasaraghavan 8419d538ff POWER10: convert dgemv inline assembly 3 years ago
  Rajalakshmi Srinivasaraghavan b62173c5a0 POWER10: Changing store instructions for Level1 functions 3 years ago
  Martin Kroeker 05dcfa176e
fix undefined prefetchsizes 3 years ago
  Martin Kroeker 2bbb9f05c7
fix undefined prefetchsize 3 years ago
  Rafael Cardoso Fernandes Sousa c78fdcc80d [POWER] Add support for SMALL_MATRIX_OPT 3 years ago
  kavanabhat 9cc95e5657 AIX changes for P10 with GNU Compiler 4 years ago
  kavanabhat fe3c778c51 AIX changes for P10 with GNU Compiler 4 years ago
  Rafael Cardoso Fernandes Sousa b751edf624 Fix unused variable warnings on Power 4 years ago
  Rajalakshmi Srinivasaraghavan b06880c2cd POWER10: Improving dasum performance 4 years ago
  Martin Kroeker c4b464cac6
Merge pull request #3273 from austinpagan/sbgemm_gcc10_fix 4 years ago
  Gordon Fossum e6dd44d989 Power10: Fix for SBGEMM 4 years ago
  Martin Kroeker 2e8ff4a781
Merge pull request #3266 from martin-frbg/powerparam 4 years ago
  Martin Kroeker efdbdd8f82
Add prefetch values for power3 4 years ago
  Martin Kroeker 3906ef3b0f
Add prefetch values for power3 4 years ago
  Martin Kroeker 8adf0971d8
Add prefetch values for power3 4 years ago
  Martin Kroeker 08e2e60762
Add prefetch values for power3 4 years ago
  Martin Kroeker fb9e678235
Fix caxpy/zaxpy for big-endian 4 years ago
  Martin Kroeker dc4fcb48df
Fix inverted conditional for caxpy/zaxpy 4 years ago
  Martin Kroeker 7a48247761
fix c/zrot and sgemv for POWER5 4 years ago
  Rajalakshmi Srinivasaraghavan cbb70438df POWER10: Fixes for sbgemm kernel 4 years ago
  Rajalakshmi Srinivasaraghavan 2379abaa5e POWER10: Improve dgemm performance 4 years ago
  Rajalakshmi Srinivasaraghavan 55bb9f639a POWER10: Optimized zgemv 4 years ago
  Rajalakshmi Srinivasaraghavan 2dbcddd83d POWER10: Adding check for little endian 4 years ago
  Martin Kroeker 86c5a0013f
Add workaround for LAPACK testsuite failures with the NVIDIA HPC compiler 4 years ago
  Martin Kroeker ef85c22474
Add workaround for LAPACK test failures with the NVIDIA HPC compiler 4 years ago
  Martin Kroeker d3555d2e50
Add workaround for LAPACK test failures with the NVIDIA HPC compiler 4 years ago
  Rajalakshmi Srinivasaraghavan 09d47af2c0 Optimize zscal function for POWER10 4 years ago
  Rajalakshmi Srinivasaraghavan 41646ed006 Optimize s/dasum function for POWER10 4 years ago
  Rajalakshmi Srinivasaraghavan 0571c3187b POWER10: Rename mma builtins 4 years ago
  Rajalakshmi Srinivasaraghavan 2056ffc227 Optimize cscal function for POWER10 4 years ago
  Rajalakshmi Srinivasaraghavan 3ede843d50 Optimize s/dscal function for POWER10 4 years ago
  Rajalakshmi Srinivasaraghavan 439b93f6d2 Optimize s/drot function for POWER10 4 years ago
  Rajalakshmi Srinivasaraghavan eff7c9166e Optimize cdot function for POWER10 4 years ago
  Rajalakshmi Srinivasaraghavan 601b711c78 Optimize swap function for POWER10 4 years ago
  Rajalakshmi Srinivasaraghavan 2fb11f873b POWER10: Improve copy performance 4 years ago
  Martin Kroeker 043128cbe5
Merge pull request #3029 from RajalakshmiSR/axpyp10 4 years ago
  Rajalakshmi Srinivasaraghavan 346e30a46a POWER10: Improve axpy performance 4 years ago