2023 Commits (2d0b2334259d41c2003b51a07580dbd25cfe267c)

Author SHA1 Message Date
  Chris Sidebottom eea006a688 Wrap SVE header with __has_include check 2 years ago
  Chris Sidebottom fd4f52c797 Add SVE implementation for sdot/ddot 2 years ago
  lilianhuang fdac8a97c1 Add sbgemm_ncopy_8 and sbgemm_tcopy_4 2 years ago
  lilianhuang 135718eafc Improve the performance of sbgemm_tcopy on neoversen2 2 years ago
  Chris Sidebottom 4f7b77e08a Remove unnecessary instructions from Advanced SIMD dot 2 years ago
  Martin Kroeker f73cfb7e2c
change line endings from CRLF to LF 2 years ago
  Martin Kroeker 1688c7da43
change line endings from CRLF to LF 2 years ago
  Bart Oldeman 6c1043eb41 Add [cz]scal microkernels for SKYLAKEX 2 years ago
  Martin Kroeker c9d78dc3b2
Remove excess initializer (leftover from rework of PR 3793) 2 years ago
  Martin Kroeker 65338a9493
Merge pull request #3799 from bartoldeman/cscal-zscal-no-fma 2 years ago
  Honglin Zhu 79066b6bf3 Change file name to match the norm and delete useless code. 2 years ago
  Bart Oldeman e7e3aa2948 x86_64: prevent GCC and Clang from generating FMAs in cscal/zscal. 2 years ago
  Honglin Zhu 4989e039a5 Define SBGEMM_ALIGN_K for DYNAMIC_ARCH build 2 years ago
  Honglin Zhu 843e9fd0b9 Fix typo error 2 years ago
  Honglin Zhu b00d5b9746 New sbgemm implementation for Neoverse N2 3 years ago
  Martin Kroeker f6f35a4288
fix copyobj declarations to work with DYNAMIC_ARCH 3 years ago
  Martin Kroeker b1d69fb3ac
Add MIPS64_GENERIC as a copy of GENERIC 3 years ago
  gxw edea1bcfaf MIPS64: Fixed failed utest dsdot:dsdot_n_1 when TARGET=I6500 3 years ago
  Martin Kroeker 101a2c77c3
Fix warnings 3 years ago
  Martin Kroeker 23d59baaf1
Add -mfma to -mavx2 for Apple clang, and set AVX2 options for Zen as well 3 years ago
  gxw 365936ae1b MIPS64: Using the macro MTC rather than MTC1 3 years ago
  Martin Kroeker 739c3c44a7
Work around windows/osx gcc12 x86_64 tree-optimizer problem and add an osx/gcc12 build to Azure CI (#3745) 3 years ago
  Martin Kroeker bd30120ba7
Merge pull request #3720 from FlyGoat/mips64 3 years ago
  Jiaxun Yang a50b29c540 Provide a fallback MIPS64_GENERIC target 3 years ago
  Jiaxun Yang 50c4eeb97d alpha: Remove include of version.h 3 years ago
  Ivan Pribec 802e71bf05 Add const attribute to lsame 3 years ago
  gxw fbfe1daf6e LoongArch64: Add DYNAMIC_ARCH support 3 years ago
  Martin Kroeker cd8e57040c
Merge pull request #3691 from martin-frbg/issue3679-sparc 3 years ago
  Martin Kroeker 6c118b7977
Fix DNRM2 returning INF instead of zero due to intermediate overflow 3 years ago
  Martin Kroeker c43ec53bdd
Merge pull request #3690 from RajalakshmiSR/cdotp10 3 years ago
  Martin Kroeker b7c65d08cb
Merge pull request #3689 from RajalakshmiSR/dgemvgcc10 3 years ago
  Martin Kroeker 06ef015234
fix DNRM2 returning INF instead of zero due to intermediate overflow 3 years ago
  Rajalakshmi Srinivasaraghavan a612e78a97 POWER: Fix complex dot function failures 3 years ago
  Rajalakshmi Srinivasaraghavan 432fd99445 POWER10: dgemv builtin rename 3 years ago
  gxw 4dd05e526b LoongArch64: Fix dnrm2_tiny testcase failure 3 years ago
  gxw cce4b1d956 MIPS64: Fix dnrm2_tiny testcase failure 3 years ago
  Martin Kroeker e12d474780
Eliminate uses of CREAL on left-hand side of assignments 3 years ago
  Martin Kroeker 9e29598575
workaround fault with ssq=inf,scale=0 3 years ago
  Honglin Zhu 123e0dfb62 Neoverse N2 sbgemm: 3 years ago
  Honglin Zhu bc3728475f format code 3 years ago
  Honglin Zhu 55d686d41e neoverse n2 sbgemm: 3 years ago
  Honglin Zhu 04593bb27c neoverse n2 sbgemm: init file 3 years ago
  Martin Kroeker be5500e704
Merge pull request #3669 from VFerrari/fix_small_matrix_kernel 3 years ago
  Martin Kroeker 92275a7902
Merge pull request #3642 from nursik/develop 3 years ago
  VFerrari cac634fce3
POWER10: Fix multithreading check when USE_THREAD=0 3 years ago
  Martin Kroeker 9283c7c0b5
Merge pull request #3655 from RajalakshmiSR/zgemmasmp10 3 years ago
  Rajalakshmi Srinivasaraghavan f191bc652b POWER10: Fix ZGEMM testcase failures 3 years ago
  Rajalakshmi Srinivasaraghavan 8419d538ff POWER10: convert dgemv inline assembly 3 years ago
  Xianyi Zhang 5e9a912591 Merge branch 'develop' into risc-v 3 years ago
  Xianyi Zhang 968e1f51d8 Update RISC-V Intrinsic API. 3 years ago