2023 Commits (cd8ac192a901b38980755583faaa35559df7910a)

Author SHA1 Message Date
  Nursultan Zarlyk 1bb7993a97 Fix MSVC ARM64 build. Add generic kernel for ARM64 3 years ago
  Martin Kroeker dc49edd4e6
Revert "roll back DGEMM kernel ... for DYNAMIC_ARCH" 3 years ago
  Rajalakshmi Srinivasaraghavan b62173c5a0 POWER10: Changing store instructions for Level1 functions 3 years ago
  Martin Kroeker 84cb58b7fb
Fix generator rules for ?laswp_ncopy and ?neg_tcopy 3 years ago
  Martin Kroeker 05dcfa176e
fix undefined prefetchsizes 3 years ago
  Martin Kroeker 2bbb9f05c7
fix undefined prefetchsize 3 years ago
  Martin Kroeker 115bc9b98f
CortexX1 is ARMV8 like A7x 3 years ago
  Martin Kroeker b3b4672c30
Add initial support for Phytium FT2000 series and ARMV9 Cortex 510/710/X1/X2 3 years ago
  Martin Kroeker 40302558ed
Remove extraneous (and wrong) definition of sbgemm_r on x86_64 3 years ago
  Caroline Newcombe 5cc1111383 fix unsafe read of Y in assembly kernel 3 years ago
  Xianyi Zhang 45786b05da Merge branch 'develop' into risc-v 3 years ago
  Wangyang Guo 225683218c Small Matrix: use proper inline asm input constraint for AVX512 mask 3 years ago
  Martin Kroeker 9c626e466e
really fix definition of SHUFFLE_MAGIC_NO 3 years ago
  Martin Kroeker 0698212c8c
Remove stray $ 3 years ago
  Martin Kroeker 9d7429406f
Declare SHUFFLE_MAGIC_NO as const to placate clang 3 years ago
  Martin Kroeker d9894f45d3
Define sbgemm_r to fix DYNAMIC_ARCH builds 3 years ago
  Martin Kroeker 522f809825
Merge pull request #3542 from martin-frbg/issue3540 3 years ago
  Mosè Giordano abbc947edb Fix compilation of Skylake AVX512 kernels with GCC 6 3 years ago
  Martin Kroeker c62f8e2c01
Prevent compiler attempts to use k0 as mask register 3 years ago
  Martin Kroeker 80eb581c83
Fix non-portable u_int64_t 3 years ago
  Martin Kroeker 73ffabe6ba
Guard uses of _mm512_reduce_add_p? 3 years ago
  Martin Kroeker 7656aba00e
Merge pull request #3493 from martin-frbg/casts+cleanup 3 years ago
  Martin Kroeker addc2a7aaa
Add proper defaults for IMIN/IMAX 3 years ago
  Martin Kroeker 299d4d70a3
Add default KERNEL file for Elbrus E2K arch 3 years ago
  Martin Kroeker 3492bea602
Create Makefile 3 years ago
  Martin Kroeker 898cf5faf3
Add Elbrus e2k architecture support 3 years ago
  Martin Kroeker c1c0d5ce1d
Merge pull request #3492 from binebrank/arm_sve_zgemm 3 years ago
  Bine Brank 19d435b1b3 update armv8sve + contributors 3 years ago
  Bine Brank f158d59087 adapt CMake 3 years ago
  Bine Brank b6a445cfd8 adapt Makefile for SVE trsm 3 years ago
  Bine Brank 0fb6cc07bf fix ztrsm lt/ut copy 3 years ago
  Bine Brank f1315288a8 add sve ztrsm 3 years ago
  Bine Brank aaa2b1a861 fix sve dtrsm kernels 3 years ago
  Bine Brank 8071e179f1 add remaining sve trsm copy kernels 3 years ago
  Bine Brank f87468ac91 trsm_lncopy_sve 3 years ago
  Bine Brank e8939b3d30 sve trsmRN and trsmRT 3 years ago
  Bine Brank 098672b51b add trsm_kernel_LT_sve 3 years ago
  Bine Brank be7e55880c sve trsm_kernel_LN 3 years ago
  Martin Kroeker b6b024232d
Merge pull request #3508 from snadampal/v1_n2 3 years ago
  Sunita Nadampalli 19c8f615dc OpenBLAS: aarch64: Add neoverse-v1/n2 architecture specifics 3 years ago
  Bine Brank bb33446b40 fix makefile.L3 3 years ago
  Bine Brank f33543d029 combine zchemm into single file 3 years ago
  Bine Brank 0c91d043ae adapt CMake for SVE 3 years ago
  Bine Brank 39ab219704 sve copy functions for cgemm chemm zsymm 3 years ago
  Bine Brank 18102ae8c3 add cgemm ctrmm sve kernels 3 years ago
  Bine Brank 87537b8c55 modify sve zgemmcopy kernels 3 years ago
  Bine Brank d30157d891 update configuration of kernels for A64FX and ARMV8SVE 3 years ago
  Bine Brank 07fa6fa3b1 configure Makefile for sve 3 years ago
  Bine Brank 2e2c02b762 fix sve ztrmm kernel 3 years ago
  Bine Brank 68c414d3a6 ztrmm sve copy functions 3 years ago