272 Commits (3597827c932b90be8916f5fbaa27dd8db818ee9e)

Author SHA1 Message Date
  Dirreke ec89466e14 Add CSKY support 1 year ago
  Martin Kroeker 504f9b0c5e
Increase S/D GEMM PQ to match typical L2 size as forNeoverseV1 1 year ago
  Martin Kroeker 2802478449
revert change to Loongson2k1000 zgemm 1 year ago
  Martin Kroeker 44b5b9e39f
Update C/ZGEMM MN for Loongson2k1000 1 year ago
  Martin Kroeker 519b40fad9
Merge pull request #4398 from yinshiyou/la-dev 1 year ago
  pengxu a5d0d21378 loongarch64: Add zgemm and cgemm optimization 1 year ago
  Hao Chen 179ed51d3b Add dgemm_kernel_8x4.S file. 1 year ago
  Darshan Patel dab0da8243 Update GEMM param for NEOVERSEV1 1 year ago
  Rajalakshmi Srinivasaraghavan 980f702f72 POWER: AIX: Make use of power10 optimization 1 year ago
  gxw 553cc1372f LoongArch64: Add sgemm_kernel 2 years ago
  gxw d46772e037 LoongArch64: Add compiler feature checks 2 years ago
  Chris Sidebottom 84a268b6ca Use SVE zgemm/cgemm on Arm(R) Neoverse(TM) V1 core 2 years ago
  Chris Sidebottom f971ef55f2 Add ARMV8SVE to AArch64 Dynamic Dispatch 2 years ago
  Martin Kroeker 72caceb324
Merge pull request #4009 from Mousius/sve-gemm 2 years ago
  Martin Kroeker 437c0bf2b4
Merge pull request #3843 from Mousius/switch-ratio 2 years ago
  Chris Sidebottom ec334e69dc Use SVE kernel for SGEMM/DGEMM on Arm(R) Neoverse(TM) V1 2 years ago
  Chris Sidebottom 5b165420b5 SWITCH_RATIO for Arm(R) Neoverse(TM) architecture 2 years ago
  Chris Sidebottom 32f2fafde7 Propagate SWITCH_RATIO to DYNAMIC_ARCH builds 2 years ago
  Martin Kroeker 31fd13d048
MIPS: make HAVE_MSA reflect cpu capability and NO_MSA software/env 2 years ago
  Chris Sidebottom 2fb096315e Set SWITCH_RATIO for Arm(R) Neoverse(TM) V1 CPUs 2 years ago
  Honglin Zhu 4989e039a5 Define SBGEMM_ALIGN_K for DYNAMIC_ARCH build 2 years ago
  Jiaxun Yang a50b29c540 Provide a fallback MIPS64_GENERIC target 3 years ago
  gxw fbfe1daf6e LoongArch64: Add DYNAMIC_ARCH support 3 years ago
  gxw 3573306a69 LoongArch64: Add core LOONGSON2K1000 and LOONGSONGENERIC 3 years ago
  Honglin Zhu 123e0dfb62 Neoverse N2 sbgemm: 3 years ago
  Honglin Zhu 55d686d41e neoverse n2 sbgemm: 3 years ago
  Martin Kroeker dac14a5f7d
revert "switch DGEMM parameters for SkylakeX if DYNAMIC_ARCH" 3 years ago
  Martin Kroeker a55a06c269
Update param.h 3 years ago
  Martin Kroeker d93cf7f23c
fix defines for CORTEX-X 3 years ago
  Martin Kroeker 09b8545fc5
Add initial support for M1 on Linux, Phytium FT2xxx series, ARM Cortex 510/710/X1/X2 3 years ago
  Martin Kroeker 8d0f7f0176
Revert accidental change of generic ARMV8 DGEMM parameters from #3425 3 years ago
  Martin Kroeker c1c0d5ce1d
Merge pull request #3492 from binebrank/arm_sve_zgemm 3 years ago
  Bine Brank b6a445cfd8 adapt Makefile for SVE trsm 3 years ago
  Martin Kroeker 499ae5e8f7
Merge pull request #3510 from martin-frbg/issue3505 3 years ago
  Martin Kroeker b6b024232d
Merge pull request #3508 from snadampal/v1_n2 3 years ago
  Martin Kroeker 15d4b37913
SkylakeX: match parameters to dgemm kernels for dyn/non-dyn 3 years ago
  Sunita Nadampalli 19c8f615dc OpenBLAS: aarch64: Add neoverse-v1/n2 architecture specifics 3 years ago
  Bine Brank 39ab219704 sve copy functions for cgemm chemm zsymm 3 years ago
  gxw 8d9b9c6b2a loongarch64: Optimize dgemm_kernel 3 years ago
  Martin Kroeker 697e2752d7
Merge pull request #3464 from binebrank/arm_sve_sgemm 3 years ago
  Bine Brank a8f62a347b fix UNROLL_MN and add to targets for SVE 3 years ago
  Martin Kroeker f7f7fea0dc
Merge pull request #3472 from kavanabhat/p10_aixas_p8 3 years ago
  kavanabhat eee3381cbe Fallback for Power kernels 3 years ago
  Martin Kroeker dd1f645371
switch DGEMM unroll parameters for SkylakeX if DYNAMIC_ARCH 3 years ago
  Bine Brank 86ae89bf33 add sgemm kernel and copy functions for sgemm and ssymm 3 years ago
  Martin Kroeker 454edd741c
Merge pull request #3425 from binebrank/arm_sve_dgemm 3 years ago
  Bine Brank f4da23dcb6 reduced dgemm_unroll_m to work with 128-bit sve 3 years ago
  Bine Brank 9388f05a3c configure SVE Makefile 3 years ago
  Martin Kroeker 52a3f004a0
Fix unintended reversion of recent CortexA53 changes 3 years ago
  Martin Kroeker 19ccef5fb1
Add generic MIPS32 target 3 years ago