241 Commits (revert-3518-elbrus)

Author SHA1 Message Date
  Martin Kroeker c1c0d5ce1d Merge pull request #3492 from binebrank/arm_sve_zgemm 3 years ago
  Bine Brank b6a445cfd8 adapt Makefile for SVE trsm 3 years ago
  Martin Kroeker 499ae5e8f7 Merge pull request #3510 from martin-frbg/issue3505 3 years ago
  Martin Kroeker b6b024232d Merge pull request #3508 from snadampal/v1_n2 3 years ago
  Martin Kroeker 15d4b37913 SkylakeX: match parameters to dgemm kernels for dyn/non-dyn 3 years ago
  Sunita Nadampalli 19c8f615dc OpenBLAS: aarch64: Add neoverse-v1/n2 architecture specifics 3 years ago
  Bine Brank 39ab219704 sve copy functions for cgemm chemm zsymm 3 years ago
  gxw 8d9b9c6b2a loongarch64: Optimize dgemm_kernel 3 years ago
  Martin Kroeker 697e2752d7 Merge pull request #3464 from binebrank/arm_sve_sgemm 3 years ago
  Bine Brank a8f62a347b fix UNROLL_MN and add to targets for SVE 3 years ago
  Martin Kroeker f7f7fea0dc Merge pull request #3472 from kavanabhat/p10_aixas_p8 3 years ago
  kavanabhat eee3381cbe Fallback for Power kernels 3 years ago
  Martin Kroeker dd1f645371 switch DGEMM unroll parameters for SkylakeX if DYNAMIC_ARCH 3 years ago
  Bine Brank 86ae89bf33 add sgemm kernel and copy functions for sgemm and ssymm 3 years ago
  Martin Kroeker 454edd741c Merge pull request #3425 from binebrank/arm_sve_dgemm 3 years ago
  Bine Brank f4da23dcb6 reduced dgemm_unroll_m to work with 128-bit sve 3 years ago
  Bine Brank 9388f05a3c configure SVE Makefile 3 years ago
  Martin Kroeker 52a3f004a0 Fix unintended reversion of recent CortexA53 changes 3 years ago
  Martin Kroeker 19ccef5fb1 Add generic MIPS32 target 3 years ago
  Jia-Chen 302f22693a MOD: optimize normal DGEMM on ARMV8 cortex-A53 & cortex-A55 3 years ago
  Martin Kroeker 46947efb83 Ignore compiler support for MIPS MSA if the cpu lacks this capability 3 years ago
  Bine Brank ab7917910d add v2x8 kernel + fix sve dtrmm 3 years ago
  Bine Brank 7093372e32 add ARMV8SVE target 3 years ago
  Wangyang Guo 7b2f5cb3b7 sbgemm: spr: enlarge P to 256 for performance 4 years ago
  Wangyang Guo 0abbcd19c1 sbgemm: spr: tuning for blocking params 4 years ago
  Wangyang Guo 3dc6052c7e initial support for Sapphire Rapids platform 4 years ago
  Martin Kroeker 24233b7c49 Use "big arm server" GEMM defaults for Vortex 4 years ago
  kavanabhat fe3c778c51 AIX changes for P10 with GNU Compiler 4 years ago
  Wangyang Guo 8356a604f0 sbgemm: cooperlake: tuning for block params 4 years ago
  Niyas Sait 7cddbf99b1 Make explicit conversion condition on _WIN64 flag 4 years ago
  Niyas Sait d1ed72fa87 [win/arm64]: Explicit casting for GMEMM_DEFAULT_ALIGN to create 64-bit value 4 years ago
  gxw af0a69f355 Add support for LOONGARCH64 4 years ago
  Martin Kroeker a6351e32f0 Remove BLASLONG casts from SPARC entries 4 years ago
  User User-User b7da75e4fd WiP CORTEX A55 support 4 years ago
  Martin Kroeker 7dfc45e840 Remove casts for PPC/POWER and complete parameters for POWER3/4 4 years ago
  Gordon Fossum 198adea961 Changed default P/Q values for CGEMM and ZGEMM (Power10 only) 4 years ago
  Martin Kroeker 8cdf0825de Add workaround for older gcc on ppc64be not supporting casts in defines 4 years ago
  Martin Kroeker ecb4babcf4 remove inclusion of common.h again to avoid circular dependency 4 years ago
  Martin Kroeker 30d835168a Merge pull request #3088 from xoviat/msvc 4 years ago
  austinpagan 9579bd47e5 Modifying a couple paramaters in the "POWER10"-specific section of param.h, for performance enhancements for SGEMM and DGEMM. 4 years ago
  Rajalakshmi Srinivasaraghavan 63fa6c832e Fix build issue on POWER8 with DYNAMIC_ARCH 4 years ago
  xoviat 457ccc42c9 Merge branch 'develop' into msvc 4 years ago
  Gordon Fossum ed652d8136 Added definitions for GEMM_PREFERED_SIZE and SWITCH_RATIO to the POWER9 and POWER10 specific sections of param.h. 4 years ago
  Martin Kroeker 83de62c20d Merge pull request #3026 from martin-frbg/revert747 4 years ago
  gxw 4b548857d6 Add msa support for loongson 4 years ago
  Martin Kroeker d71fe4ed4e Remove GEMM_DEFAULT_UNROLL_MN parameters for Haswell and ZEN (introduced in PR747) 4 years ago
  Martin Kroeker b0b14f4e9b Change comments to C style for compatibility 4 years ago
  Rajalakshmi Srinivasaraghavan 41fe6e864e POWER10: Update param.h 4 years ago
  Xianyi Zhang fc35b72ae1 Refs #2899 4 years ago
  Xianyi Zhang 913cc9a4ca Merge branch 'develop' into risc-v 4 years ago