3236 Commits (42bc2a92023070ee871ffd81b6a9b8fb6dd1892b)
 

Author SHA1 Message Date
  Martin Kroeker 42bc2a9202 Fix copy-paste errors (POWER8/9 and extraneous return) 6 years ago
  Martin Kroeker 2f04cf22ac Detect POWER9 as POWER8 on AIX and Linux 6 years ago
  Martin Kroeker 807f6e6922 Use prtconf to determine CPU type on AIX 6 years ago
  Martin Kroeker 76a66eaac8 Merge pull request #1829 from ashwinyes/develop_aarch64_dynamic_arch_support 7 years ago
  Ashwin Sekhar T K d5aeff636f ARM64: Enable DYNAMIC_ARCH 7 years ago
  Ashwin Sekhar T K af2837c392 ARM64: Remove #define ARMV8 for THUNDERX 7 years ago
  Ashwin Sekhar T K e7b66cd36e ARM64: Fix DYNAMIC_ARCH compilation for cores which dont use GEMM3M 7 years ago
  Ashwin Sekhar T K d50abc8903 ARM64: Move parameters from parameter.c to param.h 7 years ago
  Ashwin Sekhar T K 351a0c777c ARM64: Remove XGENE1 references 7 years ago
  Martin Kroeker e3c262e5cf Merge pull request #1825 from brada4/hemv 7 years ago
  Andrew a293bdcd5e re-arrange new code for readability 7 years ago
  Andrew c7bbf9c987 Attempt to tame _hemv threading #1820 7 years ago
  Andrew 898a8dcaba init 7 years ago
  Martin Kroeker 71c6deed60 Merge pull request #1821 from ashwinyes/develop_aarch64_armv8neonkernels 7 years ago
  Ashwin Sekhar T K 21f46a1cf2 ARM64: Use THUNDERX2T99 Neon Kernels for ARMV8 7 years ago
  Ashwin Sekhar T K caf339412f ARM64: Remove dependency of THUNDERX2T99 Makefile on CORTEXA57 Makefile 7 years ago
  Ashwin Sekhar T K 8001fdcd2a ARM64: Remove dependency of THUNDERX Makefile on ARMV8 Makefile 7 years ago
  Ashwin Sekhar T K 162e312832 ARM64: Remove dependency of CORTEXA57 Makefile on ARMV8 Makefile 7 years ago
  Ashwin Sekhar T K c3d93caa8d ARM64: Remove dependency of XGENE1 Makefile on ARMV8 Makefile 7 years ago
  Martin Kroeker a71923514f Merge pull request #1815 from fenrus75/sgemm_beta_fix 7 years ago
  Arjan van de Ven 55b244ca0d enable the SGEMM/SKX C based kernel 7 years ago
  Martin Kroeker 2263d3906c Merge pull request #1812 from martin-frbg/issue1806-2 7 years ago
  Martin Kroeker 81c9985c3a Use KERNEL_DEFINITIONS rather than COMMON_OPTS to pass -march=skylake-avx512 7 years ago
  Martin Kroeker 56ebc7b53e Merge pull request #1808 from martin-frbg/issue1806 7 years ago
  Martin Kroeker c5f88f5a57 Merge pull request #1807 from xianyi/revert-1798-cmake-avx512 7 years ago
  Martin Kroeker 8a11ec19d1 Syntax fix 7 years ago
  Martin Kroeker fa53b903db Add -march=skylake-avx512 to CFLAGS when the target is Skylake 7 years ago
  Martin Kroeker 84bcdf9c66 Revert "Add -march=skylake-avx512 when required" 7 years ago
  Martin Kroeker 8f7e986184 Merge pull request #1802 from martin-frbg/issue1801 7 years ago
  Martin Kroeker d0e83666ad Merge pull request #1804 from fenrus75/sgemm 7 years ago
  Arjan van de Ven d4bad73834 Add a C+intrinsics version of the SGEMM/skylakex kernel 7 years ago
  Martin Kroeker 065763adde Merge pull request #1800 from fengrl/patch-1 7 years ago
  Martin Kroeker 210b03b543 Merge pull request #1792 from martin-frbg/cmakesuffix 7 years ago
  Martin Kroeker 6234a32656 Use cygwin compilation workaround for avx512 on msys2/mingw64 as well 7 years ago
  Martin Kroeker c0d7cd3dac Merge pull request #1799 from martin-frbg/issue1796 7 years ago
  Martin Kroeker 667f0cc1cb Merge pull request #1793 from fenrus75/ncopy 7 years ago
  fengrl d4c8853a02 Update common_mips64.h 7 years ago
  Martin Kroeker d3d58f8ee5 Catch conflicting usage of ARCH in at least some BSD environments 7 years ago
  Martin Kroeker 697dc1baf8 Use override for ARCH in make.inc 7 years ago
  Martin Kroeker a9b51b8448 Merge pull request #1798 from martin-frbg/cmake-avx512 7 years ago
  Martin Kroeker eba394c711 Add -march=skylake-avx512 when required 7 years ago
  Arjan van de Ven 582c589727 dgemm/skylakex: replace discrete mul/add with fma 7 years ago
  Arjan van de Ven adbf6afa25 Add vector optimizations for ncopy as well for dgemm/skylakex 7 years ago
  Arjan van de Ven 32bec8afbb add a skylakex optimized dgemm beta function 7 years ago
  Martin Kroeker 6e2c494556 Merge pull request #1791 from dev-zero/develop 7 years ago
  Arjan van de Ven 20c5d668fe dgemm/avx512 simplify and speed up the 4x4 kernel 7 years ago
  Arjan van de Ven 6d43c51ccf undo slow dgemm/skylake microoptimization 7 years ago
  Arjan van de Ven d74dc39b0f Add optimized *copy versions for skylakex 7 years ago
  Martin Kroeker 41951da6d4 Merge pull request #6 from xianyi/develop 7 years ago
  Martin Kroeker 474f7e9583 Add SYMBOLPREFIX and -SUFFIX options and improve help output 7 years ago