167 Commits (ce33da4cab30ce7692aaf19b33a6336d8210de52)

Author SHA1 Message Date
  Martin Kroeker ea8eec5d17 Merge pull request #2422 from wjc404/develop 5 years ago
  Ali Saidi c623a965f9 Add Neoverse-N1 core 5 years ago
  Martin Kroeker 8164fd1328 Always assume server-class cpu count for TSV110 and EMAG8180 5 years ago
  Martin Kroeker 71e5669c3e Add preliminary support for EMAG8180 ARMV8 processor 5 years ago
  wjc404 b0558c11b9 Update param.h 5 years ago
  wjc404 83b6be7976 Update param.h 5 years ago
  wjc404 f3f969f681 Update param.h 5 years ago
  Wang,Long fbf4f48f4a fix a few performance drop in some matrix size per data type 5 years ago
  wjc404 1c67567008 improve skylakex paralleled sgemm performance 5 years ago
  wjc404 b7b408a120 optimize AVX2 SGEMM 5 years ago
  wjc404 6362c34ee6 Update param.h 5 years ago
  wjc404 64639f440f Update param.h 5 years ago
  wjc404 611445c7f8 Update param.h 5 years ago
  wjc404 105e26e12a Adjust Haswell ZGEMM blocking parameters 5 years ago
  wjc404 e20709e976 Update param.h 5 years ago
  Martin Kroeker 6082e556cd Use "generic" S/CGEMM unroll M on big-endian PPC970 5 years ago
  Martin Kroeker 4c6a457358 Merge pull request #2300 from wjc404/develop 5 years ago
  wjc404 ae43b75a6a Add files via upload 6 years ago
  wjc404 274ff5cdb8 update sgemm_q on skylakex cpus 6 years ago
  Martin Kroeker df857551c0 Remove special parameter set for obsolete IOS/ARMV8 workaround 6 years ago
  wjc404 5da9484d93 Add files via upload 6 years ago
  Martin Kroeker 6b83079368 Count cpu cores on ARMV8 and use that to pick the GEMM_PQ parameters (#2267) 6 years ago
  Martin Kroeker 6b6c9b1441 Merge pull request #2172 from quickwritereader/develop 6 years ago
  AbdelRauf a97b301aaa cgemm/ctrmm power9 6 years ago
  pkubaj 7c7505a778 Fix build for PPC970 on FreeBSD pt.2 6 years ago
  AbdelRauf cdbfb891da new sgemm 8x16 6 years ago
  AbdelRauf d0c3543c3f power9 zgemm ztrmm optimized 6 years ago
  AbdelRauf a469b32cf4 sgemm pipeline improved, zgemm rewritten without inner packs, ABI lxvx v20 fixed with vs52 6 years ago
  AbdelRauf 8fe794f059 improved zgemm power9 based on power8 6 years ago
  AbdelRauf 628b335e83 Merge branch 'develop' of https://github.com/quickwritereader/OpenBLAS into develop 6 years ago
  AbdelRauf 0f105dd8a5 sgemm/strmm 6 years ago
  Martin Kroeker 7c51cc8527 Merge branch 'develop' into develop 6 years ago
  AbdelRauf 853a18bc17 power9 makefile. dgemm based on power8 kernel with following changes : 32x unrolled 16x4 kernel and 8x4 kernel using (lxv stxv butterfly rank1 update). improvement from 17 to 22-23gflops. dtrmm cases were added into dgemm itself 6 years ago
  Martin Kroeker 03d7110900 Merge pull request #2042 from maomao194313/develop 6 years ago
  maomao194313 7e3eb9b25d make DYNAMIC_ARCH=1 package work on TSV110 6 years ago
  ken-cunningham-webuse b0c714ef60 param.h : enable defines for PPC970 on DarwinOS 6 years ago
  Martin Kroeker bdc73a49e0 Add parameters for Z14 6 years ago
  Martin Kroeker bbfdd6c0fe Increase Zen SWITCH_RATIO to 16 6 years ago
  Arjan van de Ven b28f75cd7e set GEMM_PREFERED_SIZE for HASWELL 6 years ago
  Arjan van de Ven cdc668d82b Add a "sgemm direct" mode for small matrixes 6 years ago
  Renato Golin 310ea55f29 Simplifying ARMv8 build parameters 6 years ago
  Arjan van de Ven 5b708e5eb1 sgemm/dgemm: add a way for an arch kernel to specify prefered sizes 7 years ago
  Ashwin Sekhar T K d50abc8903 ARM64: Move parameters from parameter.c to param.h 7 years ago
  Ashwin Sekhar T K 21f46a1cf2 ARM64: Use THUNDERX2T99 Neon Kernels for ARMV8 7 years ago
  Martin Kroeker 4cf7315a5d Adjust ARMV8 SGEMM unrolling when using the C fallback kernel_2x2 for IOS 7 years ago
  Arjan van de Ven 6eb4b9ae7c Tune HASWELL SWITCH_RATIO as well 7 years ago
  Arjan van de Ven 5c6f008365 Tune param.h for SkylakeX 7 years ago
  Arjan van de Ven 99c7bba8e4 Initial support for SkylakeX / AVX512 7 years ago
  Martin Kroeker d94d7baf7e Add mips32r2 api target 7 years ago
  Shivraj Patil e3d844b062 Added mips I6500 core 8 years ago