176 Commits (72888497e2ffb6233ffd18ccf0b4d4bb01701b17)

Author SHA1 Message Date
  Martin Kroeker f16e39554d
Change PPCG4 CGEMM_M to match kernel change 5 years ago
  张丹枫 ea5bdc3f72 split cortex-a53 param to match 8x8 kernel 5 years ago
  Marius Hillenbrand 1b0b4349a1 s390x/Z14: Change register blocking for SGEMM to 16x4 5 years ago
  Martin Kroeker 03ff213c51
Increase POWER8 ZGEMM_R and use same R values for POWER9 5 years ago
  Martin Kroeker 00172d440b
Typo fix in MIPS24K addition 5 years ago
  Martin Kroeker 61bbae3ac1
Handle MIPS24K like P5600 5 years ago
  Martin Kroeker a33d177430
Increase default BUFFER_SIZE on ARM, ZARCH and newer x86_64, add GEMM_R for POWER8/9 5 years ago
  Martin Kroeker 567d2760e6
Merge pull request #2520 from wjc404/develop 5 years ago
  wjc404 64daad4365
Update param.h 5 years ago
  Martin Kroeker ea8eec5d17
Merge pull request #2422 from wjc404/develop 5 years ago
  Ali Saidi c623a965f9 Add Neoverse-N1 core 5 years ago
  Martin Kroeker 8164fd1328
Always assume server-class cpu count for TSV110 and EMAG8180 5 years ago
  Martin Kroeker 71e5669c3e
Add preliminary support for EMAG8180 ARMV8 processor 5 years ago
  wjc404 b0558c11b9
Update param.h 5 years ago
  wjc404 83b6be7976
Update param.h 5 years ago
  wjc404 f3f969f681
Update param.h 5 years ago
  Wang,Long fbf4f48f4a fix a few performance drop in some matrix size per data type 5 years ago
  wjc404 1c67567008
improve skylakex paralleled sgemm performance 5 years ago
  wjc404 b7b408a120
optimize AVX2 SGEMM 5 years ago
  wjc404 6362c34ee6
Update param.h 5 years ago
  wjc404 64639f440f
Update param.h 5 years ago
  wjc404 611445c7f8
Update param.h 5 years ago
  wjc404 105e26e12a
Adjust Haswell ZGEMM blocking parameters 5 years ago
  wjc404 e20709e976
Update param.h 5 years ago
  Martin Kroeker 6082e556cd
Use "generic" S/CGEMM unroll M on big-endian PPC970 5 years ago
  Martin Kroeker 4c6a457358
Merge pull request #2300 from wjc404/develop 5 years ago
  wjc404 ae43b75a6a
Add files via upload 6 years ago
  wjc404 274ff5cdb8
update sgemm_q on skylakex cpus 6 years ago
  Martin Kroeker df857551c0
Remove special parameter set for obsolete IOS/ARMV8 workaround 6 years ago
  wjc404 5da9484d93
Add files via upload 6 years ago
  Martin Kroeker 6b83079368
Count cpu cores on ARMV8 and use that to pick the GEMM_PQ parameters (#2267) 6 years ago
  Martin Kroeker 6b6c9b1441
Merge pull request #2172 from quickwritereader/develop 6 years ago
  AbdelRauf a97b301aaa cgemm/ctrmm power9 6 years ago
  pkubaj 7c7505a778
Fix build for PPC970 on FreeBSD pt.2 6 years ago
  AbdelRauf cdbfb891da new sgemm 8x16 6 years ago
  AbdelRauf d0c3543c3f power9 zgemm ztrmm optimized 6 years ago
  AbdelRauf a469b32cf4 sgemm pipeline improved, zgemm rewritten without inner packs, ABI lxvx v20 fixed with vs52 6 years ago
  AbdelRauf 8fe794f059 improved zgemm power9 based on power8 6 years ago
  AbdelRauf 628b335e83 Merge branch 'develop' of https://github.com/quickwritereader/OpenBLAS into develop 6 years ago
  AbdelRauf 0f105dd8a5 sgemm/strmm 6 years ago
  Martin Kroeker 7c51cc8527
Merge branch 'develop' into develop 6 years ago
  AbdelRauf 853a18bc17 power9 makefile. dgemm based on power8 kernel with following changes : 32x unrolled 16x4 kernel and 8x4 kernel using (lxv stxv butterfly rank1 update). improvement from 17 to 22-23gflops. dtrmm cases were added into dgemm itself 6 years ago
  Martin Kroeker 03d7110900
Merge pull request #2042 from maomao194313/develop 6 years ago
  maomao194313 7e3eb9b25d
make DYNAMIC_ARCH=1 package work on TSV110 6 years ago
  ken-cunningham-webuse b0c714ef60 param.h : enable defines for PPC970 on DarwinOS 6 years ago
  Martin Kroeker bdc73a49e0
Add parameters for Z14 6 years ago
  Martin Kroeker bbfdd6c0fe
Increase Zen SWITCH_RATIO to 16 6 years ago
  Arjan van de Ven b28f75cd7e set GEMM_PREFERED_SIZE for HASWELL 6 years ago
  Arjan van de Ven cdc668d82b Add a "sgemm direct" mode for small matrixes 6 years ago
  Renato Golin 310ea55f29 Simplifying ARMv8 build parameters 6 years ago