182 Commits (de139337b8bcb1c76cd157afd4d5fd035a76efdf)

Author SHA1 Message Date
  Chen, Guobing e740c4873d Enable COOPERLAKE build target 5 years ago
  Marius Hillenbrand e115c97e05 s390x/SGEMM: adjust default P and Q to multiples of M 5 years ago
  Ashwin Sekhar T K 4e1be0e481 ARM64: Add THUNDERX3T110 Target 5 years ago
  Martin Kroeker bd2498c886 Use POWER6 GEMM parameters on 32bit POWER8 5 years ago
  Rajalakshmi Srinivasaraghavan d23419accc powerpc: Optimized SHGEMM kernel for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan 9fe930f205 powerpc: Add support for future processor 5 years ago
  Martin Kroeker f16e39554d Change PPCG4 CGEMM_M to match kernel change 5 years ago
  张丹枫 ea5bdc3f72 split cortex-a53 param to match 8x8 kernel 5 years ago
  Marius Hillenbrand 1b0b4349a1 s390x/Z14: Change register blocking for SGEMM to 16x4 5 years ago
  Martin Kroeker 03ff213c51 Increase POWER8 ZGEMM_R and use same R values for POWER9 5 years ago
  Martin Kroeker 00172d440b Typo fix in MIPS24K addition 5 years ago
  Martin Kroeker 61bbae3ac1 Handle MIPS24K like P5600 5 years ago
  Martin Kroeker a33d177430 Increase default BUFFER_SIZE on ARM, ZARCH and newer x86_64, add GEMM_R for POWER8/9 5 years ago
  Martin Kroeker 567d2760e6 Merge pull request #2520 from wjc404/develop 5 years ago
  wjc404 64daad4365 Update param.h 5 years ago
  Martin Kroeker ea8eec5d17 Merge pull request #2422 from wjc404/develop 5 years ago
  Ali Saidi c623a965f9 Add Neoverse-N1 core 5 years ago
  Martin Kroeker 8164fd1328 Always assume server-class cpu count for TSV110 and EMAG8180 5 years ago
  Martin Kroeker 71e5669c3e Add preliminary support for EMAG8180 ARMV8 processor 5 years ago
  wjc404 b0558c11b9 Update param.h 5 years ago
  wjc404 83b6be7976 Update param.h 5 years ago
  wjc404 f3f969f681 Update param.h 5 years ago
  Wang,Long fbf4f48f4a fix a few performance drop in some matrix size per data type 5 years ago
  wjc404 1c67567008 improve skylakex paralleled sgemm performance 5 years ago
  wjc404 b7b408a120 optimize AVX2 SGEMM 5 years ago
  wjc404 6362c34ee6 Update param.h 5 years ago
  wjc404 64639f440f Update param.h 5 years ago
  wjc404 611445c7f8 Update param.h 5 years ago
  wjc404 105e26e12a Adjust Haswell ZGEMM blocking parameters 5 years ago
  wjc404 e20709e976 Update param.h 5 years ago
  Martin Kroeker 6082e556cd Use "generic" S/CGEMM unroll M on big-endian PPC970 5 years ago
  Martin Kroeker 4c6a457358 Merge pull request #2300 from wjc404/develop 5 years ago
  wjc404 ae43b75a6a Add files via upload 6 years ago
  wjc404 274ff5cdb8 update sgemm_q on skylakex cpus 6 years ago
  Martin Kroeker df857551c0 Remove special parameter set for obsolete IOS/ARMV8 workaround 6 years ago
  wjc404 5da9484d93 Add files via upload 6 years ago
  Martin Kroeker 6b83079368 Count cpu cores on ARMV8 and use that to pick the GEMM_PQ parameters (#2267) 6 years ago
  Martin Kroeker 6b6c9b1441 Merge pull request #2172 from quickwritereader/develop 6 years ago
  AbdelRauf a97b301aaa cgemm/ctrmm power9 6 years ago
  pkubaj 7c7505a778 Fix build for PPC970 on FreeBSD pt.2 6 years ago
  AbdelRauf cdbfb891da new sgemm 8x16 6 years ago
  AbdelRauf d0c3543c3f power9 zgemm ztrmm optimized 6 years ago
  AbdelRauf a469b32cf4 sgemm pipeline improved, zgemm rewritten without inner packs, ABI lxvx v20 fixed with vs52 6 years ago
  AbdelRauf 8fe794f059 improved zgemm power9 based on power8 6 years ago
  AbdelRauf 628b335e83 Merge branch 'develop' of https://github.com/quickwritereader/OpenBLAS into develop 6 years ago
  AbdelRauf 0f105dd8a5 sgemm/strmm 6 years ago
  Martin Kroeker 7c51cc8527 Merge branch 'develop' into develop 6 years ago
  AbdelRauf 853a18bc17 power9 makefile. dgemm based on power8 kernel with following changes : 32x unrolled 16x4 kernel and 8x4 kernel using (lxv stxv butterfly rank1 update). improvement from 17 to 22-23gflops. dtrmm cases were added into dgemm itself 6 years ago
  Martin Kroeker 03d7110900 Merge pull request #2042 from maomao194313/develop 6 years ago
  maomao194313 7e3eb9b25d make DYNAMIC_ARCH=1 package work on TSV110 6 years ago