192 Commits (60e1fddca7634917a56bcc4cb43bbbee08eb136a)

Author SHA1 Message Date
  Xianyi Zhang fc35b72ae1 Refs #2899 4 years ago
  Xianyi Zhang 913cc9a4ca Merge branch 'develop' into risc-v 4 years ago
  Rajalakshmi Srinivasaraghavan dd7a9cc5bf POWER10: Change dgemm unroll factors 4 years ago
  Zhang Xianyi d7ba7679b6 Merge branch 'develop' into risc-v 5 years ago
  damonyu ef8e7d0279 Add the support for RISC-V Vector. 5 years ago
  Martin Kroeker ca31c32693
Rename "HALF" and "sh" to "BFLOAT16" and "sb" 5 years ago
  Chen, Guobing e740c4873d Enable COOPERLAKE build target 5 years ago
  Marius Hillenbrand e115c97e05 s390x/SGEMM: adjust default P and Q to multiples of M 5 years ago
  Ashwin Sekhar T K 4e1be0e481 ARM64: Add THUNDERX3T110 Target 5 years ago
  Martin Kroeker bd2498c886
Use POWER6 GEMM parameters on 32bit POWER8 5 years ago
  Rajalakshmi Srinivasaraghavan d23419accc powerpc: Optimized SHGEMM kernel for POWER10 5 years ago
  Rajalakshmi Srinivasaraghavan 9fe930f205 powerpc: Add support for future processor 5 years ago
  Martin Kroeker f16e39554d
Change PPCG4 CGEMM_M to match kernel change 5 years ago
  张丹枫 ea5bdc3f72 split cortex-a53 param to match 8x8 kernel 5 years ago
  Marius Hillenbrand 1b0b4349a1 s390x/Z14: Change register blocking for SGEMM to 16x4 5 years ago
  Martin Kroeker 03ff213c51
Increase POWER8 ZGEMM_R and use same R values for POWER9 5 years ago
  Martin Kroeker 00172d440b
Typo fix in MIPS24K addition 5 years ago
  Martin Kroeker 61bbae3ac1
Handle MIPS24K like P5600 5 years ago
  Martin Kroeker a33d177430
Increase default BUFFER_SIZE on ARM, ZARCH and newer x86_64, add GEMM_R for POWER8/9 5 years ago
  Martin Kroeker 567d2760e6
Merge pull request #2520 from wjc404/develop 5 years ago
  wjc404 64daad4365
Update param.h 5 years ago
  Martin Kroeker ea8eec5d17
Merge pull request #2422 from wjc404/develop 5 years ago
  Ali Saidi c623a965f9 Add Neoverse-N1 core 5 years ago
  Xianyi Zhang 265ab484c8 Change default RISC-V 64-bit corename to RISCV64_GENERIC 5 years ago
  Xianyi Zhang 4aa2d89217 Merge branch 'develop' into risc-v 5 years ago
  Martin Kroeker 8164fd1328
Always assume server-class cpu count for TSV110 and EMAG8180 5 years ago
  Martin Kroeker 71e5669c3e
Add preliminary support for EMAG8180 ARMV8 processor 5 years ago
  wjc404 b0558c11b9
Update param.h 5 years ago
  wjc404 83b6be7976
Update param.h 5 years ago
  wjc404 f3f969f681
Update param.h 5 years ago
  Wang,Long fbf4f48f4a fix a few performance drop in some matrix size per data type 5 years ago
  wjc404 1c67567008
improve skylakex paralleled sgemm performance 5 years ago
  wjc404 b7b408a120
optimize AVX2 SGEMM 5 years ago
  wjc404 6362c34ee6
Update param.h 5 years ago
  wjc404 64639f440f
Update param.h 5 years ago
  wjc404 611445c7f8
Update param.h 5 years ago
  wjc404 105e26e12a
Adjust Haswell ZGEMM blocking parameters 5 years ago
  wjc404 e20709e976
Update param.h 5 years ago
  Martin Kroeker 6082e556cd
Use "generic" S/CGEMM unroll M on big-endian PPC970 5 years ago
  Martin Kroeker 4c6a457358
Merge pull request #2300 from wjc404/develop 5 years ago
  wjc404 ae43b75a6a
Add files via upload 6 years ago
  wjc404 274ff5cdb8
update sgemm_q on skylakex cpus 6 years ago
  Martin Kroeker df857551c0
Remove special parameter set for obsolete IOS/ARMV8 workaround 6 years ago
  wjc404 5da9484d93
Add files via upload 6 years ago
  Martin Kroeker 6b83079368
Count cpu cores on ARMV8 and use that to pick the GEMM_PQ parameters (#2267) 6 years ago
  Martin Kroeker 6b6c9b1441
Merge pull request #2172 from quickwritereader/develop 6 years ago
  AbdelRauf a97b301aaa cgemm/ctrmm power9 6 years ago
  pkubaj 7c7505a778
Fix build for PPC970 on FreeBSD pt.2 6 years ago
  AbdelRauf cdbfb891da new sgemm 8x16 6 years ago
  AbdelRauf d0c3543c3f power9 zgemm ztrmm optimized 6 years ago