63 Commits (be3349405d1b3e09d0084d77f0167cb0b4ed2aae)

Author SHA1 Message Date
  Xianyi Zhang be3349405d Add alpha=1.0 beta=0.0 for small gemm. 5 years ago
  Xianyi Zhang 0a2077901c Add small marix optimization kernel interface. 5 years ago
  Martin Kroeker c4da892ba0
Only filter out -mavx on Sandybridge ZGEMM/ZTRMM kernels 4 years ago
  Martin Kroeker bd60fb6ffc
filter out -mavx flag on zgemm kernels as it can cause problems with older gcc 4 years ago
  gxw 4b548857d6 Add msa support for loongson 4 years ago
  Zhang Xianyi d7ba7679b6 Merge branch 'develop' into risc-v 5 years ago
  Rajalakshmi Srinivasaraghavan b5d30b390d Fix build issues with bfloat16 5 years ago
  Martin Kroeker 3aecafad80
Change "HALF" and "sh" to "BFLOAT16" and "sb" 5 years ago
  Martin Kroeker 6b6adf8a4a
Allow compiling only a subset of kernels for specific variable types 5 years ago
  Martin Kroeker 9ee21a0a39
Merge pull request #2780 from Guobing-Chen/CPL_build_support 5 years ago
  Martin Kroeker 75eeb265d7
[WIP] Refactor the driver code for direct SGEMM (#2782) 5 years ago
  Chen, Guobing e740c4873d Enable COOPERLAKE build target 5 years ago
  Rajalakshmi Srinivasaraghavan 475b5c95b9 Remove extra symbol in Makefile 5 years ago
  Martin Kroeker da17abec87
fix trailing whitespace 5 years ago
  Martin Kroeker b144423f0f
Do not define USE_TRMM for 32bit POWER8 5 years ago
  Martin Kroeker ed7e155c35
Merge branch 'develop' into aix 5 years ago
  Martin Kroeker c854ef5471
Fix variable names in conditional 5 years ago
  Martin Kroeker c0afc11742
Fix POWERPC builds on AIX (gcc/gfortran 7) 5 years ago
  Kavana Bhat df4ade070f Fix for #2671 5 years ago
  Rajalakshmi Srinivasaraghavan 9fe930f205 powerpc: Add support for future processor 5 years ago
  Martin Kroeker 5dd14e3d48
Make building the bfloat16 functions conditional on option BUILD_HALF (#2590) 5 years ago
  Rajalakshmi Srinivasaraghavan ff010f496e Build shgemm for all architecture 5 years ago
  Rajalakshmi Srinivasaraghavan 7eb55504b1 RFC : Add half precision gemm for bfloat16 in OpenBLAS 5 years ago
  Xianyi Zhang 4aa2d89217 Merge branch 'develop' into risc-v 5 years ago
  Martin Kroeker 1a6ea8ee6d
Merge pull request #2338 from kavanabhat/aix_mod 5 years ago
  Kavana Bhat 6baa9b07d7 AIX changes for Power8 5 years ago
  Kavana Bhat 3938e59569 AIX changes for Power8 5 years ago
  Martin Kroeker e7c4d6705a
Revert #2051 and replace with a better fix (#2261) 6 years ago
  Kavana Bhat 3dc6b26eff AIX changes for Power8 6 years ago
  Martin Kroeker 7c51cc8527
Merge branch 'develop' into develop 6 years ago
  AbdelRauf 853a18bc17 power9 makefile. dgemm based on power8 kernel with following changes : 32x unrolled 16x4 kernel and 8x4 kernel using (lxv stxv butterfly rank1 update). improvement from 17 to 22-23gflops. dtrmm cases were added into dgemm itself 6 years ago
  Martin Kroeker 5b95534afc
Make TARGET=GENERIC compatible with DYNAMIC_ARCH=1 6 years ago
  Martin Kroeker 885a3c4350
USE_TRMM on Z14 6 years ago
  Martin Kroeker f3fd44a731
Set USE_TRMM for all ZARCH variants to fix TRMM faults with zarch-generic 7 years ago
  Arjan van de Ven 99c7bba8e4 Initial support for SkylakeX / AVX512 7 years ago
  Martin Kroeker 82012b960b
Revert " Switch mips32 target to USE_TRMM to fix complex TRMM" 7 years ago
  Martin Kroeker 018f2dad27
Switch mips32 target to USE_TRMM to fix complex TRMM 7 years ago
  Martin Kroeker 9c5518319a
Revert "Fix 32bit HASWELL builds" 7 years ago
  Jerry Zhao 0ee395db35 Fixed TRMM and SYMM for RISCV 7 years ago
  Martin Kroeker 0e2cf102e1 Fix 32bit HASWELL 8 years ago
  Denis Steckelmacher c9ff735da6 Add ZEN support (tested for auto-detected static backend) 8 years ago
  Zhang Xianyi b678471d65 Merge branch 'z13' into develop 8 years ago
  Zhang Xianyi 864e202afd Add USE_TRMM=1 for IBM z13 in kernel/Makefile.L3 8 years ago
  Kaustubh Raste c8a7860eb3 STRSM optimized 9 years ago
  Werner Saar b752858d6c added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 9 years ago
  Zhang Xianyi 94b125255f Merge branch 'develop' into cmake 10 years ago
  Martin Koehler 711ca33bc6 Improved Ximatcopy when lda==ldb. 10 years ago
  Zhang Xianyi f874465bb8 Use cmake to build OpenBLAS GENERIC Target on MSVC x86 64-bit. 10 years ago
  Werner Saar 9bd962f655 modified haswell parameter dgemm_unroll_n 10 years ago
  Zhang Xianyi ea7f9dacf4 Refs #509. Fixed geadd building bug with DYNAMIC_ARCH=1. 10 years ago