50 Commits (3906ef3b0fb19e7436f2b4cf6394b11f3466b1f3)

Author SHA1 Message Date
  gxw 4b548857d6 Add msa support for loongson 4 years ago
  Chen, Guobing a7b1f9b1bb Implementation of BF16 based gemv 5 years ago
  Martin Kroeker 10379fc83b
Use ifdef instead of if 5 years ago
  Martin Kroeker 3aecafad80
Change "HALF" and "sh" to "BFLOAT16" and "sb" 5 years ago
  Martin Kroeker 6b6adf8a4a
Allow compiling only a subset of kernels for specific variable types 5 years ago
  Martin Kroeker dfbc62ef7e
Support building only a subset of types 5 years ago
  Chen, Guobing deaeb6c5b8 Add bfloat16 based dot and conversion with single/double 5 years ago
  Martin Kroeker 9ee21a0a39
Merge pull request #2780 from Guobing-Chen/CPL_build_support 5 years ago
  Martin Kroeker 75eeb265d7
[WIP] Refactor the driver code for direct SGEMM (#2782) 5 years ago
  Chen, Guobing e740c4873d Enable COOPERLAKE build target 5 years ago
  Martin Kroeker 5dd14e3d48
Make building the bfloat16 functions conditional on option BUILD_HALF (#2590) 5 years ago
  Rajalakshmi Srinivasaraghavan 67cc4b9e16 Fix warnings in clang and export symbol 5 years ago
  Rajalakshmi Srinivasaraghavan a87793e03c Fix DYNAMIC_ARCH compilation errors 5 years ago
  Rajalakshmi Srinivasaraghavan 7eb55504b1 RFC : Add half precision gemm for bfloat16 in OpenBLAS 5 years ago
  int_13h 96ad579428 add in runtime cpu detection for zarch (#2349) 5 years ago
  Martin Kroeker ccfb7ead15
Merge pull request #2072 from martin-frbg/sum 6 years ago
  Rashmica Gupta bcdf1d4917 Add in runtime CPU detection for POWER. 6 years ago
  Martin Kroeker b9f4943a14
Add ?sum 6 years ago
  Ashwin Sekhar T K d5aeff636f ARM64: Enable DYNAMIC_ARCH 7 years ago
  Ashwin Sekhar T K e7b66cd36e ARM64: Fix DYNAMIC_ARCH compilation for cores which dont use GEMM3M 7 years ago
  Martin Kroeker 6f71c0fce4
Return a somewhat sane default value for L2 cache size if cpuid retur… (#1611) 7 years ago
  Arjan van de Ven 99c7bba8e4 Initial support for SkylakeX / AVX512 7 years ago
  Martin Kroeker acf3d34bc5
Silence an unused variable warning with a cast 7 years ago
  Denis Steckelmacher c9ff735da6 Add ZEN support (tested for auto-detected static backend) 8 years ago
  Werner Saar 28e2fab33e prepared kernel/setparam-ref.c for UNROLL values, that are not a power of two 8 years ago
  Werner Saar 78b05f6476 bugfix for EXCAVATOR and DYNAMIC_ARCH 9 years ago
  Zhang Xianyi 69363622a8 Fix DYNAMIC_ARCH=1 bug. 10 years ago
  Zhang Xianyi ea7f9dacf4 Refs #509. Fixed geadd building bug with DYNAMIC_ARCH=1. 10 years ago
  Werner Saar 4319769b79 added target processor STEAMROLLER 10 years ago
  wernsaar 7aae4a62e7 enabled use of GEMM3M functions 11 years ago
  wernsaar 125610d23b allow to set custom value for ?GEMM_DEFAULT_UNROLL_MN, optimizations for syrk 11 years ago
  wernsaar 9964ed2f79 bugfix for CORE2 11 years ago
  wernsaar f7267d9b0e added missing definition for DUNNINGTON 11 years ago
  Timothy Gu 6c2ead30f0 Remove all trailing whitespace except lapack-netlib 11 years ago
  wernsaar cee257f384 Ref #51: added blas extensions zomatcopy and comatcopy 11 years ago
  wernsaar 7bfb3011e8 Ref #51: added blas extension somatcopy 11 years ago
  wernsaar 8c8f596238 Ref #51: added blas extension domatcopy as not opimized reference 11 years ago
  wernsaar faf3ac0aad Ref #285: added axpby kernels 11 years ago
  wernsaar 53eaf41901 added support for HASWELL 12 years ago
  Zhang Xianyi 886cbaf4e4 Support AMD Piledriver by bulldozer kernels. 12 years ago
  Zhang Xianyi 5c8bf6ae0e Merge branch 'bulldozer' into develop 12 years ago
  Julian Taylor 9fb341a9f8 set parameters for CORE_ATHLON 12 years ago
  Zhang Xianyi bfaaa975e6 Added BULLDOZER target. So far it uses barcelona kernels. 13 years ago
  Zhang Xianyi d3b67d0bd8 Refs #113. Fixed the typo BOBCATE -> BOBCAT 13 years ago
  Zhang Xianyi d6cab3f37e Refs #113. Support AMD Bobcate using Barcelona kernel codes. Replace 3DNow! with MMX. 13 years ago
  Xianyi Zhang 19a48b82cf Init Sandybridge codes based on Nehalem. 13 years ago
  traits 9fc6764fa7 refs #55. Added DTB_ENTRIES into dynamic arch setting parameters. Now, it can read DTB_ENTRIES on runtime. 14 years ago
  Xianyi Zhang b3d1887745 Fixed #35 a build bug with NO_LAPACK=1 DYNAMIC_ARCH=1 FC=gfortran. I forgot to test it with gfortran in last bug fixed commit. 14 years ago
  Xianyi Zhang 8d50a9fd1a Fixed #35 a build bug with NO_LAPACK=1 & DYNAMIC_ARCH=1. 14 years ago
  Xianyi Zhang 342bbc3871 Import GotoBLAS2 1.13 BSD version codes. 14 years ago