91 Commits (d96daa220de9e5c9d8d69f332a6fa550181c7f7e)

Author SHA1 Message Date
  Rajalakshmi Srinivasaraghavan 7eb55504b1 RFC : Add half precision gemm for bfloat16 in OpenBLAS 5 years ago
  Xianyi Zhang 4aa2d89217 Merge branch 'develop' into risc-v 5 years ago
  Martin Kroeker 1a6ea8ee6d Merge pull request #2338 from kavanabhat/aix_mod 5 years ago
  Kavana Bhat 6baa9b07d7 AIX changes for Power8 5 years ago
  Kavana Bhat 3938e59569 AIX changes for Power8 5 years ago
  Martin Kroeker e7c4d6705a Revert #2051 and replace with a better fix (#2261) 6 years ago
  Kavana Bhat 3dc6b26eff AIX changes for Power8 6 years ago
  Martin Kroeker 7c51cc8527 Merge branch 'develop' into develop 6 years ago
  AbdelRauf 853a18bc17 power9 makefile. dgemm based on power8 kernel with following changes : 32x unrolled 16x4 kernel and 8x4 kernel using (lxv stxv butterfly rank1 update). improvement from 17 to 22-23gflops. dtrmm cases were added into dgemm itself 6 years ago
  Martin Kroeker 5b95534afc Make TARGET=GENERIC compatible with DYNAMIC_ARCH=1 6 years ago
  Martin Kroeker 885a3c4350 USE_TRMM on Z14 6 years ago
  Martin Kroeker f3fd44a731 Set USE_TRMM for all ZARCH variants to fix TRMM faults with zarch-generic 7 years ago
  Arjan van de Ven 99c7bba8e4 Initial support for SkylakeX / AVX512 7 years ago
  Martin Kroeker 82012b960b Revert " Switch mips32 target to USE_TRMM to fix complex TRMM" 7 years ago
  Martin Kroeker 018f2dad27 Switch mips32 target to USE_TRMM to fix complex TRMM 7 years ago
  Martin Kroeker 9c5518319a Revert "Fix 32bit HASWELL builds" 7 years ago
  Jerry Zhao 0ee395db35 Fixed TRMM and SYMM for RISCV 7 years ago
  Martin Kroeker 0e2cf102e1 Fix 32bit HASWELL 8 years ago
  Denis Steckelmacher c9ff735da6 Add ZEN support (tested for auto-detected static backend) 8 years ago
  Zhang Xianyi b678471d65 Merge branch 'z13' into develop 8 years ago
  Zhang Xianyi 864e202afd Add USE_TRMM=1 for IBM z13 in kernel/Makefile.L3 8 years ago
  Kaustubh Raste c8a7860eb3 STRSM optimized 9 years ago
  Werner Saar b752858d6c added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8 9 years ago
  Zhang Xianyi 94b125255f Merge branch 'develop' into cmake 10 years ago
  Martin Koehler 711ca33bc6 Improved Ximatcopy when lda==ldb. 10 years ago
  Zhang Xianyi f874465bb8 Use cmake to build OpenBLAS GENERIC Target on MSVC x86 64-bit. 10 years ago
  Werner Saar 9bd962f655 modified haswell parameter dgemm_unroll_n 10 years ago
  Zhang Xianyi ea7f9dacf4 Refs #509. Fixed geadd building bug with DYNAMIC_ARCH=1. 10 years ago
  Martin Koehler 39cc6b21d3 Add ATLAS-style ?geadd function 10 years ago
  Zhang Xianyi a85c2785ae Refs #467. Added generic kernel file for x86_64. 11 years ago
  wernsaar e80b144932 enabled compiling of *3M functions 11 years ago
  wernsaar be94db096c disabled *3M functions for x86_64 platforms 11 years ago
  Timothy Gu 6c2ead30f0 Remove all trailing whitespace except lapack-netlib 11 years ago
  wernsaar cee257f384 Ref #51: added blas extensions zomatcopy and comatcopy 11 years ago
  wernsaar 7bfb3011e8 Ref #51: added blas extension somatcopy 11 years ago
  wernsaar 8c8f596238 Ref #51: added blas extension domatcopy as not opimized reference 11 years ago
  wernsaar ffe70b1fdc modified Makefile.L3 12 years ago
  wernsaar cff70a666d added generic trmm kernels and modified Makefile.L3 12 years ago
  wernsaar d854b30ae6 Added UNROLL values for 3M to getarch_2nd.c, Makefile.system and Makefile.L3 12 years ago
  Wang Qian 8e53b57bb2 Appending gemmkernel and trmmkernel C code in kernel/generic, this code can be used to execute on a new platform which dose not have optimized assemble kernel. 13 years ago
  Xianyi Zhang 342bbc3871 Import GotoBLAS2 1.13 BSD version codes. 14 years ago