44 Commits (develop)

Author SHA1 Message Date
  Martin Kroeker b37516add6
Add BGEMM parameters 2 months ago
  gkdddd 670ec6f757 Added shgemm_kernel_8x8 for RISCV64_ZVL128B and shgemm_kernel_16x8 for RISCV64_ZVL256B 4 months ago
  gxw 48698b2b1d LoongArch64: Rename core 1 year ago
  gxw d8c4ea8793 loongarch: Optimizing the performance of the GEMM on servers 1 year ago
  Martin Kroeker c6b1d8e7a3
fix improper function prototypes (empty parentheses) 2 years ago
  Wangyang Guo 3dc6052c7e initial support for Sapphire Rapids platform 4 years ago
  Wangyang Guo 045ed5c91d sbgemm: fix build error in BFLOAT16 disabled 4 years ago
  Wangyang Guo 8356a604f0 sbgemm: cooperlake: tuning for block params 4 years ago
  gxw 4b548857d6 Add msa support for loongson 4 years ago
  Martin Kroeker 85154c2e18
Change "HALF" and "sh" to "BFLOAT16" and "sb" 5 years ago
  Chen, Guobing e740c4873d Enable COOPERLAKE build target 5 years ago
  Rajalakshmi Srinivasaraghavan 7eb55504b1 RFC : Add half precision gemm for bfloat16 in OpenBLAS 5 years ago
  Ashwin Sekhar T K d50abc8903 ARM64: Move parameters from parameter.c to param.h 7 years ago
  Ashwin Sekhar T K 21f46a1cf2 ARM64: Use THUNDERX2T99 Neon Kernels for ARMV8 7 years ago
  Arjan van de Ven 99c7bba8e4 Initial support for SkylakeX / AVX512 7 years ago
  Denis Steckelmacher c9ff735da6 Add ZEN support (tested for auto-detected static backend) 8 years ago
  Ashwin Sekhar T K a86474c6f7 THUNDERX2T99: Performance fix for ZGEMM 8 years ago
  Ashwin Sekhar T K 19ba133383 THUNDERX2T99: Add Optimized ZGEMM Implementation 8 years ago
  Ashwin Sekhar T K 2757b49767 THUNDERX2T99: Add Optimized CGEMM Implementation 8 years ago
  Ashwin Sekhar T K f279ff4789 THUNDERX2T99: Add Optimized SGEMM Implementation 8 years ago
  Zhang Xianyi 0863a0d4b4 Merge pull request #1061 from ashwinyes/develop_aarch64_vulcan_thunderx_patch 8 years ago
  Werner Saar c1c5a63d3c prepared parameter.c for UNROLL values, that are not a power of two 8 years ago
  Ashwin Sekhar T K 4b55fae337 ARM64: Add Cavium THUNDERX2T99 Target 8 years ago
  Ashwin Sekhar T K 0b8e876d89 VULCAN: Add optimized DGEMM implementation 8 years ago
  Ashwin Sekhar T K 4713e7c47f ARM64: Add the VULCAN Target 9 years ago
  Werner Saar 78b05f6476 bugfix for EXCAVATOR and DYNAMIC_ARCH 9 years ago
  Zhang Xianyi 05196a8497 Refs #716. Only call getenv at init function. 9 years ago
  Werner Saar 4319769b79 added target processor STEAMROLLER 10 years ago
  wernsaar a64fe9bcc9 added optimized sgemv_n kernel for sandybridge 11 years ago
  wernsaar 2021d0f9d6 experimentally removed expensive function calls 11 years ago
  wernsaar 50e99a52ea added definitions for PILEDRIVER and HASWELL 11 years ago
  Zhang Xianyi 7a8949e0ce Merge branch 'develop' of https://github.com/TimothyGu/OpenBLAS into TimothyGu-develop 11 years ago
  Timothy Gu 6c2ead30f0 Remove all trailing whitespace except lapack-netlib 11 years ago
  Jameson Nash f41f03ab83 fix #394. this cleans up some handles after using them, and doesn't disable ALL process privileges upon success 11 years ago
  Zhang Xianyi bfaaa975e6 Added BULLDOZER target. So far it uses barcelona kernels. 13 years ago
  Zhang Xianyi d3b67d0bd8 Refs #113. Fixed the typo BOBCATE -> BOBCAT 13 years ago
  Zhang Xianyi d6cab3f37e Refs #113. Support AMD Bobcate using Barcelona kernel codes. Replace 3DNow! with MMX. 13 years ago
  Xianyi Zhang 19a48b82cf Init Sandybridge codes based on Nehalem. 13 years ago
  Wang Qian 8163ab7e55 Change the block size on Loongson 3B. 14 years ago
  Xianyi Zhang b95ad4cfaf Support detecting ICT Loongson-3B CPU. 14 years ago
  traz 831858b883 Modify aligned address of sa and sb to improve the performance of multi-threads. 14 years ago
  Xianyi Zhang 16fc083322 Refs #47. Fixed the seting parameter bug on Loongson 3A single thread version. 14 years ago
  Xianyi Zhang 4727fe8abf Refs #47. On Loongson 3A, set DGEMM_R parameter depending on different number of threads. It would improve double precision BLAS3 on multi-threads. 14 years ago
  Xianyi Zhang 342bbc3871 Import GotoBLAS2 1.13 BSD version codes. 14 years ago