90 Commits (947d7af4c9c4ea7e6edeb42ceeb865eab05d5c37)

Author SHA1 Message Date
  Chris Sidebottom 947d7af4c9 Fix CMake references to bscal and bgemv 2 months ago
  Martin Kroeker ff614575c9
Fix arm64 HAVE_SME setting for DYNAMIC_ARCH builds 2 months ago
  Chris Sidebottom 66d9185ebe Fix CMake support 2 months ago
  gkdddd 670ec6f757 Added shgemm_kernel_8x8 for RISCV64_ZVL128B and shgemm_kernel_16x8 for RISCV64_ZVL256B 4 months ago
  Martin Kroeker 5141a90993
Fix ARMV9SME target in DYNAMIC_ARCH and add SME query code for MacOS (#5222) 4 months ago
  Ruiyang Wu 02fd1df10b CMake: Pass `OpenMP` compiler and linker flags through CMake targets 6 months ago
  Martin Kroeker eb84aac7ad
Merge pull request #5084 from quic/topic/sgemm_direct_sme1 7 months ago
  Martin Kroeker b9ae246f20
define USE_TRMM for RISCV64 targets as well 7 months ago
  Vaisakh K V d23eb3b93e Support for SME1 based sgemm_direct kernel for cblas_sgemm level 3 API 10 months ago
  Martin Kroeker 1a6a9fb22f
add another generator line for rotm 8 months ago
  tingbo.liao 3c8df6358f Further rearranged the rotm kernel for the different architectures. 8 months ago
  Dmitry Mikushin d0f5dc763b Adding USE_GEMM3M macro to kernel targets, so that the *gemm3m functions and parameters can be included into the gotoblas structure. Fixes #4500 1 year ago
  Martin Kroeker 38d6fb4225
Fix dependencies in builds with specified subsets of precision types 2 years ago
  Martin Kroeker e412bee313
fix GEMM kernel dependencies in builds that use only a subset of precisions 2 years ago
  Martin Kroeker 84cb58b7fb
Fix generator rules for ?laswp_ncopy and ?neg_tcopy 3 years ago
  Martin Kroeker 0698212c8c
Remove stray $ 3 years ago
  Bine Brank f158d59087 adapt CMake 3 years ago
  Bine Brank 0c91d043ae adapt CMake for SVE 3 years ago
  Rafael Cardoso Fernandes Sousa 23a7561353 Fix error cmake (small kernels) 3 years ago
  Martin Kroeker 454edd741c
Merge pull request #3425 from binebrank/arm_sve_dgemm 3 years ago
  Bine Brank 1af73ce38e Adapt CMake for SVE 3 years ago
  Rafael Cardoso Fernandes Sousa d5c9353f1b Modify the order that cmake set the KERNEL variables (generic now is fallback) 3 years ago
  Wangyang Guo 3dc6052c7e initial support for Sapphire Rapids platform 4 years ago
  Martin Kroeker ddf106f769
Add dedicated entries for BFLOAT16 kernels 4 years ago
  Wangyang Guo 989e6bbdd3 Small Matrix: reduce generic kernel source files 4 years ago
  Wangyang Guo fee5abd84b Small Matrix: support cmake build 4 years ago
  Albert Ziegenhagel e3f4063683 Fix building "generic" TRMM kernel with CMake 4 years ago
  Martin Kroeker 3aecafad80
Change "HALF" and "sh" to "BFLOAT16" and "sb" 5 years ago
  Martin Kroeker 6b6adf8a4a
Allow compiling only a subset of kernels for specific variable types 5 years ago
  Martin Kroeker dfbc62ef7e
Support building only a subset of types 5 years ago
  Martin Kroeker 9ee21a0a39
Merge pull request #2780 from Guobing-Chen/CPL_build_support 5 years ago
  Martin Kroeker 75eeb265d7
[WIP] Refactor the driver code for direct SGEMM (#2782) 5 years ago
  Chen, Guobing e740c4873d Enable COOPERLAKE build target 5 years ago
  Rajalakshmi Srinivasaraghavan 9fe930f205 powerpc: Add support for future processor 5 years ago
  Martin Kroeker 5dd14e3d48
Make building the bfloat16 functions conditional on option BUILD_HALF (#2590) 5 years ago
  Rajalakshmi Srinivasaraghavan 22bb50fb81 cmake fixes 5 years ago
  Martin Liska 0e7f43c898
Add missing USE_MIN in kernel/CMakeLists.txt. 5 years ago
  Martin Kroeker 7e5cbb6f35
Fix bad conditional syntax that caused spurious application of USE_TRMM 5 years ago
  Martin Kroeker b9f4943a14
Add ?sum 6 years ago
  Martin Kroeker 2fc712469d
Avoid creating spurious non-suffixed c/zgemm_kernels 6 years ago
  Arjan van de Ven 99c7bba8e4 Initial support for SkylakeX / AVX512 7 years ago
  Martin Kroeker 485df77612
Make USE_TRMM depend on TARGET_CORE not TARGET 7 years ago
  Martin Kroeker c7a8512d12 Cmake fixes for DYNAMIC_ARCH builds and whitespace in path names (#1323) 8 years ago
  Sacha Refshauge 47ebce4d1a Clean up, fix old typos. Simplify arch usages. Move system arch check to earlier position. 8 years ago
  Sacha Refshauge 69b560751c Improvements to previous commit (cross-compile). 8 years ago
  Sacha Refshauge 11911fd941 Add kernel/Makefile.LA to CMake 8 years ago
  Isuru Fernando d3b677fe87 Add commonobjs 8 years ago
  Isuru Fernando 505b218829 Merge remote-tracking branch 'upstream/develop' into dyn 8 years ago
  Isuru Fernando d9346930dd Merge remote-tracking branch 'upstream/develop' into develop 8 years ago
  Isuru Fernando 7892434572 Add hemm3m and symm3m objects 8 years ago