44 Commits (28f8fdaf0f87c9bca2a79dd41536bd7ff2027e0c)

Author SHA1 Message Date
  Martin Kroeker 5141a90993
Fix ARMV9SME target in DYNAMIC_ARCH and add SME query code for MacOS (#5222) 6 months ago
  Vaisakh K V f66ca05b31
Merge branch 'develop' into topic/sgemm_direct_sme1 9 months ago
  Vaisakh K V d23eb3b93e Support for SME1 based sgemm_direct kernel for cblas_sgemm level 3 API 11 months ago
  Martin Kroeker 1829ac5b44
Add (dummy) declaration of SBROT_M 10 months ago
  tingbo.liao 3c8df6358f Further rearranged the rotm kernel for the different architectures. 10 months ago
  Honglin Zhu 90f041e348 Invoke the syscall to allow the use of amx tiles 2 years ago
  Martin Kroeker 437c0bf2b4
Merge pull request #3843 from Mousius/switch-ratio 2 years ago
  Chris Sidebottom 32f2fafde7 Propagate SWITCH_RATIO to DYNAMIC_ARCH builds 3 years ago
  Martin Kroeker 75d5e3eaf5
Replace ifdefs and fix conditional definitions for including only selected precisions in DYNAMIC_ARCH 2 years ago
  Martin Kroeker ee44082827
fix DYNAMIC_ARCH builds that use only a subset of precisions 2 years ago
  Honglin Zhu 4989e039a5 Define SBGEMM_ALIGN_K for DYNAMIC_ARCH build 3 years ago
  Honglin Zhu b00d5b9746 New sbgemm implementation for Neoverse N2 3 years ago
  Wangyang Guo 1d83ca4bca Small Matrix: support BFLOAT16 data type 4 years ago
  Wangyang Guo 478d1086c1 Small Matrix: support DYNAMIC_ARCH build 4 years ago
  Chen, Guobing a7b1f9b1bb Implementation of BF16 based gemv 5 years ago
  Martin Kroeker cb839575ed
Convert the prototypes of the unimplemented BFLOAT16 functions to the new naming scheme 5 years ago
  Martin Kroeker ca31c32693
Rename "HALF" and "sh" to "BFLOAT16" and "sb" 5 years ago
  Martin Kroeker 1c0b03efb4
Merge branch 'develop' into develop 5 years ago
  Martin Kroeker e396ec8b56
Allow building support for only a subset of variable types 5 years ago
  Martin Kroeker c5a32288c6
Work around sgemm_r/dgemm_r not being properly defined with BUILD_COMPLEX/BUILD_COMPLEX16 5 years ago
  Martin Kroeker b886bd672b
add defines for building a subset of types 5 years ago
  Chen, Guobing deaeb6c5b8 Add bfloat16 based dot and conversion with single/double 5 years ago
  Martin Kroeker 75eeb265d7
[WIP] Refactor the driver code for direct SGEMM (#2782) 5 years ago
  Martin Kroeker 5dd14e3d48
Make building the bfloat16 functions conditional on option BUILD_HALF (#2590) 5 years ago
  Rajalakshmi Srinivasaraghavan 67cc4b9e16 Fix warnings in clang and export symbol 5 years ago
  Rajalakshmi Srinivasaraghavan a87793e03c Fix DYNAMIC_ARCH compilation errors 5 years ago
  Rajalakshmi Srinivasaraghavan ac6a22ae78 Update header 5 years ago
  Rajalakshmi Srinivasaraghavan 7eb55504b1 RFC : Add half precision gemm for bfloat16 in OpenBLAS 5 years ago
  Martin Kroeker 5c42287c4f
Add declarations for ?sum and cblas_?sum 6 years ago
  Martin Kroeker 7e860acd38
Correct zgeadd_k prototype 8 years ago
  Isuru Fernando ca17b4b75c Fix complex support for MSVC headers 8 years ago
  Zhang Xianyi 69363622a8 Fix DYNAMIC_ARCH=1 bug. 10 years ago
  Martin Koehler 711ca33bc6 Improved Ximatcopy when lda==ldb. 10 years ago
  Martin Koehler 39cc6b21d3 Add ATLAS-style ?geadd function 10 years ago
  wernsaar f1b9a4a1ca Ref #454: fixed bug in common_param.h 11 years ago
  wernsaar 7aae4a62e7 enabled use of GEMM3M functions 11 years ago
  wernsaar 125610d23b allow to set custom value for ?GEMM_DEFAULT_UNROLL_MN, optimizations for syrk 11 years ago
  Timothy Gu 6c2ead30f0 Remove all trailing whitespace except lapack-netlib 11 years ago
  wernsaar cee257f384 Ref #51: added blas extensions zomatcopy and comatcopy 11 years ago
  wernsaar 7bfb3011e8 Ref #51: added blas extension somatcopy 11 years ago
  wernsaar 8c8f596238 Ref #51: added blas extension domatcopy as not opimized reference 11 years ago
  wernsaar faf3ac0aad Ref #285: added axpby kernels 11 years ago
  traits 9fc6764fa7 refs #55. Added DTB_ENTRIES into dynamic arch setting parameters. Now, it can read DTB_ENTRIES on runtime. 14 years ago
  Xianyi Zhang 342bbc3871 Import GotoBLAS2 1.13 BSD version codes. 15 years ago