17 Commits (39c90f985930d23b1591a5d030a0bfe82fa86a1e)

Author SHA1 Message Date
  Rajendra Prasad Matcha eae0abfdb6 SME1 based direct kernel with alpha and beta for cblas_sgemm level 3 API. 2 months ago
  Vaisakh K V f66ca05b31
Merge branch 'develop' into topic/sgemm_direct_sme1 7 months ago
  Vaisakh K V d23eb3b93e Support for SME1 based sgemm_direct kernel for cblas_sgemm level 3 API 10 months ago
  tingbo.liao 3c8df6358f Further rearranged the rotm kernel for the different architectures. 8 months ago
  Wangyang Guo 478d1086c1 Small Matrix: support DYNAMIC_ARCH build 4 years ago
  Wangyang Guo 5dc7c3c8e5 Small Matrix: add GEMM_SMALL_MATRIX_PERMIT to tune small matrics case 4 years ago
  Xianyi Zhang 17d32a4a82 Change a1b0 gemm to b0 gemm. 5 years ago
  Xianyi Zhang be3349405d Add alpha=1.0 beta=0.0 for small gemm. 5 years ago
  Xianyi Zhang 0a2077901c Add small marix optimization kernel interface. 5 years ago
  Martin Kroeker 75eeb265d7
[WIP] Refactor the driver code for direct SGEMM (#2782) 5 years ago
  Martin Kroeker 5c42287c4f
Add declarations for ?sum and cblas_?sum 6 years ago
  Martin Koehler 711ca33bc6 Improved Ximatcopy when lda==ldb. 10 years ago
  Martin Koehler 39cc6b21d3 Add ATLAS-style ?geadd function 10 years ago
  wernsaar 7bfb3011e8 Ref #51: added blas extension somatcopy 11 years ago
  wernsaar faf3ac0aad Ref #285: added axpby kernels 11 years ago
  wernsaar 9db0fb8b02 bugfix for sdsdot 11 years ago
  Xianyi Zhang 342bbc3871 Import GotoBLAS2 1.13 BSD version codes. 14 years ago