49 Commits (7c1839899e81829b096c62e73804d6859a0beed1)

Author SHA1 Message Date
  Martin Kroeker 39c90f9859
Merge pull request #5380 from quic/topic/sgemm_direct_sme1_alpha_beta 2 months ago
  Rajendra Prasad Matcha eae0abfdb6 SME1 based direct kernel with alpha and beta for cblas_sgemm level 3 API. 2 months ago
  Chris Sidebottom e105411460 Add infrastructure for bgemv/bscal 2 months ago
  Chris Sidebottom f95e7b0e32 Add infrastructure for BGEMM 3 months ago
  Srangrang 0a967797a1 Add FP16 support for RISCV 4 months ago
  Martin Kroeker 5141a90993
Fix ARMV9SME target in DYNAMIC_ARCH and add SME query code for MacOS (#5222) 4 months ago
  Vaisakh K V f66ca05b31
Merge branch 'develop' into topic/sgemm_direct_sme1 7 months ago
  Vaisakh K V d23eb3b93e Support for SME1 based sgemm_direct kernel for cblas_sgemm level 3 API 10 months ago
  Martin Kroeker 1829ac5b44
Add (dummy) declaration of SBROT_M 8 months ago
  tingbo.liao 3c8df6358f Further rearranged the rotm kernel for the different architectures. 8 months ago
  Honglin Zhu 90f041e348 Invoke the syscall to allow the use of amx tiles 2 years ago
  Martin Kroeker 437c0bf2b4
Merge pull request #3843 from Mousius/switch-ratio 2 years ago
  Chris Sidebottom 32f2fafde7 Propagate SWITCH_RATIO to DYNAMIC_ARCH builds 2 years ago
  Martin Kroeker 75d5e3eaf5
Replace ifdefs and fix conditional definitions for including only selected precisions in DYNAMIC_ARCH 2 years ago
  Martin Kroeker ee44082827
fix DYNAMIC_ARCH builds that use only a subset of precisions 2 years ago
  Honglin Zhu 4989e039a5 Define SBGEMM_ALIGN_K for DYNAMIC_ARCH build 2 years ago
  Honglin Zhu b00d5b9746 New sbgemm implementation for Neoverse N2 2 years ago
  Wangyang Guo 1d83ca4bca Small Matrix: support BFLOAT16 data type 4 years ago
  Wangyang Guo 478d1086c1 Small Matrix: support DYNAMIC_ARCH build 4 years ago
  Chen, Guobing a7b1f9b1bb Implementation of BF16 based gemv 5 years ago
  Martin Kroeker cb839575ed
Convert the prototypes of the unimplemented BFLOAT16 functions to the new naming scheme 5 years ago
  Martin Kroeker ca31c32693
Rename "HALF" and "sh" to "BFLOAT16" and "sb" 5 years ago
  Martin Kroeker 1c0b03efb4
Merge branch 'develop' into develop 5 years ago
  Martin Kroeker e396ec8b56
Allow building support for only a subset of variable types 5 years ago
  Martin Kroeker c5a32288c6
Work around sgemm_r/dgemm_r not being properly defined with BUILD_COMPLEX/BUILD_COMPLEX16 5 years ago
  Martin Kroeker b886bd672b
add defines for building a subset of types 5 years ago
  Chen, Guobing deaeb6c5b8 Add bfloat16 based dot and conversion with single/double 5 years ago
  Martin Kroeker 75eeb265d7
[WIP] Refactor the driver code for direct SGEMM (#2782) 5 years ago
  Martin Kroeker 5dd14e3d48
Make building the bfloat16 functions conditional on option BUILD_HALF (#2590) 5 years ago
  Rajalakshmi Srinivasaraghavan 67cc4b9e16 Fix warnings in clang and export symbol 5 years ago
  Rajalakshmi Srinivasaraghavan a87793e03c Fix DYNAMIC_ARCH compilation errors 5 years ago
  Rajalakshmi Srinivasaraghavan ac6a22ae78 Update header 5 years ago
  Rajalakshmi Srinivasaraghavan 7eb55504b1 RFC : Add half precision gemm for bfloat16 in OpenBLAS 5 years ago
  Martin Kroeker 5c42287c4f
Add declarations for ?sum and cblas_?sum 6 years ago
  Martin Kroeker 7e860acd38
Correct zgeadd_k prototype 7 years ago
  Isuru Fernando ca17b4b75c Fix complex support for MSVC headers 8 years ago
  Zhang Xianyi 69363622a8 Fix DYNAMIC_ARCH=1 bug. 10 years ago
  Martin Koehler 711ca33bc6 Improved Ximatcopy when lda==ldb. 10 years ago
  Martin Koehler 39cc6b21d3 Add ATLAS-style ?geadd function 10 years ago
  wernsaar f1b9a4a1ca Ref #454: fixed bug in common_param.h 11 years ago
  wernsaar 7aae4a62e7 enabled use of GEMM3M functions 11 years ago
  wernsaar 125610d23b allow to set custom value for ?GEMM_DEFAULT_UNROLL_MN, optimizations for syrk 11 years ago
  Timothy Gu 6c2ead30f0 Remove all trailing whitespace except lapack-netlib 11 years ago
  wernsaar cee257f384 Ref #51: added blas extensions zomatcopy and comatcopy 11 years ago
  wernsaar 7bfb3011e8 Ref #51: added blas extension somatcopy 11 years ago
  wernsaar 8c8f596238 Ref #51: added blas extension domatcopy as not opimized reference 11 years ago
  wernsaar faf3ac0aad Ref #285: added axpby kernels 11 years ago
  traits 9fc6764fa7 refs #55. Added DTB_ENTRIES into dynamic arch setting parameters. Now, it can read DTB_ENTRIES on runtime. 14 years ago
  Xianyi Zhang 342bbc3871 Import GotoBLAS2 1.13 BSD version codes. 14 years ago