38 Commits (7c1839899e81829b096c62e73804d6859a0beed1)

Author SHA1 Message Date
  Chris Sidebottom e105411460 Add infrastructure for bgemv/bscal 2 months ago
  Chris Sidebottom f95e7b0e32 Add infrastructure for BGEMM 3 months ago
  Usui, Tetsuzo 14107e37d9 Add parallel laed3 3 months ago
  Srangrang 0a967797a1 Add FP16 support for RISCV 4 months ago
  tingbo.liao 3c8df6358f Further rearranged the rotm kernel for the different architectures. 8 months ago
  Martin Kroeker 103637887e add cblas_?gemm_batch 1 year ago
  Jiaxun Yang fa14bdb26d Entitle missing declearation for alpha 3 years ago
  Egbert Eich 5e6d160020 Do not include symbols defined in driver/others/parameter.c in DYNAMIC_ARCH 3 years ago
  Martin Kroeker bc93f468ef Add Elbrus E2000 architecture as generic x86_64 compatible 3 years ago
  Wangyang Guo 1d83ca4bca Small Matrix: support BFLOAT16 data type 4 years ago
  Wangyang Guo 5dc7c3c8e5 Small Matrix: add GEMM_SMALL_MATRIX_PERMIT to tune small matrics case 4 years ago
  Xianyi Zhang 57ed58cefe Refs #2587 Add small matrix optimization reference kernel for c/zgemm. 5 years ago
  Xianyi Zhang 17d32a4a82 Change a1b0 gemm to b0 gemm. 5 years ago
  Xianyi Zhang be3349405d Add alpha=1.0 beta=0.0 for small gemm. 5 years ago
  Xianyi Zhang 0a2077901c Add small marix optimization kernel interface. 5 years ago
  gxw af0a69f355 Add support for LOONGARCH64 4 years ago
  Chen, Guobing a7b1f9b1bb Implementation of BF16 based gemv 5 years ago
  Rajalakshmi Srinivasaraghavan b5d30b390d Fix build issues with bfloat16 5 years ago
  Martin Kroeker 629c497b6c common_sh.h renamed to common_sb.h 5 years ago
  Martin Kroeker ca31c32693 Rename "HALF" and "sh" to "BFLOAT16" and "sb" 5 years ago
  Chen, Guobing deaeb6c5b8 Add bfloat16 based dot and conversion with single/double 5 years ago
  Martin Kroeker 7dbb59b256 Update common_macro.h 5 years ago
  Martin Kroeker c7d668c248 Update common_macro.h 5 years ago
  Martin Kroeker e7afe8a969 Define AXPBY_K fallback for float16 5 years ago
  Rajalakshmi Srinivasaraghavan 22bb50fb81 cmake fixes 5 years ago
  Rajalakshmi Srinivasaraghavan 7eb55504b1 RFC : Add half precision gemm for bfloat16 in OpenBLAS 5 years ago
  Guillaume Horel c7b5a459b6 add missing defines and headers 6 years ago
  Guillaume Horel ea747cf933 start working on ?trtrs 6 years ago
  Martin Kroeker 5c42287c4f Add declarations for ?sum and cblas_?sum 6 years ago
  Ashwin Sekhar T K 4713e7c47f ARM64: Add the VULCAN Target 9 years ago
  Martin Koehler 711ca33bc6 Improved Ximatcopy when lda==ldb. 10 years ago
  Martin Koehler 39cc6b21d3 Add ATLAS-style ?geadd function 10 years ago
  wernsaar cee257f384 Ref #51: added blas extensions zomatcopy and comatcopy 11 years ago
  wernsaar 7bfb3011e8 Ref #51: added blas extension somatcopy 11 years ago
  wernsaar 8c8f596238 Ref #51: added blas extension domatcopy as not opimized reference 11 years ago
  wernsaar faf3ac0aad Ref #285: added axpby kernels 11 years ago
  Xianyi Zhang 4727fe8abf Refs #47. On Loongson 3A, set DGEMM_R parameter depending on different number of threads. It would improve double precision BLAS3 on multi-threads. 14 years ago
  Xianyi Zhang 342bbc3871 Import GotoBLAS2 1.13 BSD version codes. 14 years ago