9504 Commits (114316f36102e3500ec84c055acb37802afdb313)
 

Author SHA1 Message Date
  Martin Kroeker a86419fb66 Merge pull request #5280 from martin-frbg/zscal_x86_64 3 months ago
  Martin Kroeker 11ff18bb0f Merge pull request #5081 from XiWeiGu/kernel_generic_fixed_cscal_zscal 3 months ago
  Martin Kroeker 2e2691b34b Merge pull request #5078 from XiWeiGu/la64_fixed_cscal_zscal 3 months ago
  Martin Kroeker f4194fc65f Merge branch 'develop' into la64_fixed_cscal_zscal 3 months ago
  Martin Kroeker e12132abd4 Use generic C/ZSCAL kernels to address inf/nan handling for now 3 months ago
  Martin Kroeker 1cefbea7ea Use generic SCAL kernels to address inf/nan handling for now 3 months ago
  Sharif Inamdar 8279e68805 Optimize gemv_n_sve_v1x3 kernel 3 months ago
  Martin Kroeker f18b7a46bf add dummy2 flag handling for inf/nan agnostic zeroing 3 months ago
  Martin Kroeker fe220a0d7d Merge pull request #5291 from guoyuanplct/develop 3 months ago
  Martin Kroeker bbdc265798 Merge pull request #5294 from arnej27959/arnej/fix-arm64-register 3 months ago
  Arne Juul 5442aff218 Accumulate results in output register explicitly 3 months ago
  guoyuanplct 83fcab7578 Merge branch 'develop' of https://github.com/guoyuanplct/OpenBLAS into develop 3 months ago
  guoyuanplct 2ae019161a fixed the performance problem in RISCV64_ZVL256 when OPENBLAS_K is small 3 months ago
  Srangrang fb89820f20 Merge branch 'develop' of https://github.com/Srangrang/OpenBLAS into develop 4 months ago
  Srangrang 4e1a381e5b fix: resolve the compilation failure without zfh instruction 4 months ago
  Linjin Li fa2b08b378 Merge pull request #1 from gkdddd/riscv_shgemm 4 months ago
  gkdddd 670ec6f757 Added shgemm_kernel_8x8 for RISCV64_ZVL128B and shgemm_kernel_16x8 for RISCV64_ZVL256B 4 months ago
  Martin Kroeker 02267d86f5 Merge pull request #5288 from guoyuanplct/develop 4 months ago
  guoyuanplct d2003dc886 del lines 4 months ago
  guoyuanplct 45fd2d9b07 Optimized the axpby function. 4 months ago
  Srangrang 0a967797a1 Add FP16 support for RISCV 4 months ago
  Martin Kroeker fb8dc8ff5c Add dummy2 flag handling 4 months ago
  Srangrang 2996c25c94 add shgemm for RISCV_ZVL128B 4 months ago
  Martin Kroeker cf06250d36 add handling of dummy2 flag 4 months ago
  Martin Kroeker 28f8fdaf0f support flag for NaN/Inf handling and fix scaling of NaN/Inf values 4 months ago
  Martin Kroeker 669c847ceb support extra flag for NaN handling 4 months ago
  Martin Kroeker 0163143fdd Merge pull request #5278 from martin-frbg/fixup5276 4 months ago
  Martin Kroeker 20f2ba0141 Move declaration of i for pre-C99 compilers 4 months ago
  Martin Kroeker e2e6a4d90a Merge pull request #5276 from nakagawa-fj/gemm_2d_thread_partitioning 4 months ago
  Martin Kroeker 9ef5995c22 Merge pull request #5277 from martin-frbg/fixmingw32 4 months ago
  Martin Kroeker 42b7d1f897 Fix addressing of alpha in CBLAS 4 months ago
  Martin Kroeker bd573a9d38 Expand mingw32 gfortran workaround to all versions after 14.1 4 months ago
  Masato Nakagawa 2351a98005 Update 2D thread-partitioned GEMM for M << N case. 4 months ago
  Martin Kroeker a5f701c4ab Merge pull request #5274 from martin-frbg/issue5247 4 months ago
  Martin Kroeker 4ca76d9de4 Expressly provide a shared libs option 4 months ago
  Martin Kroeker 846a5436e7 Merge pull request #5273 from martin-frbg/issue5259 4 months ago
  Martin Kroeker 8779eac3b8 Do not add a 64 suffix to the library name if the user-provided suffix already contains it 4 months ago
  Martin Kroeker 3473118213 Merge pull request #5272 from martin-frbg/issue5271 4 months ago
  Martin Kroeker f2022c23ac Remove sve capability from NeoverseN1 and specify CortexX2/A?10 as arm8.4a 4 months ago
  Martin Kroeker b5456c1b41 Merge pull request #5260 from taoye9/enable_bf16_gemm_gemv_forward_on_arm64 4 months ago
  Martin Kroeker 5a322f21af Merge pull request #5268 from martin-frbg/fix-dyn-sgemmdirect 4 months ago
  Martin Kroeker 6680e0592f Fix conditional inclusion of SGEMM_KERNEL_DIRECT 4 months ago
  Martin Kroeker 0b0bb9951d Merge pull request #5265 from guoyuanplct/develop 4 months ago
  guoyuanplct 7732a55200 Add retry mechanism after deadlock timeout for c910v. 4 months ago
  guoyuanplct be9f7550b5 Format Code 4 months ago
  guoyuanplct 4d213653d8 kernel/riscv64:Added support for omatcopy on riscv64. 4 months ago
  Martin Kroeker 8afddc1a81 Merge pull request #5262 from guoyuanplct/develop 4 months ago
  guoyuanplct 9a7e3f102b kernel/riscv64:Fixed the bug of openblas_utest_ext failing in c/zgemv and some c/zgbmv tests: 4 months ago
  Martin Kroeker 5366902f9d Merge pull request #5261 from ErnstPeng/fix-lasx 4 months ago
  pengxu a978ad3180 Loongarch64: add C functions of zgemm_ncopy_16 4 months ago