9428 Commits (b37516add6e1d39ddeeb7139e6288e5259794e17)
 

Author SHA1 Message Date
  Martin Kroeker b37516add6
Add BGEMM parameters 2 months ago
  Martin Kroeker d030f81380
Merge pull request #5369 from martin-frbg/lapack1144 2 months ago
  Martin Kroeker b746f0eda3
Allocate IWORK to hold at least the one element for workspace queries 2 months ago
  Martin Kroeker b8f66ba0ee
Merge pull request #5367 from Mousius/bgemm-init 2 months ago
  Martin Kroeker cdebb4fd4b
Merge pull request #5365 from martin-frbg/issue5324 2 months ago
  Martin Kroeker ff614575c9
Fix arm64 HAVE_SME setting for DYNAMIC_ARCH builds 2 months ago
  Martin Kroeker 0e11537cab
Merge pull request #5357 from Mousius/bgemm-init 2 months ago
  Chris Sidebottom 8cd4be8d47 Temporarily disable test_bgemm 2 months ago
  Chris Sidebottom 66d9185ebe Fix CMake support 2 months ago
  Martin Kroeker 98aefb70b4
Merge pull request #5292 from isharif168/optimized_gemv_n_1x3 2 months ago
  Martin Kroeker fd37406817
Merge branch 'develop' into optimized_gemv_n_1x3 2 months ago
  Chris Sidebottom 48394384ef Use correct constants for per-target BGEMM/SBGEMM 2 months ago
  Chris Sidebottom 73bf0b941a Add bgemm to gensymbol 2 months ago
  Chris Sidebottom f95e7b0e32 Add infrastructure for BGEMM 3 months ago
  Martin Kroeker 15d6e58510
Merge pull request #5364 from martin-frbg/blashalf 2 months ago
  Martin Kroeker 04bb5acd79
change BLAS_HALF to BLAS_BFLOAT16 (another missed rename) 2 months ago
  Martin Kroeker 3d31887073
Merge pull request #5362 from Mousius/fix-bf16 2 months ago
  Martin Kroeker 0ddf8ebd42
Merge pull request #5354 from pratiklp00/p11 2 months ago
  Martin Kroeker d2ea9bbb6d
Merge pull request #5363 from guoyuanplct/develop 2 months ago
  guoyuanplct 4ff549a450
Update CONTRIBUTORS.md 2 months ago
  guoyuanplct 309c48e327
Update CONTRIBUTORS.md 2 months ago
  Chris Sidebottom 552e1c7a7a Correct compiler flags for NEOVERSEV1 target 2 months ago
  Chris Sidebottom 46b9b7a080 Also enable BFLOAT16 for make cirun 2 months ago
  Chris Sidebottom eaaa628af2 Enable BUILD_BFLOAT16 in cirun 2 months ago
  Chris Sidebottom 7a97c4ca97 Rename HALF -> BFLOAT16 in some more places 2 months ago
  Martin Kroeker ee6560c89f
Merge pull request #5360 from sertonix/cpuid-arm 2 months ago
  Sertonix 8d11e4630c Fix cpuid.S on arm 2 months ago
  Martin Kroeker 03a4afcf14
Merge pull request #5359 from martin-frbg/gitign_isnan 2 months ago
  Martin Kroeker 901de8f33a
remove lapacke_mangling.h and add la_xisnan.mod 2 months ago
  Martin Kroeker ce6991780a
Merge pull request #5356 from ilina-linaro/ilina-woa 2 months ago
  Martin Kroeker df013c5e28
Merge pull request #5358 from iha-taisei/dot_unroll 2 months ago
  Iha, Taisei f7ad906b49 Performance improvements of [SD]DOT with loop-unrolling on A64FX 3 months ago
  Lina Iyer 7f360001f9
Update README.md to include Windows on Arm64 3 months ago
  Martin Kroeker 36c2589d3a
Merge pull request #5355 from tetsuzo-usui/add_parallel_laed3 3 months ago
  Usui, Tetsuzo 14107e37d9 Add parallel laed3 3 months ago
  Martin Kroeker a06bcf836b
Merge pull request #5353 from nakagawa-fj/feature/gemm_divide_rate_for_A64FX 3 months ago
  Masato Nakagawa 5253c8f165 Multi-thread Performance Improvement of GEMM with DIVIDE_RATE=1 for 3 months ago
  Martin Kroeker 8f0a1a3f82
Merge pull request #5303 from martin-frbg/issue5289 3 months ago
  Martin Kroeker 2c0dd2468e
Merge pull request #5350 from martin-frbg/issue5341 3 months ago
  Martin Kroeker 7ae24d0b85
Merge pull request #5351 from martin-frbg/lapack1140 3 months ago
  Martin Kroeker 5aeca597fe
Fix documentation error and ordering bug (Reference-LAPACK PR 1140) 3 months ago
  Martin Kroeker dcb289539b
Merge pull request #5344 from MaartenBaert/fix-dlasd7 3 months ago
  Martin Kroeker 9bcffbd655
Declare the server_lock mutex volatile in addition to static 3 months ago
  Martin Kroeker 334cd242d4
Merge pull request #5348 from hideaki-motoki/issue5343_prefered_size_for_a64fx 3 months ago
  h-motoki bba75d5e45 GEMM_PREFERED_SIZE parameter has been changed for A64FX. 3 months ago
  Martin Kroeker 4062c10370
Merge pull request #5345 from OpenMathLib/revert-5251-issue5250 3 months ago
  Martin Kroeker b78d1dc0ae
Merge pull request #5342 from martin-frbg/cmake_ampere 3 months ago
  Martin Kroeker 83a01d29ca
Revert "Fix out-of-bounds accesses in ?/SCAL/?GEEV triggered by preceding errrors/invalid inputs" 3 months ago
  Martin Kroeker 560fa88c96
Add cross-build parameters for Ampere One 3 months ago
  Martin Kroeker 55bb5ef867
Add compiler options for Ampere One 3 months ago