9527 Commits (develop)
 

Author SHA1 Message Date
  Martin Kroeker c31861ea62
Merge pull request #5435 from martin-frbg/update_rvv_ci 4 weeks ago
  Martin Kroeker 57c2936a43
Merge branch 'OpenMathLib:develop' into update_rvv_ci 4 weeks ago
  Martin Kroeker 6d070820fc
Merge pull request #5436 from martin-frbg/update_osx_ci 4 weeks ago
  Martin Kroeker 1c7251ca20
remove the -llto_library option for any osx fortran compiler 4 weeks ago
  Martin Kroeker a1331406a3
drop (re)installation of cmake on osx runners 4 weeks ago
  Martin Kroeker c42fccccb5
Drop installation of cmake 4 weeks ago
  Martin Kroeker 4c1a4e60a6
Update toolchain to its latest nightly build 4 weeks ago
  Martin Kroeker 06c09deee9
Merge pull request #5426 from hideaki-motoki/issue5417_axpy_sve 1 month ago
  Martin Kroeker da7d0f4a38
Merge pull request #5427 from yuanjia111/develop 1 month ago
  yuanjia c2cc7a3602 riscv64: optimize gemv_t_vector.c 1 month ago
  h-motoki e23f9c6642 Merge remote-tracking branch 'upstream/develop' into issue5417_axpy_sve 1 month ago
  Martin Kroeker b3f247ae5a
Merge pull request #5425 from martin-frbg/fixup5389 1 month ago
  h-motoki 855945befb Implementing SVE in [SD]AXPY Kernels for A64FX and Graviton3E 1 month ago
  Martin Kroeker 7c1839899e
Increase assumed L2 sizes for RISCV X280 / ZVL256B and for SVE-capable ARM64 1 month ago
  Martin Kroeker 9c43301b6d
Merge pull request #5421 from reibax-marcus/develop 1 month ago
  Martin Kroeker 9d6df1dd3e
Merge pull request #5422 from ChipKerchner/addRVVVectorizedPacking 1 month ago
  Martin Kroeker f3b2a15fad
Merge pull request #5420 from yuanjia111/develop 1 month ago
  Chip Kerchner 64401b4417 Disable vectorized packing for DGEMM - since it is slower than scalar. 1 month ago
  Martin Kroeker 5e43ba948c
Merge pull request #5419 from Mousius/bgemm-optimisation 1 month ago
  Chip Kerchner c00afc86a6 Add and use vectorized packing to ZVL128B and ZVL256B. Up to 3x+ faster than generic scalar functions. 1 month ago
  Xabier Marquiegui 3a6b79c50f fix: broken cblas installation when using makefile based builds 1 month ago
  yuanjia 803e8d4838 Move the value assignment of vector x in gemv_n_sve.c to the outermost loop to reduce the repeated data retrieval. 1 month ago
  Chris Sidebottom 5f47b872f1 Remove older kernels for BGEMM on NEOVERSEV1 1 month ago
  Chris Sidebottom 114316f361 Optimize SBGEMM / BGEMM for NEOVERSEV1 further 1 month ago
  Martin Kroeker 75c6ab4036
CI: Update WoA job to use LLVM 20.1.8 and avoid stray preinstalled LLVM19 (#5411) 1 month ago
  Martin Kroeker 5c5f852ee3
Merge pull request #5415 from martin-frbg/Fixum-5399 1 month ago
  Martin Kroeker f1ee61ea30
Include NEON header for the bfloat conversion functions 1 month ago
  Martin Kroeker b3ffd5524a
Include NEON header for the bfloat conversion functions 1 month ago
  Martin Kroeker d23680b81d
Merge pull request #5407 from nakagawa-fj/feature/gemm_divide_rate_for_neoversev1 2 months ago
  Martin Kroeker b4cc4be2ce
Merge pull request #5410 from martin-frbg/issue5404 2 months ago
  Martin Kroeker 0968dddf1a
Merge pull request #5409 from martin-frbg/issue5372 2 months ago
  Martin Kroeker eddfe1e6b3
Merge pull request #5408 from ChipKerchner/fixRISCV64GEMVInitializationAndWarnings 2 months ago
  Martin Kroeker 30d11bc92c
Adjust multithreading threshold and add an intermediate step 2 months ago
  Martin Kroeker a3b9c933c5
mark xbuffer as volatile to work around gcc15.1 optimizer bug 2 months ago
  Chip Kerchner 72f082f31d Fix bad vector zero initializer and other compiler warnings for RISC-V. 2 months ago
  Masato Nakagawa 7e29f11396 Multi-thread GEMM Performance Improvement on NeoverseV1 (DIVIDE_RATE=1) 2 months ago
  Martin Kroeker 9a64b32b44
Merge pull request #5406 from martin-frbg/fixbgemmtest 2 months ago
  Martin Kroeker b66a01f909
Fix building of bgemm tests on GEMM3M-capable (x86) targets 2 months ago
  Martin Kroeker a5e7c0e3e0
Merge pull request #5396 from abhishek-iitmadras/abhishekk_bfloat16 2 months ago
  abhishek-fujitsu 6356190d06 fix gfortran link path in dynamic_arch.yml 2 months ago
  abhishek-fujitsu 4c8dcb3a8f Darwin/arm64: disable SVE/SME and fix gfortran link path 2 months ago
  Martin Kroeker 33b50548eb
Merge pull request #5403 from martin-frbg/issue5402 2 months ago
  Martin Kroeker c504aedca1
Merge pull request #5400 from Mousius/neoversev2-target 2 months ago
  Martin Kroeker b9e107932a
add NeoverseV2 2 months ago
  Martin Kroeker 2f89a5970e
fix NeoverseV2 typo 2 months ago
  Martin Kroeker a9e8fa06bf
Introduce a (crude) threshold to multithreading 2 months ago
  Martin Kroeker b4c2b34a45
Merge pull request #5401 from martin-frbg/followup-5397 2 months ago
  Martin Kroeker c9204f7b6f
Merge pull request #5399 from Mousius/bgemm-8x4 2 months ago
  Martin Kroeker a55e65dba9
Merge pull request #5391 from martin-frbg/issue5387 2 months ago
  abhishek-fujitsu 0bc79da587 add neon header 2 months ago