2524 Commits (2c0dd2468e253ec7ecdabafcb15d5016a7218a12)

Author SHA1 Message Date
  pengxu f19e72c402 Loongarch64: fixed swap_lasx 5 months ago
  pengxu b471fa337b Loongarch64: fixed snrm2_lasx 5 months ago
  pengxu 57bb46bedf Loongarch64: fixed rot_lasx 5 months ago
  pengxu 6dc4ca2391 Loongarch64: fixed icamax_lasx 5 months ago
  pengxu b528b1b8ea Loongarch64: fixed iamax_lasx 5 months ago
  pengxu ba9569e382 Loongarch64: fixed dot_lasx 5 months ago
  pengxu dc5fa29851 Loongarch64: fixed cscal_lasx 5 months ago
  pengxu a98dd6d911 Loongarch64: fixed copy_lasx 5 months ago
  pengxu d49319c2d2 Loongarch64: fixed cnrm2_lasx 5 months ago
  pengxu 74c97ef814 Loongarch64: fixed cdot_lasx 5 months ago
  pengxu be525521ad Loongarch64: fixed asum_lasx 5 months ago
  pengxu 0cd5ca5527 Loongarch64: fixed amax_lasx 5 months ago
  guoyuanplct 11ffc8680e Format the code 5 months ago
  guoyuanplct 7616c42095 Optimized RVV_ZVL256B Implementation of zgemv_n 5 months ago
  abhishek-fujitsu 9c02cdb073 optimise dot using thread throttling for NEOVERSE V1 6 months ago
  Martin Kroeker d0e8fd6d40 Merge pull request #5239 from annop-w/gemv_n_sve 5 months ago
  Iha, Taisei 08b5c18d70 fixed a potential out-of-bounds on gemv. 5 months ago
  Annop Wongwathanarat e11744a411 Use SVE kernel for S/DGEMVN for SVE machines 5 months ago
  Martin Kroeker db0abfa907 Merge pull request #5238 from martin-frbg/revert5125 5 months ago
  Martin Kroeker 7389b6c483 Merge pull request #5237 from martin-frbg/revert5219 5 months ago
  Martin Kroeker 4ec62d7f73 remove non-vectorized code path for power8, restoring PR4880 5 months ago
  Martin Kroeker 1df8738f27 Merge pull request #5235 from quickwritereader/issue_unaligned_ppc64le 5 months ago
  Martin Kroeker 99d9f1ff38 Fix conditional 5 months ago
  Martin Kroeker 96d80801bc Reinstate the CooperLake microkernel 5 months ago
  Martin Kroeker 2e4309315c Merge pull request #5219 from martin-frbg/sbgemvn_cooper 5 months ago
  Ubuntu 0cc2485594 Explicit unaligned vector load/stores in PPC64LE GEMV kernels 5 months ago
  Martin Kroeker dd38b4e811 Merge pull request #5225 from annop-w/gemv_n 5 months ago
  Martin Kroeker 0241d516f6 Merge pull request #5220 from iha-taisei/sdgemv_n_unroll 5 months ago
  Annop Wongwathanarat d535728803 Improve performance for SGEMVN on NEONVERSEN1 6 months ago
  Usui, Tetsuzo d711906e3e Add symv kernels for arm64 5 months ago
  Iha, Taisei f1e628b889 Further performance improvements to [SD]GEMV. 5 months ago
  Martin Kroeker 211dfd0754 disable the CooperLake microkernel as it produces wrong results 5 months ago
  Martin Kroeker b30dc9701f Merge pull request #5215 from annop-w/gemv_t 5 months ago
  Martin Kroeker 2893d0add4 Merge pull request #5211 from guoyuanplct/develop 5 months ago
  Annop Wongwathanarat ec146157d3 Use SVE kernel for S/DGEMVT for SVE machines 6 months ago
  Martin Kroeker 70865a894e Merge pull request #5180 from ywwry66/openmp_use_cmake 6 months ago
  lglglglgy 1ff303f36e Optimizing the Implementation of GEMV on the RISC-V V Extension 6 months ago
  ColumbusAI 7bf848454d Update zsum.c -- fixed spelling error to successfully compile 6 months ago
  Vaisakh K V 04915be829 Add vector registers to clobber list to prevent compiler optimization. 6 months ago
  Egbert Eich ea6515c4b3 On zarch don't produce objects from assembler with a writable stack section 6 months ago
  Ruiyang Wu 02fd1df10b CMake: Pass `OpenMP` compiler and linker flags through CMake targets 6 months ago
  Ye Tao f27ba5efd1 fix bugs in aarch64 sbgemv_n kernel 6 months ago
  Annop Wongwathanarat edef2e4441 Fix bug in ARM64 sbgemv_t 6 months ago
  Martin Kroeker b55ca71d5b Merge pull request #5182 from annop-w/sgemm_ncopy 6 months ago
  Martin Kroeker 2f778554b8 Merge pull request #5181 from taoye9/change_sbgemn_cast_bf16 6 months ago
  Annop Wongwathanarat 9807f56580 Optimize aarch64 sgemm_ncopy 6 months ago
  Martin Kroeker a3e7b16072 Merge pull request #5157 from manaalmj/feature 6 months ago
  Ye Tao 4c00099ed6 replace customize bf16_to_fp32 with arm neon vcvtah_f32_bf16 6 months ago
  Annop Wongwathanarat a085b6c9ec Fix aarch64 sbgemv_t compilation error for GCC < 13 6 months ago
  manjam01 5c4e38ab17 Optimize gemv_n_sve kernel 7 months ago