2524 Commits (2c0dd2468e253ec7ecdabafcb15d5016a7218a12)

Author SHA1 Message Date
  Martin Kroeker 0d2e486edf
Handle NAN and INF 1 year ago
  Martin Kroeker 5f5b7c4f45
Merge pull request #4423 from martin-frbg/issue4422 1 year ago
  Martin Kroeker f31bea07dd
Merge pull request #4419 from martin-frbg/issue4413 1 year ago
  Martin Kroeker 20413ee6ec
Update zscal.c 1 year ago
  Martin Kroeker b57627c27f
Handle NAN and INF 1 year ago
  Martin Kroeker 995a990e24
Make AVX512 BFLOAT16 kernels conditional on compiler capability 1 year ago
  Martin Kroeker 7df363e1e2
temporarily disable the MSA C/ZSCAL kernels 1 year ago
  Chip-Kerchner 058dd2a4cb Replace two vector loads with one vector pair load and fix endianess of stores - DGEMM versions. 1 year ago
  Martin Kroeker 1c31f56e5a
Handle NAN 1 year ago
  Martin Kroeker 7ee1ee38e2
Handle NaN in input 1 year ago
  Martin Kroeker f637e12713
Handle INF and NAN 1 year ago
  Martin Kroeker 25b0c48082
Update zscal.c 1 year ago
  Martin Kroeker 5e7f714e93
Update zscal.c 1 year ago
  Martin Kroeker cf8b03ae8b
Use NAN rather than SNAN for portability 1 year ago
  Martin Kroeker f0808d856b
Handle NAN in input 1 year ago
  Martin Kroeker acf17a825d
Handle NAN in input 1 year ago
  Martin Kroeker c9df62e883
Fix handling of NAN 1 year ago
  Martin Kroeker def4996170
Fix handling of NAN and INF arguments 1 year ago
  Martin Kroeker 519b40fad9
Merge pull request #4398 from yinshiyou/la-dev 1 year ago
  pengxu a5d0d21378 loongarch64: Add zgemm and cgemm optimization 1 year ago
  gxw 546f13558c loongarch64: Add {c/z}swap and {c/z}sum optimization 1 year ago
  Hao Chen edabb93668 loongarch64: Refine axpby optimization functions. 1 year ago
  Hao Chen 1ec5dded43 loongarch64: Add c/zrot optimization functions. 1 year ago
  Hao Chen 3c53ded315 loongarch64: Add c/znrm2 optimization functions. 1 year ago
  Hao Chen fbd612f8c4 loongarch64: Add ic/zamin optimization functions. 1 year ago
  Hao Chen d97272cb35 loongarch64: Add c/zdot optimization functions. 1 year ago
  Hao Chen 65a0aeb128 loongarch64: Add c/zcopy optimization functions. 1 year ago
  Hao Chen 2a34fb4b80 loongarch64: Add and refine scal optimization functions. 1 year ago
  Hao Chen 8785e948b5 loongarch64: Add camin optimization function. 1 year ago
  Hao Chen 0753848e03 loongarch64: Refine and add axpy optimization functions. 1 year ago
  Hao Chen 06fd5b5995 loongarch64: Add and Refine asum optimization functions. 1 year ago
  guxiwei e771be185e Optimize copy functions with lsx. 1 year ago
  Hao Chen 179ed51d3b Add dgemm_kernel_8x4.S file. 1 year ago
  Hao Chen 173a65d4e6 loongarch64: Add and refine iamax optimization functions. 1 year ago
  zhoupeng ea70e165c7 loongarch64: Refine rot optimization. 1 year ago
  zhoupeng 116aee7527 loongarch64: Refine imin optimization. 1 year ago
  zhoupeng 8be2654193 loongarch64: Refine imax optimization. 1 year ago
  zhoupeng 154baad454 loongarch64: Refine iamin optimization. 1 year ago
  Shiyou Yin 36c12c4971 loongarch64: Refine copy,swap,nrm2,sum optimization. 1 year ago
  Shiyou Yin c6996a80e9 loongarch64: Refine amax,amin,max,min optimization. 1 year ago
  Chris Sidebottom ecae1389df Reduce duplication in kernel definitions 1 year ago
  Chris Sidebottom 60e66725e4 Use numeric labels to allow repeated inlining 1 year ago
  Chris Sidebottom 7a4fef4f60 Tweak SVE dot kernel 1 year ago
  Martin Kroeker f06b535566
Use C kernel for dgemv_t due to limitations of the old assembly one 1 year ago
  barracuda156 d9653af018 KERNEL.PPC970, KERNEL.PPCG4: unbreak CMake parsing 1 year ago
  Chip-Kerchner 93747fb377 Merge remote-tracking branch 'origin/develop' into power10Copies 1 year ago
  Chip-Kerchner 4e738e561a Replace two vector loads with one vector pair load and fix endianess of stores. 1 year ago
  yancheng d32f38fb37 loongarch64: Add optimizations for nrm2. 1 year ago
  yancheng f9b468990e loongarch64: Add optimizations for rot. 1 year ago
  yancheng c80e7e27d1 loongarch64: Add optimizations for sum and asum. 1 year ago