76 Commits (2d0b2334259d41c2003b51a07580dbd25cfe267c)

Author SHA1 Message Date
  Marius Hillenbrand 22aa81f3e5 s390x: fix cscal and zscal implementations 5 years ago
  Marius Hillenbrand f91057cbad s390x: move common vector definitions and utils into header 5 years ago
  Marius Hillenbrand 2ee5b899ce s390x: enable S/DGEMM block with explicit loop unrolling + interleaving with clang 5 years ago
  Marius Hillenbrand 87e5bbd887 s390x: avoid variable-length arrays in struct for asm operands 5 years ago
  Marius Hillenbrand b9b3265ec8 s390x: avoid inline assembly for vector loads for clang 5 years ago
  Marius Hillenbrand a1616a0b86 s390x: replace nop with "nop 0" in inline assembly 5 years ago
  Marius Hillenbrand 60ef193258 s390x: use "lghi" for immediate values to fix build with clang 5 years ago
  Marius Hillenbrand 07c334e7be s390x: Factor out small block sizes for SGEMM/DGEMM on z14 5 years ago
  Marius Hillenbrand e2828e30aa s390x: Optimize SGEMM/DGEMM blocks for z14 with explicit loop unrolling/interleaving 5 years ago
  Marius Hillenbrand 89fe17f20e s390x: Use new sgemm kernel also for DGEMM and DTRMM on Z14 5 years ago
  Marius Hillenbrand bdd795ed03 s390x/GEMM: replace 0-init with peeled first iteration 5 years ago
  Marius Hillenbrand 2840432e49 s390x: improvise vector alignment hints for older compilers 5 years ago
  Marius Hillenbrand 1b0b4349a1 s390x/Z14: Change register blocking for SGEMM to 16x4 5 years ago
  Marius Hillenbrand 71b6eaf459 s390x: Use new sgemm kernel also for strmm on Z14 and newer 5 years ago
  Marius Hillenbrand 43c0d4f312 s390x: Add vectorized sgemm kernel for Z14 and newer 5 years ago
  int_13h 96ad579428 add in runtime cpu detection for zarch (#2349) 5 years ago
  Andreas Arnez d117dfd505 Change bad usage of "asum" to "sum" in ZARCH versions of ?sum 6 years ago
  Martin Kroeker 246ca29679
Add ZARCH implementation of ?sum 6 years ago
  maamountki 0a54c98b9d
[ZARCH] Modify constraints 6 years ago
  maamountki bec54ae366
[ZARCH] Fix caxpy 6 years ago
  maamountki f583674109
[ZARCH] Fix cgemv_t_4 6 years ago
  maamountki 77fe70019f
[ZARCH] Fix constraints and source code formatting 6 years ago
  maamountki 7039770165
[ZARCH] Undo the last commit 6 years ago
  maamountki 11a43e8116
[ZARCH] Set alignment hint for vl/vst 6 years ago
  maamountki 61526480f9
[ZARCH] Fix copy constraint 6 years ago
  maamountki 81daf6bc38
[ZARCH] Format source code, Fix constraints 6 years ago
  Martin Kroeker 874df65491
Fix incorrect sgemv results for IBM z14 6 years ago
  Martin Kroeker 877023e1e1
Fix precision of zarch DSDOT 6 years ago
  Martin Kroeker 265142edd5
Fix typo in the zarch min/max kernels 6 years ago
  maamountki 29416cb5a3
[ZARCH] Add Z13 version for max/min functions 6 years ago
  maamountki 48b9b94f7f
[ZARCH] Improve loading performance for camax/icamax 6 years ago
  maamountki fcd814a8d2
[ZARCH] Fix bug in max/min functions 6 years ago
  maamountki dc4d3bccd5
[ZARCH] Fix icamax/icamin 6 years ago
  maamountki c7143c1019
[ZARCH] Fix iamax/imax single precision 6 years ago
  maamountki 04873bb174
[ZARCH] Undo the last commit 6 years ago
  maamountki c8ef9fb220
[ZARCH] Fix bug in iamax/iamin/imax/imin 6 years ago
  maamountki b111829226
[ZARCH] Update max/min functions 6 years ago
  maamountki b815a04c87
[ZARCH] fix a bug in max/min functions 6 years ago
  maamountki 1a7925b3a3
[ZARCH] Update dgemv_n_4.c 6 years ago
  maamountki 406f835f00
[ZARCH] update cgemv_n_4.c 6 years ago
  maamountki 621dedb37b
[ZARCH] Update cgemv_t_4.c 6 years ago
  maamountki b731e8246f
Update sgemv_t_4.c 6 years ago
  maamountki ecc31b743f
Update dgemv_t_4.c 6 years ago
  maamountki 5d89d6b143
[ZARCH] fix sgemv_n_4.c 6 years ago
  maamountki 67432b23c2
[ZARCH] fix cgemv_n_4.c 6 years ago
  maamountki be66f5d5c2
[ZARCH] fix data prefetch type in sdot 6 years ago
  maamountki c2ffef8156
[ZARCH] fix data prefetch type in ddot 6 years ago
  maamountki e7455f500c
[ZARCH] fix dsdot.c 6 years ago
  maamountki 3eafcfa650
[ZARCH] fix cgemv_n_4.c 6 years ago
  maamountki 94cd946b96
[ZARCH] fix cgemv_n_4.c 6 years ago