994 Commits (1309711e243ee945908b0c6139e9ea35c12e97f1)

Author SHA1 Message Date
  Martin Kroeker ba8388cee0
Merge pull request #1651 from martin-frbg/avx512-nodgemm 7 years ago
  Martin Kroeker 6e54b0a027
Disable the 16x2 DTRMM kernel on SkylakeX as well 7 years ago
  Martin Kroeker 40c8cbc3bf
Merge pull request #1650 from martin-frbg/avx512-nodgemm 7 years ago
  Martin Kroeker f0a8dc2eec
Disable the AVX512 DGEMM kernel for now 7 years ago
  Martin Kroeker b83e4c60c7
Remove premature exit for INC_X or INC_Y zero 7 years ago
  Martin Kroeker e344db269b
Remove premature exit for INC_X or INC_Y zero 7 years ago
  Martin Kroeker 545b82efd3
Remove premature exit for INC_X or INC_Y zero 7 years ago
  Martin Kroeker e322a951fe
Remove premature exit for INC_X or INC_Y zero 7 years ago
  Martin Kroeker c628c6fa59
Merge pull request #1612 from oon3m0oo/cpus 7 years ago
  Martin Kroeker 6f71c0fce4
Return a somewhat sane default value for L2 cache size if cpuid retur… (#1611) 7 years ago
  Craig Donner c2545b0fd6 Fixed a few more unnecessary calls to num_cpu_avail. 7 years ago
  Arjan van de Ven 89372e0993 Use AVX512 also for DGEMM 7 years ago
  Martin Kroeker 0023515733
Typo fix (misplaced parenthesis) 7 years ago
  Arjan van de Ven 99c7bba8e4 Initial support for SkylakeX / AVX512 7 years ago
  Martin Kroeker 8562d5787a
Merge pull request #1583 from martin-frbg/issue1575 7 years ago
  Martin Kroeker 7df8c4f76f
typo fix 7 years ago
  Martin Kroeker 2fc748bf72
Restore optimized swap kernel now that we have a proper fix 7 years ago
  Martin Kroeker d1b7be14aa
Handle INCX=0,INCY=0 case 7 years ago
  Martin Kroeker 961d25e9c7
Use the new zrot.c on POWER8 for crot as well 7 years ago
  Martin Kroeker f5959f2543
Merge pull request #1567 from martin-frbg/mipstrmm 7 years ago
  Martin Kroeker 82012b960b
Revert " Switch mips32 target to USE_TRMM to fix complex TRMM" 7 years ago
  Martin Kroeker 8dd3515fa2
Merge pull request #1565 from martin-frbg/mipstypo 7 years ago
  Martin Kroeker 95f7f0229c
Remove extraneous brace from previous commit 7 years ago
  Martin Kroeker 5082fe4306
Merge pull request #1564 from martin-frbg/issue1563 7 years ago
  Martin Kroeker 7a7619af6d
Revert changes from PR#1419 7 years ago
  Martin Kroeker 893b535540
Use correct data type for initializers of v2f64, v4f32 7 years ago
  Martin Kroeker 018f2dad27
Switch mips32 target to USE_TRMM to fix complex TRMM 7 years ago
  Martin Kroeker 9d5098dbc9
Add MIPS 1004K target (Mediatek MT7621 SOC) 7 years ago
  Martin Kroeker 954f1832de
Merge pull request #1540 from martin-frbg/mips32-zasum 7 years ago
  Martin Kroeker 941ad280a8
Fix typo in MIPS P5600 complex ASUM code selection 7 years ago
  Martin Kroeker 1da365312a
Merge pull request #1538 from martin-frbg/arm7utest 7 years ago
  Martin Kroeker 2d0929fa7c
Move the test for zero incx,incy in ARMV7 ROT 7 years ago
  Martin Kroeker 125343cc88
Drop test for zero incx,incy in armv7 AXPY 7 years ago
  Martin Kroeker 8a3b6fa108
Use generic zrot.c on ppc64/POWER6 to work around utest failure from … (#1535) 7 years ago
  Martin Kroeker 9c5518319a
Revert "Fix 32bit HASWELL builds" 7 years ago
  Martin Kroeker 2ca0faf495
Merge pull request #1515 from martin-frbg/mipsdot 7 years ago
  Martin Kroeker 0fe434598b
Fix precision of mips dsdot 7 years ago
  Martin Kroeker c7b55b6082
Merge pull request #1499 from quickwritereader/develop 7 years ago
  Martin Kroeker 840e01061f
Merge pull request #1491 from martin-frbg/ddot_mt 7 years ago
  QWR QWR 28ca97015d power8:Added initial zgemv_(t|n) ,i(d|z)amax,i(d|z)amin,dgemv_t(transposed),zrot 7 years ago
  Martin Kroeker 6a6ffaff1e
Merge pull request #1494 from martin-frbg/x86_dsdot 7 years ago
  Martin Kroeker 28ac9ea5a6
Use generic/dot.c instead of the inferior arm/dot.c for x86 DSDOT 7 years ago
  Martin Kroeker a55694dd5b
Declare dot_compute static to avoid conflicts in multiarch builds 7 years ago
  Martin Kroeker 85a41e9cdb
Add multithreading support for Haswell DDOT 7 years ago
  Martin Kroeker 81215711a2
Re-enable DAXPY microkernels for x86_64 7 years ago
  Martin Kroeker 22167170b3
Merge pull request #1477 from quickwritereader/develop 7 years ago
  Ashwin Sekhar T K fa9ca65c0e ARM64: Fix utest dsdot errors 7 years ago
  Martin Kroeker 719b68f077
Merge pull request #1473 from martin-frbg/p2align 7 years ago
  Martin Kroeker fe9f15f2d8
Merge pull request #1472 from martin-frbg/utest-fixes 7 years ago
  Martin Kroeker 497f0c3d8a
Replace .align with .p2align in the Nehalem microkernels 7 years ago