Piotr Kubaj
eebfeba768
Fix build on FreeBSD/powerpc64.
Signed-off-by: Piotr Kubaj <pkubaj@anongoth.pl>
6 years ago
Martin Kroeker
7faf42b7bb
Merge pull request #2167 from kavanabhat/dtrmm_power8_segfault
Fix DTRMMKERNEL register save for power8 64-bit mode (Fix for #2166 )
6 years ago
kavanabhat
a575f1e4c7
Update dtrmm_kernel_16x4_power8.S
6 years ago
Martin Kroeker
2a43062de7
Merge pull request #2159 from martin-frbg/issue2149
Avoid unintentional activation of TLS codepath via USE_TLS=0
6 years ago
Martin Kroeker
4ea794a522
Avoid unintentional activation of TLS code via USE_TLS=0
fixes #2149
6 years ago
Martin Kroeker
ece0bfb881
Merge pull request #2158 from martin-frbg/issue2143
Remove any inadvertent use of -march=native from DYNAMIC_ARCH builds
6 years ago
Martin Kroeker
1f4b6a5d5d
Remove any inadvertent use of -march=native from DYNAMIC_ARCH builds
from #2143 , -march=native precludes use of more specific options like -march=skylake-avx512 in individual kernels, and defeats the purpose of dynamic arch anyway.
6 years ago
Martin Kroeker
be8f70d269
Merge pull request #2157 from martin-frbg/2154-2
Add gfortran workaround for potential ABI violation
6 years ago
Martin Kroeker
e674e1c735
Update fc.cmake
6 years ago
Martin Kroeker
6ca898b63b
Add gfortran workaround for potential ABI violation
for #2154
6 years ago
Martin Kroeker
26411acd56
Merge pull request #2148 from TiborGY/cpp_thread_test_2
Thread safety tester using C++11 threading (cleaned history)
6 years ago
Martin Kroeker
0ab4076dd8
Merge pull request #2156 from martin-frbg/issue2154
Add gfortran workaround for C->FORTRAN ABI violation
6 years ago
Martin Kroeker
a0caa762b3
Add gfortran workaround for ABI violations
for #2154 (see gcc bug 90329)
6 years ago
Martin Kroeker
900d5a3205
Add gfortran workaround for ABI violations in LAPACKE
for #2154 (see gcc bug 90329)
6 years ago
Martin Kroeker
a17cf36225
Merge pull request #2153 from quickwritereader/develop
improved power9 zgemm,sgemm
6 years ago
AbdelRauf
148c4cc5fd
conflict resolve
6 years ago
AbdelRauf
d0c3543c3f
power9 zgemm ztrmm optimized
6 years ago
Martin Kroeker
909ad04aef
Merge pull request #2145 from martin-frbg/1912-3
Separate implementations of AMAX and IAMAX on arm
6 years ago
Martin Kroeker
417efd41c6
Merge pull request #2110 from pc2/cpu-detection
Fix detection of Skylake processors when using GCC
6 years ago
Michael Lass
9cdc828afa
c_check: Unlink correct file
6 years ago
Michael Lass
7a9a4dbc4f
Fix detection of AVX512 capable compilers in getarch
21eda8b5
introduced a check in getarch.c to test if the compiler is capable of
AVX512. This check currently fails, since the used __AVX2__ macro is only
defined if getarch itself was compiled with AVX2/AVX512 support. Make sure this
is the case by building getarch with -march=native on x86_64. It is only
supposed to run on the build host anyway.
6 years ago
AbdelRauf
a469b32cf4
sgemm pipeline improved, zgemm rewritten without inner packs, ABI lxvx v20 fixed with vs52
6 years ago
Martin Kroeker
27649b9543
Document NO_AVX512
for #2151
6 years ago
TiborGY
16f3df5d35
add c++ thread test option to Makefile.rule
6 years ago
TiborGY
1aded69821
hook up c++ thread safety test (main Makefile)
6 years ago
TiborGY
c00289ba54
upload thread safety test folder
6 years ago
AbdelRauf
8fe794f059
improved zgemm power9 based on power8
6 years ago
Martin Kroeker
74c10b57c6
Use generic kernels for complex (I)AMAX to support softfp
6 years ago
Martin Kroeker
c5495d2056
Ensure correct output for DAMAX with softfp
6 years ago
Martin Kroeker
c70496b108
Separate implementations of AMAX and IAMAX on arm
As noted in #1912 and comment on #1942 , the combined implementation happens to "do the right thing" on hardfp, but cannot return both value and index on softfp where they would have to share the return register
6 years ago
Martin Kroeker
ca8d8835f5
Merge pull request #2144 from xianyi/revert-2142-issue1912-2
Revert "Add softfp support in min/max kernels"
6 years ago
Martin Kroeker
d76b20b4d2
Revert "Add softfp support in min/max kernels"
6 years ago
Martin Kroeker
85af04da3c
Merge pull request #2142 from martin-frbg/issue1912-2
Add softfp support in min/max kernels
6 years ago
Martin Kroeker
11e0dcbffb
Merge pull request #2141 from martin-frbg/issue1912
Build and run utests independently of fortran
6 years ago
Martin Kroeker
79366ff7a9
Add softfp support in min/max kernels
fix for #1912
6 years ago
Martin Kroeker
21d05a4835
Merge pull request #2140 from martin-frbg/pgi19
Do not try ancient PGI hacks with recent versions of that compiler
6 years ago
Martin Kroeker
940f38f6dd
Build and run utests in any case, they do their own checks for fortran availability
6 years ago
Martin Kroeker
1778fd4219
Do not try ancient PGI hacks with recent versions of that compiler
should fix #2139
6 years ago
Martin Kroeker
969dd6175e
Merge pull request #2136 from martin-frbg/issue2126
Add option to allow combining USE_THREAD=0 with thread locking support
6 years ago
Martin Kroeker
d8d5682481
Merge pull request #2134 from tylerjereddy/skylake_regress_guard_may14
TST: add SkylakeX AVX512 CI test
6 years ago
Martin Kroeker
f66c11fc22
Remove unrelated change
6 years ago
Martin Kroeker
5ecffc28f2
Add option USE_LOCKING but keep default settings intact
6 years ago
Martin Kroeker
86dda5c2fa
Add option USE_LOCKING for SMP-like locking in USE_THREAD=0 builds
6 years ago
Martin Kroeker
1e52572be3
Add option USE_LOCKING for single-threaded build with locking support
6 years ago
Martin Kroeker
d2cb610272
Add option USE_LOCKING for single-threaded build with locking support
for calling from concurrent threads
6 years ago
Tyler Reddy
a211bc9b6a
TST: add SkylakeX AVX512 CI test
* adapt the C-level reproducer code for some
recent SkylakeX AVX512 kernel issues, provided
by Isuru Fernando and modified by Martin Kroeker,
for usage in the utest suite
* add an Intel SDE SkylakeX emulation utest run to
the Azure CI matrix; a custom Docker build was required
because Ubuntu image provided by Azure does not support
AVX512VL instructions
6 years ago
Martin Kroeker
9208ab8603
Merge pull request #2130 from isuruf/drone
Drone CI for arm64 native builds
6 years ago
Isuru Fernando
b43deb4ad6
Fix typo
6 years ago
Isuru Fernando
b911525c81
arm32 build
6 years ago
Isuru Fernando
7ff44e0016
Remove qemu armv8 builds
6 years ago