Martin Kroeker
1b90989662
Add NetBSD to the xBSD conditionals
6 years ago
Martin Kroeker
5f6206fa2d
Simplify OSX/IOS cross-compilation and add a CI test for it ( #2279 )
* Add automatic fixups for OSX/IOS cross-compilation
* Add OSX/IOS cross-compilation test to Travis CI
* Handle platforms that lack hwcap.h by falling back to ARMV8
* Fix PROLOGUE for OSX/IOS
6 years ago
Martin Kroeker
8617d75548
Revert "Avoid taking root of negative number in symv_thread.c"
6 years ago
Sebastian Berg
6355c25dde
Avoid taking root of negative number in symv_thread.c
This is similar to fixes in gh-1929, but there was one remaining
occurrence of this type of pattern in the driver/level2/*_thread.c
files.
6 years ago
Martin Kroeker
673e5a0495
Replace several POWER8/9 C kernels with their gcc7-generated assembly versions ( #2263 )
* Add gcc7-generated assembly files for POWER8/9 isa/ica-min/max and POWER9 caxpy
To work around internal compiler errors encountered when compiling the original C source with gcc 4 and 5, and wrong code generated by gcc 8.3.0
* Use gcc-generated assembly instead of original C sources
to work around internal compiler errors encountered with gcc 4.8/5.4 and wrong code generation by gcc 8.3
* Use gcc-generated assembly instead of the original C source
to work around internal compiler errors encountered with gcc 4.8 and 5.4, and wrong code generation by gcc 8.3
* Add gcc7-generated assembler version of caxpy for power8
to work around wrong code generated by gcc 8.3
* Handle CONJ define for caxpyc
* Add gcc7-generated assembly cdot for POWER9
* Use prebuilt assembly for POWER9 cdot
created with gcc 7.3.1 to work around ICE in older gcc versions
* Exclude POWER9 from DYNAMIC_ARCH when the gcc version is lower than 6
* Update Makefile.system
* Use PROLOGUE macro to ensure correct function name for DYNAMIC_ARCH
* Disable POWER9 with old gcc versions
6 years ago
Andrew
4de545aa7d
address minor warnings from gcc7
6 years ago
Martin Kroeker
bf1430f7d7
Merge pull request #2208 from martin-frbg/munmap-debug
Provide more information on mmap/munmap failure
6 years ago
Martin Kroeker
1776ad82c0
Add files via upload
6 years ago
Martin Kroeker
4e2f81cfa1
Provide more information on mmap/munmap failure
for #2207
6 years ago
Martin Kroeker
3d36c45116
Add CPUID identification of Intel Ice Lake
6 years ago
Martin Kroeker
21d05a4835
Merge pull request #2140 from martin-frbg/pgi19
Do not try ancient PGI hacks with recent versions of that compiler
6 years ago
Martin Kroeker
1778fd4219
Do not try ancient PGI hacks with recent versions of that compiler
should fix #2139
6 years ago
Martin Kroeker
86dda5c2fa
Add option USE_LOCKING for SMP-like locking in USE_THREAD=0 builds
6 years ago
Martin Kroeker
5cabda79d0
Merge pull request #2117 from martin-frbg/issue2114
Fix errors in cpu affinity setup with glibc 2.6
6 years ago
Martin Kroeker
a6a8cc2b7f
Fix errors in cpu enumeration with glibc 2.6
for #2114
6 years ago
Martin Kroeker
a387a23518
Merge pull request #2101 from luzpaz/misc-typos
Misc. typo fixes in comments and documentation
6 years ago
Martin Kroeker
b43c8382c8
Correct argument of CPU_ISSET for glibc <2.5
fixes #2104
6 years ago
luz.paz
daf2fec12d
Misc. typo fixes
Found via `codespell -q 3 -w -L ith,als,dum,nd,amin,nto,wis,ba -S ./relapack,./kernel,./lapack-netlib`
6 years ago
Jeff Baylor
40e53e52d6
snprintf define consolidated to common.h
6 years ago
Rashmica Gupta
bcdf1d4917
Add in runtime CPU detection for POWER.
6 years ago
Erik M. Bray
8ba9e2a61a
Also call CloseHandle on each thread, as well as on the event so as to not leak thread handles.
6 years ago
Erik M. Bray
4ad694eda1
Fix for #2063 : The DllMain used in Cygwin did not run the thread memory
pool cleanup upon THREAD_DETACH which is needed when compiled with
USE_TLS=1.
6 years ago
Martin Kroeker
3ce28fb81a
Merge pull request #2055 from martin-frbg/atomid
Add CPUID data for Intel Denverton (as Nehalem)
6 years ago
Martin Kroeker
04f2226ea6
Add Intel Denverton
6 years ago
Martin Kroeker
4741ce803b
Merge pull request #2045 from martin-frbg/2033-3
Do not compile in AVX512 check if AVX support is disabled
6 years ago
Martin Kroeker
11cfd0bd75
Do not compile in AVX512 check if AVX support is disabled
The xgetbv function depends on NO_AVX being undefined - we could change that too, but that combination is unlikely to work anyway
6 years ago
Martin Kroeker
d7b2c53c0b
Merge pull request #2039 from brada4/meminit
Address warning in memory.c
6 years ago
Martin Kroeker
10d841d8b9
Merge pull request #2026 from martin-frbg/trmv_threads
Correct range limiting in trmv_thread and re-enable TRMV multithreading
6 years ago
Martin Kroeker
6c83b878f6
Merge pull request #2040 from martin-frbg/locks2002
Restore locking optimizations for OpenMP case
6 years ago
Martin Kroeker
af480b02a4
Restore locking optimizations for OpenMP case
restore another accidentally dropped part of #1468 that was missed in #2004 to address performance regression reported in #1461
6 years ago
Andrew
e4a79be6bb
address warning introduced with #1814 et al
6 years ago
Martin Kroeker
45333d5793
Fix error introduced during cleanup
6 years ago
Martin Kroeker
78d9910236
Correct range_n limiting
same bug as seen in #1388 , somehow missed in corresponding PR #1389
6 years ago
Martin Kroeker
03a2bf2602
Fix potential memory leak in cpu enumeration on Linux ( #2008 )
* Fix potential memory leak in cpu enumeration with glibc
An early return after a failed call to sched_getaffinity would leak the previously allocated cpu_set_t. Wrong calculation of the size argument in that call increased the likelihood of that failure. Fixes #2003
6 years ago
Martin Kroeker
69edc5bbe7
Restore dropped patches in the non-TLS branch of memory.c ( #2004 )
* Restore dropped patches in the non-TLS branch of memory.c
As discovered in #2002 , the reintroduction of the "original" non-TLS version of memory.c as an alternate branch had inadvertently used ba1f91f rather than a8002e2 , thereby dropping the commits for #1450 , #1468 , #1501 , #1504 and #1520 .
6 years ago
caiyu
29dc72889f
Add support for Hygon Dhyana
6 years ago
Martin Kroeker
dbc9a060ef
Fix missing braces in support_av() call
6 years ago
Martin Kroeker
21c0f2af7b
Merge pull request #1957 from martin-frbg/issue1954
Move TLS key deletion to openblas_quit
6 years ago
Martin Kroeker
ad2c386d6a
Move TLS key deletion to openblas_quit
fixes #1954 (as suggested by thrasibule in that issue)
6 years ago
Martin Kroeker
31ed19e8b9
Add message for SkylakeX and KNL fallbacks to Haswell
6 years ago
Martin Kroeker
e1574fa2b4
Add xcr0 (os support) check
6 years ago
Martin Kroeker
ae1d1f74f7
Query AVX2 and AVX512 capability for runtime cpu selection
6 years ago
Martin Kroeker
8643521127
Merge pull request #1943 from martin-frbg/issue1748
Re-enable loop unrolling in trmv and remove the scary warning
7 years ago
Martin Kroeker
5a720cf9ca
Re-enable loop unrolling in trmv and remove the scary warning
fixes #1748 as that half of the fix for #1332 appears to have been an overreaction on my part.
7 years ago
Martin Kroeker
ccd5945d38
Merge pull request #1942 from martin-frbg/issue1720
Delete the pthread key on cleanup in TLS mode
7 years ago
Martin Kroeker
bba1e67269
Delete the pthread key on cleanup in TLS mode
to avoid a crash when OpenBLAS was loaded via dlopen and libc tries to clean up the leaked TLS after dlclose
Fixes #1720
7 years ago
Martin Kroeker
f343ed65b5
Avoid taking the root of a negative number
Fixes #1924 where numpy 1.17+ would report the (transient) FE_INVALID exception raised for the domain error.
7 years ago
Martin Kroeker
0bf6d74e5f
Fix typo in previous commit for arm dynamic arch
7 years ago
Martin Kroeker
2b355592e3
Make sure to use the arm version of dynamic.c in ARM64 DYNAMIC_ARCH
cf. #1908
7 years ago
Andrew
2601cd58ab
remove surplus locking code, only enabled with x86, disabled or never enabled on all others
7 years ago