Martin Kroeker
6ca898b63b
Add gfortran workaround for potential ABI violation
for #2154
6 years ago
Michael Lass
7a9a4dbc4f
Fix detection of AVX512 capable compilers in getarch
21eda8b5
introduced a check in getarch.c to test if the compiler is capable of
AVX512. This check currently fails, since the used __AVX2__ macro is only
defined if getarch itself was compiled with AVX2/AVX512 support. Make sure this
is the case by building getarch with -march=native on x86_64. It is only
supposed to run on the build host anyway.
6 years ago
Martin Kroeker
1e52572be3
Add option USE_LOCKING for single-threaded build with locking support
6 years ago
luz.paz
daf2fec12d
Misc. typo fixes
Found via `codespell -q 3 -w -L ith,als,dum,nd,amin,nto,wis,ba -S ./relapack,./kernel,./lapack-netlib`
6 years ago
Martin Kroeker
ccfb7ead15
Merge pull request #2072 from martin-frbg/sum
Add (C)BLAS extension ?sum
6 years ago
Martin Kroeker
e06b8438b4
Merge pull request #2080 from martin-frbg/issue2075
Add -lm and disable EXPRECISION support on *BSD
6 years ago
Martin Kroeker
9229d6859b
Add -lm and disable EXPRECISION support on *BSD
fixes #2075
6 years ago
Martin Kroeker
d17da6c6a4
Add cmake defaults for ?sum kernels
6 years ago
Martin Kroeker
1679de5e59
Detect 32bit environment on 64bit ARM hardware
for #2056, using the same approach as #2058
6 years ago
Sacha
c3e30b2bc2
Change 64-bit detection as explained in #2056
6 years ago
Martin Kroeker
fd34820b99
Fix AVX512 test always returning false due to missing compiler option
6 years ago
Martin Kroeker
5952e586ce
Support DYNAMIC_LIST option in cmake
e.g. cmake -DDYNAMIC_ARCH=1 -DDYNAMIC_LIST="NEHALEM;HASWELL;ZEN" ..
original issue was #1639
6 years ago
Martin Kroeker
58dd7e4501
Change ARMV8 target to ARMV7 for BINARY=32
6 years ago
Martin Kroeker
802f0dbde1
More fixes for cross-compiling ARM64 targets
Fixed core naming for DYNAMIC_ARCH. Corrected GEMM_DEFAULT entries and added SYMV_P. Replaced outdated VULCAN define for ThunderX2T99 with ARMV8 to get basic definitions back. For issue #1908
6 years ago
Martin Kroeker
20d1aad13f
Fix missing quotes around thunderx targets
6 years ago
Martin Kroeker
e1eab96502
Merge pull request #1931 from martin-frbg/pr1921
Add -mavx2 to TARGET=HASWELL builds
6 years ago
Martin Kroeker
76b4b8980f
Use -dumpversion with gcc only
6 years ago
Martin Kroeker
49e0f485da
Add -mavx2 for TARGET=HASWELL if compiler supports and requires it
6 years ago
Martin Kroeker
26a3402773
Reflect ARMV8 target definition changes from PR1876
and create config target directory for cross-compiles.
6 years ago
Martin Kroeker
133c278ee5
Add DYNAMIC_CORE list for ARM64
cf. #1908
6 years ago
Martin Kroeker
dceff5542c
Handle Android environments that identify as Linux (#1898)
* Handle Android environments that identify as Linux
The Termux terminal emulator does this, causing build failures through missed defines in common.h
6 years ago
Martin Kroeker
081ceb3e02
Propagate version number for openblas_get_config
6 years ago
Andrew
40cce0e353
handle cmake too
7 years ago
Martin Kroeker
2263d3906c
Merge pull request #1812 from martin-frbg/issue1806-2
Use KERNEL_DEFINITIONS rather than COMMON_OPTS to pass -march=skylake…
7 years ago
Martin Kroeker
81c9985c3a
Use KERNEL_DEFINITIONS rather than COMMON_OPTS to pass -march=skylake-avx512
7 years ago
Martin Kroeker
56ebc7b53e
Merge pull request #1808 from martin-frbg/issue1806
Add -march=skylake-avx512 to CFLAGS when the target is Skylake
7 years ago
Martin Kroeker
8a11ec19d1
Syntax fix
7 years ago
Martin Kroeker
fa53b903db
Add -march=skylake-avx512 to CFLAGS when the target is Skylake
Should fix #1806 and #1801
7 years ago
Martin Kroeker
84bcdf9c66
Revert "Add -march=skylake-avx512 when required"
7 years ago
Martin Kroeker
a9b51b8448
Merge pull request #1798 from martin-frbg/cmake-avx512
Add -march=skylake-avx512 when required
7 years ago
Martin Kroeker
eba394c711
Add -march=skylake-avx512 when required
fixes #1797
7 years ago
Martin Kroeker
02ef20a1e4
Merge pull request #1786 from martin-frbg/immintrin
Check for immintrin.h presence in the AVX512 compatibility test as well
7 years ago
Martin Kroeker
4c3643ed7f
Check availability of immintrin.h in the AVX512 compatibility test
7 years ago
Yuri
2349e15149
Allow installing the 'interface64' version concurrently with the regular version
7 years ago
Martin Kroeker
b1b743f434
Merge branch 'develop' into interim033
7 years ago
Martin Kroeker
2a589c4b28
Add USE_TLS option to switch between old and new memory.c
7 years ago
Martin Kroeker
25f2d25cfe
Merge pull request #1697 from martin-frbg/issue1696
Do not treat Windows UWP builds as cross-compiling
7 years ago
Martin Kroeker
73131fa30a
Do not treat Windows UWP builds as cross-compiling
7 years ago
Martin Kroeker
b74aef2816
Add -march=skylake-avx512 to AVX512 compile check and suppress its output
7 years ago
Martin Kroeker
26e1cfb653
Merge pull request #1607 from martin-frbg/dynarch
Move some x86_64 DYNAMIC_ARCH targets to new DYNAMIC_OLDER option
7 years ago
Martin Kroeker
02634b549b
Add template for OpenBLASConfig.cmake
7 years ago
Martin Kroeker
1cbd8f3ae4
Move some DYNAMIC_ARCH targets to new DYNAMIC_OLDER option
7 years ago
Martin Kroeker
cf234a0561
Merge pull request #1589 from fenrus75/skylakex
Initial support for SkylakeX / AVX512
7 years ago
Martin Kroeker
e4718b1fee
Better AVX512 test case
7 years ago
Martin Kroeker
7fb62aed7e
Check build system support for AVX512 instructions
7 years ago
Arjan van de Ven
99c7bba8e4
Initial support for SkylakeX / AVX512
This patch adds the basic infrastructure for adding the SkylakeX (Intel Skylake server)
target. The SkylakeX target will use the AVX512 (AVX512VL level) instruction set,
which brings 2 basic things:
1) 512 bit wide SIMD (2x width of AVX2)
2) 32 SIMD registers (2x the number on AVX2)
This initial patch only contains a trivial transformation of the Haswell SGEMM kernel
to AVX512VL; more will follow later but this patch aims to get the infrastructure
in place for this "later".
Full performance tuning has not been done yet; with more registers and wider SIMD
it's in theory possible to retune the kernels but even without that there's an
interesting enough performance increase (30-40% range) with just this change.
7 years ago
Martin Kroeker
6791294312
Merge pull request #1559 from martin-frbg/buildconf
Add build-time configuration options to pkgconfig file
7 years ago
Martin Kroeker
7d7564568c
Add build-time configuration options to pkgconfig file
7 years ago
Zhiyong Dang
1b83341d19
Fix race condition in blas_server_omp.c
Change-Id: Ic896276cd073d6b41930c7c5a29d66348cd1725d
7 years ago
Sacha
f81815e48a
Fix CMake cross-compiling
Without specifying thread count, NUM_THREADS would not be defined and CMake would fail.
This is because core count cannot be determined when cross-compiling.
7 years ago