Vaisakh K V
d23eb3b93e
Support for SME1 based sgemm_direct kernel for cblas_sgemm level 3 API
* Added ARMV9SME target
* Added SGEMM_DIRECT kernel based on SME1
10 months ago
Rajalakshmi Srinivasaraghavan
db0805906b
powerpc: Fix build errors with Open XL C
This patch fixes errors when using Open XL C compiler on AIX.
Tested with gcc/xlf and ibm-clang/xlf compiler combinations.
2 years ago
Martin Kroeker
7a6203ffa1
restore default Neoverse SVE build instructions for non-NVIDIA compilers
2 years ago
Martin Kroeker
8794544b43
Add support for compiling the Neoverse SVE kernels with the NVIDIA HPC compiler
2 years ago
Ian McInerney
8a8a8479be
Fix cooperlake and sapphire rapids march flags on clang
The march=cooperlake and march=sapphirerapids flags were never getting
added when building with Clang targetting those architectures. Instead
it was falling back to the skylake AVX512 implementation.
Clang added support for these two architectures in Clang 9 and Clang 12,
so introduce new checks for those versions to enable the appropriate
march flag, and fallback to skylake otherwise.
2 years ago
Honglin Zhu
9e80a194d6
Fix dynamic_list build and gcc version check error
2 years ago
Martin Kroeker
23d59baaf1
Add -mfma to -mavx2 for Apple clang, and set AVX2 options for Zen as well
3 years ago
Wangyang Guo
3dc6052c7e
initial support for Sapphire Rapids platform
4 years ago
Martin Kroeker
005cce5507
Amend SkylakeX options to support the NVIDIA compiler
4 years ago
gxw
4b548857d6
Add msa support for loongson
1. Using core loongson3r3 and loongson3r4 for loongson
2. Add DYNAMIC_ARCH for loongson
Change-Id: I1c6b54dbeca3a0cc31d1222af36a7e9bd6ab54c1
4 years ago
Martin Kroeker
b976a0bf40
Remove previous workaround for compiler flags related to cpu capabilities in x86_64 DYNAMIC_ARCH builds
4 years ago
Guillaume Horel
1917a4e7b8
reuse variables defined in Makefile.system
5 years ago
Martin Kroeker
2e48d560ba
Fix compiler version check
5 years ago
Martin Kroeker
df70667043
fix core list for sse/sse2
5 years ago
Martin Kroeker
f071d1207a
add sse2
5 years ago
Martin Kroeker
dc6cefd2f5
Expressly enable -msse for 32bit DYNAMIC_ARCH kernels
5 years ago
Martin Kroeker
ebf0470fc2
add sse4.1 for DYNAMIC_ARCH kernels
5 years ago
Martin Kroeker
fecedc9c69
Add -mssse3
5 years ago
Martin Kroeker
0eacbca85f
Add Haswell and Zen to temporary sse3 whitelist
5 years ago
Martin Kroeker
6999086a2b
whitelist SANDYBRIDGE for SSE3
5 years ago
Martin Kroeker
7a53128481
Add whitelist of DYNAMIC_ARCH kernels for which -msse3 needs to be enabled
5 years ago
Qiyu8
f32d34a015
add sse3 compiler flag
5 years ago
Martin Kroeker
6f4dc7445d
Fix typo
5 years ago
Martin Kroeker
81fbe8d088
-march=cooperlake only available in gcc >= 10
5 years ago
Chen, Guobing
e740c4873d
Enable COOPERLAKE build target
Enable new build target platform -- COOPERLAKE. This target platform
supports all the SKYLAKEX supported ISAs + avx512bf16. So all the
SKYLAKEX specific kernels/drivers and related code are now extended
to be also active on COOPERLAKE. Besides, new BF16 related kernels
are active under this target.
5 years ago
Rajalakshmi Srinivasaraghavan
d557584b71
Fix compilation issues with clang on POWER
As gcc defaults to -malign-power, removing that option. Also
adding -fno-integrated-as to use GNU assembler for powerpc
assembly optimization files. Fixed other compilation errors
reported in dgemv_t.c file.
5 years ago
Martin Kroeker
97762234f9
Add variable for gcc >=9 test
used in KERNEL.POWER9
5 years ago
Martin Kroeker
c4e23dd016
Update Makefile
6 years ago
Martin Kroeker
cfc4acc221
typo
6 years ago
Martin Kroeker
545c2b1bbb
Add -mavx2 on Haswell only if the compiler supports it
6 years ago
Martin Kroeker
3843e3e017
use -maxv2 on haswell
6 years ago
Martin Kroeker
fbcb14a74b
should be core-avx2
6 years ago
Martin Kroeker
2a3190dc76
fix elseifeq and use older option core2-avx for compatibility
6 years ago
Martin Kroeker
1ebe5c0f49
Add -march=haswell to HASWELL part of DYNAMIC_ARCH build
6 years ago
Martin Kroeker
51aec8e96b
make sure the added march=skylake-avx512 does not cause problems on Windows
6 years ago
Martin Kroeker
06f7d78d70
Add -march=skylake-avx512 to SkylakeX part of DYNAMIC_ARCH builds
6 years ago
Ashwin Sekhar T K
e7b66cd36e
ARM64: Fix DYNAMIC_ARCH compilation for cores which dont use GEMM3M
7 years ago
Timothy Gu
6c2ead30f0
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
11 years ago
Jameson Nash
d0e731e8b8
provide support for passing CFLAGS, FFLAGS, PFLAGS, FPFLAGS to make on the command line
13 years ago
Xianyi Zhang
25f1a573fd
Fixed the build bug when DYNAMIC_ARCH=0.
13 years ago
Xianyi Zhang
32353a9d30
Refs #20 . Fixed the installation bug with DYNAMIC_ARCH=1.
14 years ago
Xianyi Zhang
8d50a9fd1a
Fixed #35 a build bug with NO_LAPACK=1 & DYNAMIC_ARCH=1.
14 years ago
Xianyi Zhang
342bbc3871
Import GotoBLAS2 1.13 BSD version codes.
14 years ago