Martin Kroeker
ed430cd963
Update -tp option for recent nvfortran on x86_64
3 years ago
kavanabhat
eee3381cbe
Fallback for Power kernels
3 years ago
Martin Kroeker
54d321d742
Merge pull request #3466 from rafaelcfsousa/rafael/small_matrix_p10
[POWER] Add small matrix for sgemm/dgemm on Power10
3 years ago
kavanabhat
9a45b5123f
Update Makefile.system
3 years ago
kavanabhat
7b5b93037d
Fix truncated assembler checks
3 years ago
Rafael Cardoso Fernandes Sousa
c78fdcc80d
[POWER] Add support for SMALL_MATRIX_OPT
3 years ago
Martin Kroeker
46947efb83
Ignore compiler support for MIPS MSA if the cpu lacks this capability
3 years ago
Martin Kroeker
9cc0098ce2
Fix potentially wrong HOSTARCH definition in cross-compilation
3 years ago
Martin Kroeker
a6fd497820
Fix nvidia HPC version checks
3 years ago
Martin Kroeker
bb01e26cfe
Adjust compiler options for nvidia hpc 21.9 (and fix a long-standing typo in dynamic_arch settings)
3 years ago
Wangyang Guo
3dc6052c7e
initial support for Sapphire Rapids platform
4 years ago
Martin Kroeker
8e4c209002
Merge pull request #3398 from kavanabhat/aix_p10_gnuas
Big Endian Changes for Power10 kernels
4 years ago
Martin Kroeker
04f3ecd026
Fix minor typo
4 years ago
kavanabhat
9cc95e5657
AIX changes for P10 with GNU Compiler
4 years ago
Alexandru Ardelean
b7bb2e36b8
Makefile.system: adjust mipsel/mips64el ARCH variables
When building for the MIPS{64} little-endian variants, the included makefiles
should be the same as for the big-endian ones.
There are already some adjustments being done for some ARCH names.
This change adds the ones for the `mipsel` and `mips64el` names, so that
the Makefile.mips{64} files get included.
This comes as a result of: https://github.com/openwrt/packages/issues/16649
Signed-off-by: Alexandru Ardelean <ardeleanalex@gmail.com>
4 years ago
Wangyang Guo
76ea8db4da
Small Matrix: enable by default for x86_64 arch
If no customized GEMM_SMALL_M_PERMIT kernel is defined, it will just bypass to the normal path.
4 years ago
Xianyi Zhang
0a2077901c
Add small matrix optimization kernel interface.
make SMALL_MATRIX_OPT=1
5 years ago
gxw
34207bdf5b
Fixed typos about LOONGARCH64
4 years ago
gxw
af0a69f355
Add support for LOONGARCH64
4 years ago
User User-User
9335d42740
add gcc8 version matching
4 years ago
User User-User
b7da75e4fd
WiP CORTEX A55 support
4 years ago
MikaelUrankar
4fbc0777f4
Fix typo
4 years ago
Martin Kroeker
26ccf643a3
Add -lm for FreeBSD on ARM/ARM64
4 years ago
Martin Kroeker
3c356b1a1f
Support compilation with the NAG Fortran compiler
4 years ago
Martin Kroeker
20f492c298
Fix AMD AOCC compiler detection
4 years ago
Martin Kroeker
9b2d69aa80
Add DYNAMIC_LIST option for ARM64
4 years ago
Martin Kroeker
6bbe6d5b92
Make compile-time BUFFERSIZE setting actually reach the compiler/preprocessor
4 years ago
pkubaj
7aa1ff8ff6
Fix build on FreeBSD/powerpc64le
4 years ago
Martin Kroeker
75b1f3becc
Limit POWERPC DYNAMIC_CORE list to P8 and P9 for NVIDIA compilers
4 years ago
Martin Kroeker
b212a2fb9f
Add/modify "PGI" compiler options for NVIDIA SDK 20.11
4 years ago
Martin Kroeker
18d8a67485
Merge pull request #2994 from antonblanchard/power10-fixes
Power10 fixes
4 years ago
gxw
4b548857d6
Add msa support for loongson
1. Using core loongson3r3 and loongson3r4 for loongson
2. Add DYNAMIC_ARCH for loongson
Change-Id: I1c6b54dbeca3a0cc31d1222af36a7e9bd6ab54c1
4 years ago
Martin Kroeker
6c7d557a16
Fix compiler options for 32 and 64bit SPARC builds with SolarisStudio
4 years ago
Martin Kroeker
2e99e2699b
Add workaround for gcc 4.6 miscompiling assembly kernels with -mavx
4 years ago
Martin Kroeker
437702e0e1
Merge pull request #2965 from epsilon-0/develop
allow setting soname without suffix or prefix
4 years ago
Anton Blanchard
fdf71d66b3
POWER10: Fix ld version detection
LDVERSIONGTEQ35 needs to escape the '>' character.
LDVERSIONGTEQ35 is checking the system ld version which may be different
to the toolchain being used to compile OpenBLAS. We don't have a path
to the linker in our Makefiles, so (ab)use gcc -Wl,--version to get the
version of ld in our toolchain.
4 years ago
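The idea in the commit message above (ask the compiler driver for the version of the linker it actually uses, then compare it against 2.35) can be sketched in plain shell. This is an illustration only, not the code in Makefile.system: the function name `ld_version_gteq_235` is hypothetical, and the parsing assumes the first line of output ends in a version string such as `2.35.1`.

```shell
# Hypothetical helper (not from Makefile.system): succeeds when the given
# "ld --version" banner line reports binutils 2.35 or newer.
ld_version_gteq_235() {
    # Assume the version is the last whitespace-separated field of the first line.
    ver=$(printf '%s\n' "$1" | head -n 1 | awk '{print $NF}')
    major=${ver%%[!0-9]*}      # leading digits, e.g. "2"
    rest=${ver#*.}             # everything after the first dot, e.g. "35.1"
    minor=${rest%%[!0-9]*}     # leading digits of the remainder, e.g. "35"
    # Treat anything unparseable as "too old".
    [ -n "$major" ] && [ -n "$minor" ] || return 1
    [ "$major" -gt 2 ] || { [ "$major" -eq 2 ] && [ "$minor" -ge 35 ]; }
}

# Query the linker through the compiler driver rather than the system PATH,
# so a cross toolchain reports its own ld.
line=$("${CC:-gcc}" -Wl,--version 2>/dev/null | head -n 1) || true
if ld_version_gteq_235 "$line"; then
    echo "ld >= 2.35"
else
    echo "ld < 2.35 or unknown"
fi
```

Doing the comparison with `[ ... -ge ... ]` avoids the need to escape `>` that the commit mentions; an `expr`-based comparison inside a make `$(shell ...)` call would need the escaped form `\>=`.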
Xianyi Zhang
fc35b72ae1
Refs #2899
Merge branch 'openblas-open-910' of git://github.com/damonyu1989/OpenBLAS into damonyu1989-openblas-open-910
4 years ago
Xianyi Zhang
913cc9a4ca
Merge branch 'develop' into risc-v
4 years ago
Martin Kroeker
1c4cfdc139
Stay compatible with old gmake that did not support undefine
4 years ago
Martin Kroeker
f6a57d8f63
Update Makefile.system
4 years ago
Martin Kroeker
f4b7ba12b7
Update Makefile.system
4 years ago
Martin Kroeker
a04f532edf
Reset cpu property flags between build cycles in DYNAMIC_ARCH mode
4 years ago
Martin Kroeker
8cc73fee98
Export NO_EXPRECISION after overriding for DYNAMIC_ARCH with GENERIC target
4 years ago
Aisha Tammy
60997ddd73
allow setting soname without suffix or prefix
Allows creating a library with a different
SONAME without the need to add suffixes to symbols.
Backwards compatible and should have no effect
on the workflow of previous users.
Useful for allowing an INTERFACE64 library alongside
the standard library without file conflicts.
4 years ago
Martin Kroeker
40a93c232b
Disable EXPRECISION for DYNAMIC_ARCH in combination with TARGET=GENERIC
NO_EXPRECISION is disabled for the GENERIC_TARGET already, so prevent mixing with code parts that use a different float size by default
4 years ago
Chen, Guobing
c5e62dad69
Fix cooperlake compile issue
Add a missing macro that is required in Makefile.x86_64 after the recent
cleanup; its absence causes the cooperlake platform build to fail.
4 years ago
Martin Kroeker
878b6d1f41
Remove spurious expr in flang version check
5 years ago
Martin Kroeker
1a0f57c8f0
Fix missing backquotes
5 years ago
Martin Kroeker
bb8c3f6861
Add ld/binutils version check for POWER10 support
5 years ago
Zhang Xianyi
d7ba7679b6
Merge branch 'develop' into risc-v
5 years ago