Martin Kroeker
8f13ab94d2
Merge pull request #3613 from Rabenda/fix-riscv
Fix riscv64 detect
3 years ago
Martin Kroeker
24e99eca31
Avoid adding -lgfortran with NOFORTRAN
3 years ago
Han Gao
3fc52ebcfb
Fix other arch build in detect.
When CORE is empty, use -march=loongson3a. Fix it.
Signed-off-by: Han Gao <gaohan@uniontech.com>
3 years ago
Niyas Sait
3f5d145cd4
build: minor fixes to build on windows with make
This patch contains the following fixes:
1. Fix to build without the PIC flag
2. Define LAPACK_COMPLEX_STRUCTURE for Windows. Builds fail
without it, and the change is consistent with the CMake rules defined
in system.cmake (line 576)
3 years ago
Martin Kroeker
b7873605d4
Use f2c translations of LAPACK when no Fortran compiler is available (#3539)
* Add C equivalents of the Fortran routines from Reference-LAPACK as fallbacks, and C_LAPACK variable to trigger their use
3 years ago
Martin Kroeker
499ae5e8f7
Merge pull request #3510 from martin-frbg/issue3505
Fix recent SkylakeX/DYNAMIC_ARCH DGEMM breakage
3 years ago
Martin Kroeker
f1ac59f200
Forward DYNAMIC_ARCH option to Makefile.prebuild
3 years ago
Sunita Nadampalli
19c8f615dc
OpenBLAS: aarch64: Add neoverse-v1/n2 architecture specifics
3 years ago
Martin Kroeker
ed430cd963
Update -tp option for recent nvfortran on x86_64
3 years ago
kavanabhat
eee3381cbe
Fallback for Power kernels
3 years ago
Martin Kroeker
54d321d742
Merge pull request #3466 from rafaelcfsousa/rafael/small_matrix_p10
[POWER] Add small matrix for sgemm/dgemm on Power10
3 years ago
kavanabhat
9a45b5123f
Update Makefile.system
3 years ago
kavanabhat
7b5b93037d
Fix truncated assembler checks
3 years ago
Rafael Cardoso Fernandes Sousa
c78fdcc80d
[POWER] Add support for SMALL_MATRIX_OPT
3 years ago
Martin Kroeker
46947efb83
Ignore compiler support for MIPS MSA if the cpu lacks this capability
3 years ago
Martin Kroeker
9cc0098ce2
Fix potentially wrong HOSTARCH definition in cross-compilation
3 years ago
Martin Kroeker
a6fd497820
Fix nvidia HPC version checks
3 years ago
Martin Kroeker
bb01e26cfe
Adjust compiler options for nvidia hpc 21.9 (and fix a long-standing typo in dynamic_arch settings)
3 years ago
Wangyang Guo
3dc6052c7e
initial support for Sapphire Rapids platform
4 years ago
Martin Kroeker
8e4c209002
Merge pull request #3398 from kavanabhat/aix_p10_gnuas
Big Endian Changes for Power10 kernels
4 years ago
Martin Kroeker
04f3ecd026
Fix minor typo
4 years ago
kavanabhat
9cc95e5657
AIX changes for P10 with GNU Compiler
4 years ago
Alexandru Ardelean
b7bb2e36b8
Makefile.system: adjust mipsel/mips64el ARCH variables
When building for MIPS{64} little-endian variants, the included makefiles
should be the same as for the big-endian variants.
There are already some adjustments being done for some ARCH names.
This change adds the ones for the `mipsel` and `mips64el` names, so that
the Makefile.mips{64} files get included.
This comes as a result of: https://github.com/openwrt/packages/issues/16649
Signed-off-by: Alexandru Ardelean <ardeleanalex@gmail.com>
4 years ago
Wangyang Guo
76ea8db4da
Small Matrix: enable by default for x86_64 arch
If no customized GEMM_SMALL_M_PERMIT kernel is defined, the call simply bypasses to the normal path.
4 years ago
Xianyi Zhang
0a2077901c
Add small matrix optimization kernel interface.
make SMALL_MATRIX_OPT=1
5 years ago
gxw
34207bdf5b
Fixed typos about LOONGARCH64
4 years ago
gxw
af0a69f355
Add support for LOONGARCH64
4 years ago
User User-User
9335d42740
add gcc8 version matching
4 years ago
User User-User
b7da75e4fd
WiP CORTEX A55 support
4 years ago
MikaelUrankar
4fbc0777f4
Fix typo
4 years ago
Martin Kroeker
26ccf643a3
Add -lm for FreeBSD on ARM/ARM64
4 years ago
Martin Kroeker
3c356b1a1f
Support compilation with the NAG Fortran compiler
4 years ago
Martin Kroeker
20f492c298
Fix AMD AOCC compiler detection
4 years ago
Martin Kroeker
9b2d69aa80
Add DYNAMIC_LIST option for ARM64
4 years ago
Martin Kroeker
6bbe6d5b92
Make compile-time BUFFERSIZE setting actually reach the compiler/preprocessor
4 years ago
pkubaj
7aa1ff8ff6
Fix build on FreeBSD/powerpc64le
4 years ago
Martin Kroeker
75b1f3becc
Limit POWERPC DYNAMIC_CORE list to P8 and P9 for NVIDIA compilers
4 years ago
Martin Kroeker
b212a2fb9f
Add/modify "PGI" compiler options for NVIDIA SDK 20.11
4 years ago
Martin Kroeker
18d8a67485
Merge pull request #2994 from antonblanchard/power10-fixes
Power10 fixes
4 years ago
gxw
4b548857d6
Add msa support for loongson
1. Using core loongson3r3 and loongson3r4 for loongson
2. Add DYNAMIC_ARCH for loongson
Change-Id: I1c6b54dbeca3a0cc31d1222af36a7e9bd6ab54c1
4 years ago
Martin Kroeker
6c7d557a16
Fix compiler options for 32 and 64bit SPARC builds with SolarisStudio
4 years ago
Martin Kroeker
2e99e2699b
Add workaround for gcc 4.6 miscompiling assembly kernels with -mavx
4 years ago
Martin Kroeker
437702e0e1
Merge pull request #2965 from epsilon-0/develop
allow setting soname without suffix or prefix
4 years ago
Anton Blanchard
fdf71d66b3
POWER10: Fix ld version detection
LDVERSIONGTEQ35 needs to escape the '>' character.
LDVERSIONGTEQ35 is checking the system ld version which may be different
to the toolchain being used to compile OpenBLAS. We don't have a path
to the linker in our Makefiles, so (ab)use gcc -Wl,--version to get the
version of ld in our toolchain.
4 years ago
Xianyi Zhang
fc35b72ae1
Refs #2899
Merge branch 'openblas-open-910' of git://github.com/damonyu1989/OpenBLAS into damonyu1989-openblas-open-910
4 years ago
Xianyi Zhang
913cc9a4ca
Merge branch 'develop' into risc-v
4 years ago
Martin Kroeker
1c4cfdc139
Stay compatible with old gmake that did not support undefine
4 years ago
Martin Kroeker
f6a57d8f63
Update Makefile.system
4 years ago
Martin Kroeker
f4b7ba12b7
Update Makefile.system
4 years ago
Martin Kroeker
a04f532edf
Reset cpu property flags between build cycles in DYNAMIC_ARCH mode
4 years ago