Vaisakh K V
d23eb3b93e
Support for SME1 based sgemm_direct kernel for cblas_sgemm level 3 API
* Added ARMV9SME target
* Added SGEMM_DIRECT kernel based on SME1
10 months ago
Martin Kroeker
cea9df3643
Update Cray compiler options and calling convention
10 months ago
Chip Kerchner
36bd3eeddf
Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power).
11 months ago
Martin Kroeker
b0346e72f4
update names of loongarch64 targets for cross-compilation
1 year ago
Martin Kroeker
9c707dc6b9
Update dynamic arch list to new target scheme
1 year ago
Martin Kroeker
b4495a8fb8
Merge branch 'develop' into arm64_cmake_small_matrix_opt
1 year ago
Martin Kroeker
4f00f02567
Do not add -mabi flags for Loongson when the compiler is flang
1 year ago
Martin Kroeker
de421b7764
Merge pull request #4904 from XiWeiGu/la64_cross_cmake
LoongArch64: Enable cmake cross-compilation
1 year ago
Martin Kroeker
0228d36211
move -fopenmp to CFLAGS
1 year ago
gxw
7087b0a7d0
ARM64: Enable SMALL_MATRIX_OPT when compiling with CMake
1 year ago
gxw
30af9278dc
LoongArch64: Enable cmake cross-compilation
1 year ago
psykose
1265eee85c
fix cmake typo for power10 cc version check
fixes 668f48f4fc
1 year ago
Martin Kroeker
cc36db643e
Support new LAPACK build option LAPACK_STRLEN
1 year ago
Martin Kroeker
e8bd97ab4b
add RISCV64 entries for DYNAMIC_ARCH
1 year ago
Martin Kroeker
9eecd0d33b
enable GEMM/GEMV forwarding for riscv and ppc
1 year ago
Chris Sidebottom
b26424c6a2
Allow opt into GEMM -> GEMV forwarding
1 year ago
yamazaki-mitsufumi
821ef34635
Add A64FX to the list of CPUs supported by DYNAMIC_ARCH
1 year ago
Jaap Aarts
cea4abcac0
Fix compiling on mingw
1 year ago
Jaap Aarts
9d0abe2d26
Add support for RISCV64_GENERIC in cmake
1 year ago
Martin Kroeker
d25ee4d0f5
Fix detection of Intel ifx and apply -fp-model option to it
1 year ago
Martin Kroeker
21c0f769ef
ensure that cpu-specific -march options are always applied to icx
1 year ago
Alexander Neumann
dd4505c5dd
Fix CMake warning
1 year ago
Martin Kroeker
8b4996a2d5
Override icx's default fast math mode to ensure correct NaN handling
1 year ago
Martin Kroeker
6494f432df
Fix INTERFACE64 builds on Loongarch64
1 year ago
Martin Kroeker
a3f6b13bc9
remove spurious brace
1 year ago
Martin Kroeker
668f48f4fc
Use CMAKE_C_COMPILER_VERSION instead of dumpversion calls ( #4698 )
* Use CMAKE_C_COMPILER_VERSION throughout
1 year ago
Martin Kroeker
f5c080f083
Fix CMAKE syntax in kernel file parsing of IFNEQ conditionals ( #4695 )
* Fix syntax in parsing of IFNEQ
1 year ago
Martin Kroeker
3d26837a35
Suppress GCC14 error exit in the f2c-converted LAPACK
1 year ago
Martin Kroeker
69aa93e34f
Fix Loongson compiler flag check
1 year ago
Martin Kroeker
015042f7b5
Fix Loongson compiler flag test
1 year ago
مهدي شينون (Mehdi Chinoune)
cda55f2fd2
Don't pass `-exhaustive-register-search` directly to clang compiler
`-exhaustive-register-search` is an LLVM code generation flag that shouldn't be passed directly to clang compiler.
1 year ago
Martin Kroeker
3af736fb9d
Add support for Cortex-A76
1 year ago
Martin Kroeker
52b71a1673
Filter out FFLAGS that flang-new from LLVM18 no longer supports ( #4569 )
* Filter out FFLAGS that flang-new from LLVM18 no longer supports
1 year ago
Martin Kroeker
2e86faa657
Merge branch 'develop' into issue4468
1 year ago
Martin Kroeker
8fc2c2db04
Fix missing support for INTERFACE64 on ARM64 and MIPS64
1 year ago
Martin Kroeker
82b81c0bbe
Dont fail if there is no Fortran compiler
1 year ago
Martin Kroeker
a0e3f77e0b
add FIXED_LIBNAME, PREFIX and SUFFIX
1 year ago
Martin Kroeker
ffbfc3c692
Add libname prefix and suffix
1 year ago
Martin Kroeker
0c43c6fa99
Merge pull request #4341 from catap/openblas.pc.in
cmake/openblas.pc.in: fixed version and URL
1 year ago
Martin Kroeker
e9c32ed165
Merge pull request #4384 from yetist/develop
Fix: build failed on LoongArch
1 year ago
Martin Kroeker
1106460bb3
remove redundant targets from the default ARM64 DYNAMIC_ARCH list
1 year ago
Wu Xiaotian
0baf462dbc
Fix: build failed on LoongArch
According to the documentation at https://github.com/loongson/la-abi-specs/blob/release/lapcs.adoc#the-base-abi-variants , valid -mabi parameters are lp64s, lp64f, lp64d, ilp32s, ilp32f and ilp32d.
1 year ago
barracuda156
8c143331b0
PPC970: drop -mcpu=970 which seems to produce faulty code
Fixes: https://github.com/OpenMathLib/OpenBLAS/issues/4376
1 year ago
barracuda156
981e315b30
cc.cmake: use -force_cpusubtype_ALL for Darwin PPC
1 year ago
barracuda156
a8d3619f65
cc.cmake: add optflags for G5 and G4 kernels
1 year ago
barracuda156
c732f275a2
system_check.cmake: fix arch detection for Darwin PowerPC
1 year ago
Kirill A. Korinsky
08fde5ebd2
Use 64bit build on `CMAKE_SYSTEM_PROCESSOR=i386` on Darwin
Here a bit tricky things.
A value `CMAKE_SYSTEM_PROCESSOR` is came from output of `uname -m` which
migth be 32bit with 64bit building applicaiton.
So, for that case use `CMAKE_SIZEOF_VOID_P` to detect the target.
See https://trac.macports.org/ticket/68488
1 year ago
Kirill A. Korinsky
01c7010543
cmake/openblas.pc.in: fixed version and URL
1 year ago
Martin Kroeker
5bf87c86f5
Implement truncated QR with pivoting (Reference-LAPACK PR 891)
1 year ago
Martin Kroeker
58427ff74d
Deprecate ?GELQS and ?GEQRS from TESTING/LIN (Reference-LAPACK PR 900) ( #4307 )
* Move ?GELQS and ?GEQRS from TESTING/LIN to DEPRECATED (Reference-LAPACK PR 900)
* Add f2c-converted versions of ?GELQS and ?GEQRS
1 year ago