Martin Kroeker
d9369bda1e
Update and amend parameters for Neoverse cpus
7 months ago
Ruiyang Wu
1b0c0f00e9
CMake: Avoid mixed OpenMP linkage
8 months ago
Ruiyang Wu
02fd1df10b
CMake: Pass `OpenMP` compiler and linker flags through CMake targets
Using `OpenMP::OpenMP_LANG` targets for CMake is less error-prone than
passing the compiler and linker flags manually. Furthermore, it allows
the user to customize those flags by setting `OpenMP_LANG_FLAGS`,
`OpenMP_LANG_LIB_NAMES`, and `OpenMP_omp_LIBRARY`.
8 months ago
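The target-based approach described in commit 02fd1df10b can be sketched as follows. This is a minimal illustration, not the project's actual build code: the project name `openmp_target_sketch`, the target `demo_kernel`, and the generated source file are hypothetical, and C stands in for `LANG`.

```cmake
cmake_minimum_required(VERSION 3.9)  # OpenMP::OpenMP_<lang> imported targets exist since 3.9
project(openmp_target_sketch C)      # hypothetical project name

# Locate OpenMP for the enabled languages; defines OpenMP::OpenMP_C on success.
find_package(OpenMP REQUIRED)

# Generate a trivial source file so the sketch is self-contained (hypothetical content).
file(WRITE "${CMAKE_CURRENT_BINARY_DIR}/demo_kernel.c"
     "int demo_kernel(void) { return 0; }\n")

# Hypothetical library target, for illustration only.
add_library(demo_kernel "${CMAKE_CURRENT_BINARY_DIR}/demo_kernel.c")

# Linking against the imported target carries both the compile flags and the
# link libraries, so nothing is appended to CMAKE_C_FLAGS or linker flags by hand.
target_link_libraries(demo_kernel PRIVATE OpenMP::OpenMP_C)
```

With this layout a user could override the detected settings at configure time, for example by passing `-DOpenMP_C_FLAGS=...`, `-DOpenMP_C_LIB_NAMES=...`, or `-DOpenMP_omp_LIBRARY=...` on the `cmake` command line, which matches the customization the commit message mentions.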
Martin Kroeker
b34235ca66
Fix inclusion of deprecated interfaces and cgesvdq/strsyl3
8 months ago
Martin Kroeker
f1fa370579
fix missing endif
9 months ago
Martin Kroeker
6d1444be3a
Add ARM64 options for NVIDIA HPC
9 months ago
Vaisakh K V
f66ca05b31
Merge branch 'develop' into topic/sgemm_direct_sme1
9 months ago
Vaisakh K V
d23eb3b93e
Support for SME1 based sgemm_direct kernel for cblas_sgemm level 3 API
* Added ARMV9SME target
* Added SGEMM_DIRECT kernel based on SME1
11 months ago
Martin Kroeker
877d5a5be6
Add -O2 to flang flags when building on WoA in Release mode
9 months ago
Martin Kroeker
262018f14c
Merge pull request #5092 from XiWeiGu/la64_fixed_cmake
LoongArch64: Fixed cmake
10 months ago
Martin Kroeker
180ba5e7d0
Merge pull request #5069 from tingboliao/dev_rotm_20250107
Further rearranged the rotm kernel for the different architectures.
10 months ago
gxw
1ebcbdbab3
LoongArch64: Fixed the issue of using the old-style TARGET in cmake builds
10 months ago
Martin Kroeker
111c9b0733
Add translations for C_COMPILER and OSNAME
10 months ago
tingbo.liao
3c8df6358f
Further rearranged the rotm kernel for the different architectures.
Signed-off-by: tingbo.liao <tingbo.liao@starfivetech.com>
10 months ago
Martin Kroeker
fbf594b62f
Guard against empty CMAKE_Fortran_COMPILER_ID
11 months ago
Martin Kroeker
d78fbe425c
Assume no underscore suffixes on symbols when compiling with ifx on Windows
11 months ago
Martin Kroeker
30188a55d1
Don't assume underscore-suffixed symbols for ifx; make cpuid.S inclusion conditional
11 months ago
Martin Kroeker
32319a33ac
Add options for Intel oneAPI 2025.0 ifx on Windows
11 months ago
Matthew Thompson
c4e8bac5a5
Fix indent
11 months ago
Matthew Thompson
be19966d3b
Fixes for NAG CMake
11 months ago
Matthew Thompson
2eaf285de5
Use F_COMPILER name
1 year ago
Matthew Thompson
a8b1705dbd
CMake build has wrong PIC flag for NAG
1 year ago
Martin Kroeker
57a51d74c9
translate CMAKE_SYSTEM_NAME in compilations on or for iOS
1 year ago
Martin Kroeker
cea9df3643
Update Cray compiler options and calling convention
1 year ago
Chip Kerchner
36bd3eeddf
Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power).
1 year ago
Martin Kroeker
b0346e72f4
update names of loongarch64 targets for cross-compilation
1 year ago
Martin Kroeker
9c707dc6b9
Update dynamic arch list to new target scheme
1 year ago
Martin Kroeker
b4495a8fb8
Merge branch 'develop' into arm64_cmake_small_matrix_opt
1 year ago
Martin Kroeker
4f00f02567
Do not add -mabi flags for Loongson when the compiler is flang
1 year ago
Martin Kroeker
de421b7764
Merge pull request #4904 from XiWeiGu/la64_cross_cmake
LoongArch64: Enable cmake cross-compilation
1 year ago
Martin Kroeker
0228d36211
move -fopenmp to CFLAGS
1 year ago
gxw
7087b0a7d0
ARM64: Enable SMALL_MATRIX_OPT when compiling with CMake
1 year ago
gxw
30af9278dc
LoongArch64: Enable cmake cross-compilation
1 year ago
psykose
1265eee85c
fix cmake typo for power10 cc version check
fixes 668f48f4fc
1 year ago
Martin Kroeker
cc36db643e
Support new LAPACK build option LAPACK_STRLEN
1 year ago
Martin Kroeker
e8bd97ab4b
add RISCV64 entries for DYNAMIC_ARCH
1 year ago
Martin Kroeker
9eecd0d33b
enable GEMM/GEMV forwarding for riscv and ppc
1 year ago
Chris Sidebottom
b26424c6a2
Allow opt into GEMM -> GEMV forwarding
1 year ago
yamazaki-mitsufumi
821ef34635
Add A64FX to the list of CPUs supported by DYNAMIC_ARCH
1 year ago
Jaap Aarts
cea4abcac0
Fix compiling on mingw
1 year ago
Jaap Aarts
9d0abe2d26
Add support for RISCV64_GENERIC in cmake
1 year ago
Martin Kroeker
d25ee4d0f5
Fix detection of Intel ifx and apply -fp-model option to it
1 year ago
Martin Kroeker
21c0f769ef
ensure that cpu-specific -march options are always applied to icx
1 year ago
Alexander Neumann
dd4505c5dd
Fix CMake warning
1 year ago
Martin Kroeker
8b4996a2d5
Override icx's default fast math mode to ensure correct NaN handling
1 year ago
Martin Kroeker
6494f432df
Fix INTERFACE64 builds on Loongarch64
1 year ago
Martin Kroeker
a3f6b13bc9
remove spurious brace
1 year ago
Martin Kroeker
668f48f4fc
Use CMAKE_C_COMPILER_VERSION instead of dumpversion calls (#4698)
* Use CMAKE_C_COMPILER_VERSION throughout
1 year ago
Martin Kroeker
f5c080f083
Fix CMake syntax in kernel file parsing of IFNEQ conditionals (#4695)
* Fix syntax in parsing of IFNEQ
1 year ago
Martin Kroeker
3d26837a35
Suppress GCC14 error exit in the f2c-converted LAPACK
1 year ago