Srangrang
0a967797a1
Add FP16 support for RISCV
4 months ago
Martin Kroeker
f2022c23ac
Remove sve capability from NeoverseN1 and specify CortexX2/A?10 as arm8.4a
4 months ago
Martin Kroeker
d9369bda1e
Update and amend parameters for Neoverse cpus
5 months ago
Ruiyang Wu
1b0c0f00e9
CMake: Avoid mixed OpenMP linkage
6 months ago
Ruiyang Wu
02fd1df10b
CMake: Pass `OpenMP` compiler and linker flags through CMake targets
Using `OpenMP::OpenMP_LANG` targets for CMake is less error-prone than
passing the compiler and linker flags manually. Furthermore, it allows
the user to customize those flags by setting `OpenMP_LANG_FLAGS`,
`OpenMP_LANG_LIB_NAMES`, and `OpenMP_omp_LIBRARY`.
6 months ago
Martin Kroeker
b34235ca66
Fix inclusion of deprecated interfaces and cgesvdq/strsyl3
6 months ago
Martin Kroeker
f1fa370579
fix missing endif
7 months ago
Martin Kroeker
6d1444be3a
Add ARM64 options for NVIDIA HPC
7 months ago
Vaisakh K V
f66ca05b31
Merge branch 'develop' into topic/sgemm_direct_sme1
7 months ago
Vaisakh K V
d23eb3b93e
Support for SME1 based sgemm_direct kernel for cblas_sgemm level 3 API
* Added ARMV9SME target
* Added SGEMM_DIRECT kernel based on SME1
10 months ago
Martin Kroeker
877d5a5be6
Add -O2 to flang flags when building on WoA in Release mode
7 months ago
Martin Kroeker
262018f14c
Merge pull request #5092 from XiWeiGu/la64_fixed_cmake
LoongArch64: Fixed cmake
8 months ago
Martin Kroeker
180ba5e7d0
Merge pull request #5069 from tingboliao/dev_rotm_20250107
Further rearranged the rotm kernel for the different architectures.
8 months ago
gxw
1ebcbdbab3
LoongArch64: Fixed the issue of using the old-style TARGET in cmake builds
8 months ago
Martin Kroeker
111c9b0733
Add translations for C_COMPILER and OSNAME
8 months ago
tingbo.liao
3c8df6358f
Further rearranged the rotm kernel for the different architectures.
Signed-off-by: tingbo.liao <tingbo.liao@starfivetech.com>
8 months ago
Martin Kroeker
fbf594b62f
Guard against empty CMAKE_Fortran_COMPILER_ID
9 months ago
Martin Kroeker
d78fbe425c
Assume no underline suffixes on symbols when compiling with ifx on Windows
9 months ago
Martin Kroeker
30188a55d1
Don't assume underlined symbols for ifx; make cpuid.S inclusion conditional
9 months ago
Martin Kroeker
32319a33ac
Add options for Intel oneAPI 2025.0 ifx on Windows
9 months ago
Matthew Thompson
c4e8bac5a5
Fix indent
10 months ago
Matthew Thompson
be19966d3b
Fixes for NAG CMake
10 months ago
Matthew Thompson
2eaf285de5
Use F_COMPILER name
10 months ago
Matthew Thompson
a8b1705dbd
CMake build has wrong PIC flag for NAG
10 months ago
Martin Kroeker
57a51d74c9
translate CMAKE_SYSTEM_NAME in compilations on or for IOS
10 months ago
Martin Kroeker
cea9df3643
Update Cray compiler options and calling convention
10 months ago
Chip Kerchner
36bd3eeddf
Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power).
11 months ago
Martin Kroeker
b0346e72f4
update names of loongarch64 targets for cross-compilation
1 year ago
Martin Kroeker
9c707dc6b9
Update dynamic arch list to new target scheme
1 year ago
Martin Kroeker
b4495a8fb8
Merge branch 'develop' into arm64_cmake_small_matrix_opt
1 year ago
Martin Kroeker
4f00f02567
Do not add -mabi flags for Loongson when the compiler is flang
1 year ago
Martin Kroeker
de421b7764
Merge pull request #4904 from XiWeiGu/la64_cross_cmake
LoongArch64: Enable cmake cross-compilation
1 year ago
Martin Kroeker
0228d36211
move -fopenmp to CFLAGS
1 year ago
gxw
7087b0a7d0
ARM64: Enable SMALL_MATRIX_OPT when compiling with CMake
1 year ago
gxw
30af9278dc
LoongArch64: Enable cmake cross-compilation
1 year ago
psykose
1265eee85c
fix cmake typo for power10 cc version check
fixes 668f48f4fc
1 year ago
Martin Kroeker
cc36db643e
Support new LAPACK build option LAPACK_STRLEN
1 year ago
Martin Kroeker
e8bd97ab4b
add RISCV64 entries for DYNAMIC_ARCH
1 year ago
Martin Kroeker
9eecd0d33b
enable GEMM/GEMV forwarding for riscv and ppc
1 year ago
Chris Sidebottom
b26424c6a2
Allow opt into GEMM -> GEMV forwarding
1 year ago
yamazaki-mitsufumi
821ef34635
Add A64FX to the list of CPUs supported by DYNAMIC_ARCH
1 year ago
Jaap Aarts
cea4abcac0
Fix compiling on mingw
1 year ago
Jaap Aarts
9d0abe2d26
Add support for RISCV64_GENERIC in cmake
1 year ago
Martin Kroeker
d25ee4d0f5
Fix detection of Intel ifx and apply -fp-model option to it
1 year ago
Martin Kroeker
21c0f769ef
ensure that cpu-specific -march options are always applied to icx
1 year ago
Alexander Neumann
dd4505c5dd
Fix CMake warning
1 year ago
Martin Kroeker
8b4996a2d5
Override icx's default fast math mode to ensure correct NaN handling
1 year ago
Martin Kroeker
6494f432df
Fix INTERFACE64 builds on Loongarch64
1 year ago
Martin Kroeker
a3f6b13bc9
remove spurious brace
1 year ago
Martin Kroeker
668f48f4fc
Use CMAKE_C_COMPILER_VERSION instead of dumpversion calls ( #4698 )
* Use CMAKE_C_COMPILER_VERSION throughout
1 year ago