Vaisakh K V
f66ca05b31
Merge branch 'develop' into topic/sgemm_direct_sme1
7 months ago
Vaisakh K V
d23eb3b93e
Support for SME1 based sgemm_direct kernel for cblas_sgemm level 3 API
* Added ARMV9SME target
* Added SGEMM_DIRECT kernel based on SME1
10 months ago
Martin Kroeker
877d5a5be6
Add -O2 to flang flags when building on WoA in Release mode
7 months ago
gxw
1ebcbdbab3
LoongArch64: Fixed the issue of using the old-style TARGET in cmake builds
8 months ago
Martin Kroeker
d78fbe425c
Assume no underscore suffixes on symbols when compiling with ifx on Windows
9 months ago
Matthew Thompson
c4e8bac5a5
Fix indent
10 months ago
Matthew Thompson
be19966d3b
Fixes for NAG CMake
10 months ago
Matthew Thompson
2eaf285de5
Use F_COMPILER name
10 months ago
Matthew Thompson
a8b1705dbd
CMake build has wrong PIC flag for NAG
10 months ago
Chip Kerchner
36bd3eeddf
Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power).
11 months ago
Martin Kroeker
b4495a8fb8
Merge branch 'develop' into arm64_cmake_small_matrix_opt
1 year ago
gxw
7087b0a7d0
ARM64: Enable SMALL_MATRIX_OPT when compiling with CMake
1 year ago
gxw
30af9278dc
LoongArch64: Enable cmake cross-compilation
1 year ago
psykose
1265eee85c
fix cmake typo for power10 cc version check
fixes 668f48f4fc
1 year ago
Martin Kroeker
cc36db643e
Support new LAPACK build option LAPACK_STRLEN
1 year ago
Martin Kroeker
e8bd97ab4b
add RISCV64 entries for DYNAMIC_ARCH
1 year ago
Martin Kroeker
9eecd0d33b
enable GEMM/GEMV forwarding for riscv and ppc
1 year ago
Chris Sidebottom
b26424c6a2
Allow opt into GEMM -> GEMV forwarding
1 year ago
yamazaki-mitsufumi
821ef34635
Add A64FX to the list of CPUs supported by DYNAMIC_ARCH
1 year ago
Jaap Aarts
cea4abcac0
Fix compiling on mingw
1 year ago
Martin Kroeker
a3f6b13bc9
remove spurious brace
1 year ago
Martin Kroeker
668f48f4fc
Use CMAKE_C_COMPILER_VERSION instead of dumpversion calls (#4698)
* Use CMAKE_C_COMPILER_VERSION throughout
1 year ago
Martin Kroeker
3d26837a35
Suppress GCC14 error exit in the f2c-converted LAPACK
1 year ago
مهدي شينون (Mehdi Chinoune)
cda55f2fd2
Don't pass `-exhaustive-register-search` directly to clang compiler
`-exhaustive-register-search` is an LLVM code generation flag that shouldn't be passed directly to the clang compiler.
1 year ago
Martin Kroeker
52b71a1673
Filter out FFLAGS that flang-new from LLVM18 no longer supports (#4569)
1 year ago
Martin Kroeker
a0e3f77e0b
add FIXED_LIBNAME, PREFIX and SUFFIX
1 year ago
Martin Kroeker
49689fbef7
Add support for compiling SVE kernels with the NVIDIA HPC compiler
2 years ago
Martin Kroeker
ac698cedad
Add compiler options for ARM64 SVE targets in DYNAMIC_ARCH builds
2 years ago
Martin Kroeker
d2144b2981
Add NVHPC
2 years ago
Martin Kroeker
de937b3194
Add clang option to avoid running out of registers in AVX512 assembly
2 years ago
Martin Kroeker
e964ebd0d0
Add compiler option for AVX512-capable Ryzen(4)
2 years ago
Martin Kroeker
a0a4f7c447
Add -mfma to -mavx2 for clang, and add AVX2 declaration for Zen in DYNAMIC_ARCH builds
3 years ago
Martin Kroeker
85fd3c4279
Support compilation with the Cray C and Fortran compilers (#3712)
* Add support for the Cray Fortran compiler
3 years ago
Martin Kroeker
18b19d135b
C_LAPACK: Fixes to make it compile with MSVC (#3605)
* Fix f2c-like support functions to compile with MSVC, and re-enable C_LAPACK for MSVC in CMAKE
* Add MSVC&flang build to Azure CI in order to check C_LAPACK correctness
3 years ago
Martin Kroeker
b7873605d4
Use f2c translations of LAPACK when no Fortran compiler is available (#3539)
* Add C equivalents of the Fortran routines from Reference-LAPACK as fallbacks, and C_LAPACK variable to trigger their use
3 years ago
Rafael Cardoso Fernandes Sousa
d38110a5ce
Use CMake variables instead of as
3 years ago
Rafael Cardoso Fernandes Sousa
214fbcee15
Fix cmake for power
3 years ago
Markus Mützel
de2ed66596
cmake: Set SUFFIX64 also for NOFORTRAN
3 years ago
Wangyang Guo
3dc6052c7e
initial support for Sapphire Rapids platform
4 years ago
Martin Kroeker
e02df9fc55
Propagate BUILD_BFLOAT16 to CFLAGS
4 years ago
Wangyang Guo
76ea8db4da
Small Matrix: enable by default for x86_64 arch
If no customized GEMM_SMALL_M_PERMIT kernel is defined, it simply falls through to the normal path.
4 years ago
Wangyang Guo
fee5abd84b
Small Matrix: support cmake build
4 years ago
Martin Kroeker
30f23be0f9
Rework setting of -mfma to only apply it where necessary
4 years ago
User User-User
91e2b11d3c
add to cmake listings too
4 years ago
刘雨培
725432efaa
pass NO_AVX512 macro def
4 years ago
Martin Kroeker
33b5670122
Merge pull request #3096 from martin-frbg/fixclangcmake
Fix Cooperlake/DYNAMIC_ARCH builds with clang on Windows
4 years ago
Martin Kroeker
95e19e2e23
fix case in compiler name check
Co-authored-by: xoviat <49173759+xoviat@users.noreply.github.com>
4 years ago
Martin Kroeker
99ac042702
remove spurious lines (probably editor malfunction)
4 years ago
Martin Kroeker
774b9f8653
handle AppleClang in Cooperlake support condition
4 years ago
Martin Kroeker
eb1d2344f7
Fix compiler version check for Intel Cooperlake support (clang-cl does not accept -dumpversion)
4 years ago