Martin Kroeker
c504aedca1
Merge pull request #5400 from Mousius/neoversev2-target
Add NEOVERSEV2 target support
2 months ago
Martin Kroeker
2f89a5970e
fix NeoverseV2 typo
2 months ago
Chris Sidebottom
87247daadc
Add NEOVERSEV2 target support
Did a quick run around to make `TARGET=NEVOERSEV2` build successfully.
Fixes #5385
2 months ago
Martin Kroeker
a5b55f6fe3
remove CBLAS restriction on GEMM_GEMV forwarding
2 months ago
Martin Kroeker
82954ba4ca
Update ?GEMM-to-?GEMV forwarding settings
2 months ago
Chris Sidebottom
f95e7b0e32
Add infrastructure for BGEMM
Setting up all the infrastructure for BGEMM support in OpenBLAS, hopefully I found all the right places.
Derived mostly from the previous work done in https://github.com/OpenMathLib/OpenBLAS/pull/5287
Co-authored-by: Ye Tao <ye.tao@arm.com>
3 months ago
Chris Sidebottom
552e1c7a7a
Correct compiler flags for NEOVERSEV1 target
2 months ago
Srangrang
0a967797a1
Add FP16 support for RISCV
4 months ago
Ruiyang Wu
1b0c0f00e9
CMake: Avoid mixed OpenMP linkage
6 months ago
Ruiyang Wu
02fd1df10b
CMake: Pass `OpenMP` compiler and linker flags through CMake targets
Using `OpenMP::OpenMP_LANG` targets for CMake is less error-prone than
passing the compiler and linker flags manually. Furthermore, it allows
the user to customize those flags by setting `OpenMP_LANG_FLAGS`,
`OpenMP_LANG_LIB_NAMES`, and `OpenMP_omp_LIBRARY`.
6 months ago
Vaisakh K V
f66ca05b31
Merge branch 'develop' into topic/sgemm_direct_sme1
7 months ago
Vaisakh K V
d23eb3b93e
Support for SME1 based sgemm_direct kernel for cblas_sgemm level 3 API
* Added ARMV9SME target
* Added SGEMM_DIRECT kernel based on SME1
10 months ago
Martin Kroeker
877d5a5be6
Add -O2 to flang flags when building on WoA in Release mode
7 months ago
gxw
1ebcbdbab3
LoongArch64: Fixed the issue of using the old-style TARGET in cmake builds
8 months ago
Martin Kroeker
d78fbe425c
Assume no underline suffixes on symbols when compiling with ifx on Windows
9 months ago
Matthew Thompson
c4e8bac5a5
Fix indent
10 months ago
Matthew Thompson
be19966d3b
Fixes for NAG CMake
10 months ago
Matthew Thompson
2eaf285de5
Use F_COMPILER name
10 months ago
Matthew Thompson
a8b1705dbd
CMake build has wrong PIC flag for NAG
10 months ago
Chip Kerchner
36bd3eeddf
Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power).
11 months ago
Martin Kroeker
b4495a8fb8
Merge branch 'develop' into arm64_cmake_small_matrix_opt
1 year ago
gxw
7087b0a7d0
ARM64: Enable SMALL_MATRIX_OPT when compiling with CMake
1 year ago
gxw
30af9278dc
LoongArch64: Enable cmake cross-compilation
1 year ago
psykose
1265eee85c
fix cmake typo for power10 cc version check
fixes 668f48f4fc
1 year ago
Martin Kroeker
cc36db643e
Support new LAPACK build option LAPACK_STRLEN
1 year ago
Martin Kroeker
e8bd97ab4b
add RISCV64 entries for DYNAMIC_ARCH
1 year ago
Martin Kroeker
9eecd0d33b
enable GEMM/GEMV forwarding for riscv and ppc
1 year ago
Chris Sidebottom
b26424c6a2
Allow opt into GEMM -> GEMV forwarding
1 year ago
yamazaki-mitsufumi
821ef34635
Add A64FX to the list of CPUs supported by DYNAMIC_ARCH
1 year ago
Jaap Aarts
cea4abcac0
Fix compiling on mingw
1 year ago
Martin Kroeker
a3f6b13bc9
remove spurious brace
1 year ago
Martin Kroeker
668f48f4fc
Use CMAKE_C_COMPILER_VERSION instead of dumpversion calls ( #4698 )
* Use CMAKE_C_COMPILER_VERSION throughout
1 year ago
Martin Kroeker
3d26837a35
Suppress GCC14 error exit in the f2c-converted LAPACK
1 year ago
مهدي شينون (Mehdi Chinoune)
cda55f2fd2
Don't pass `-exhaustive-register-search` directly to clang compiler
`-exhaustive-register-search` is an LLVM code generation flag that shouldn't be passed directly to clang compiler.
1 year ago
Martin Kroeker
52b71a1673
Filter out FFLAGS that flang-new from LLVM18 no longer supports ( #4569 )
* Filter out FFLAGS that flang-new from LLVM18 no longer supports
1 year ago
Martin Kroeker
a0e3f77e0b
add FIXED_LIBNAME, PREFIX and SUFFIX
1 year ago
Martin Kroeker
49689fbef7
Add support for compiling SVE kernels with the NVIDIA HPC compiler
2 years ago
Martin Kroeker
ac698cedad
Add compiler options for ARM64 SVE targets in DYNAMIC_ARCH builds
2 years ago
Martin Kroeker
d2144b2981
Add NVHPC
2 years ago
Martin Kroeker
de937b3194
Add clang option to avoid running out of registers in AVX512 assembly
2 years ago
Martin Kroeker
e964ebd0d0
Add compiler option for AVX512-capable Ryzen(4)
2 years ago
Martin Kroeker
a0a4f7c447
Add -mfma to -mavx2 for clang, and add AVX2 declaration for Zen in DYNAMIC_ARCH builds
3 years ago
Martin Kroeker
85fd3c4279
Support compilation with the Cray C and Fortran compilers ( #3712 )
* Add support for the Cray Fortran compiler
3 years ago
Martin Kroeker
18b19d135b
C_LAPACK: Fixes to make it compile with MSVC ( #3605 )
* Fix f2c-like support functions to compile with MSVC, and
re-enable C_LAPACK for MSVC in CMAKE
* Add MSVC&flang build to Azure CI in order to check C_LAPACK correctness
3 years ago
Martin Kroeker
b7873605d4
Use f2c translations of LAPACK when no Fortran compiler is available ( #3539 )
* Add C equivalents of the Fortran routines from Reference-LAPACK as fallbacks, and C_LAPACK variable to trigger their use
3 years ago
Rafael Cardoso Fernandes Sousa
d38110a5ce
Use CMake variables instead of as
3 years ago
Rafael Cardoso Fernandes Sousa
214fbcee15
Fix cmake for power
3 years ago
Markus Mützel
de2ed66596
cmake: Set SUFFIX64 also for NOFORTRAN
3 years ago
Wangyang Guo
3dc6052c7e
initial support for Sapphire Rapids platform
4 years ago
Martin Kroeker
e02df9fc55
Propagate BUILD_BFLOAT16 to CFLAGS
4 years ago