475 Commits (7c1839899e81829b096c62e73804d6859a0beed1)

Author SHA1 Message Date
  Martin Kroeker 7c1839899e
Increase assumed L2 sizes for RISCV X280 / ZVL256B and for SVE-capable ARM64 1 month ago
  Martin Kroeker c504aedca1
Merge pull request #5400 from Mousius/neoversev2-target 2 months ago
  Martin Kroeker 2f89a5970e
fix NeoverseV2 typo 2 months ago
  Chris Sidebottom 87247daadc Add NEOVERSEV2 target support 2 months ago
  Martin Kroeker a5b55f6fe3
remove CBLAS restriction on GEMM_GEMV forwarding 2 months ago
  Martin Kroeker 82954ba4ca
Update ?GEMM-to-?GEMV forwarding settings 2 months ago
  Martin Kroeker b24212f5df
fix numbers 2 months ago
  Martin Kroeker 6ff06f5483
Add cross-compilation data for RISCV64 targets 2 months ago
  Chris Sidebottom 947d7af4c9 Fix CMake references to bscal and bgemv 2 months ago
  Chris Sidebottom 72d2ebb4dd Re-add GEMV fallback for Level3 2 months ago
  Chris Sidebottom e105411460 Add infrastructure for bgemv/bscal 2 months ago
  Chris Sidebottom 66d9185ebe Fix CMake support 2 months ago
  Chris Sidebottom f95e7b0e32 Add infrastructure for BGEMM 3 months ago
  Chris Sidebottom 552e1c7a7a Correct compiler flags for NEOVERSEV1 target 2 months ago
  Usui, Tetsuzo 14107e37d9 Add parallel laed3 3 months ago
  Martin Kroeker 560fa88c96
Add cross-build parameters for Ampere One 3 months ago
  Martin Kroeker 55bb5ef867
Add compiler options for Ampere One 3 months ago
  Srangrang 0a967797a1 Add FP16 support for RISCV 4 months ago
  Martin Kroeker f2022c23ac
Remove sve capability from NeoverseN1 and specify CortexX2/A?10 as arm8.4a 4 months ago
  Martin Kroeker d9369bda1e
Update and amend parameters for Neoverse cpus 5 months ago
  Ruiyang Wu 1b0c0f00e9 CMake: Avoid mixed OpenMP linkage 6 months ago
  Ruiyang Wu 02fd1df10b CMake: Pass `OpenMP` compiler and linker flags through CMake targets 6 months ago
  Martin Kroeker b34235ca66
Fix inclusion of deprecated interfaces and cgesvdq/strsyl3 6 months ago
  Martin Kroeker f1fa370579
fix missing endif 7 months ago
  Martin Kroeker 6d1444be3a
Add ARM64 options for NVIDIA HPC 7 months ago
  Vaisakh K V f66ca05b31
Merge branch 'develop' into topic/sgemm_direct_sme1 7 months ago
  Vaisakh K V d23eb3b93e Support for SME1 based sgemm_direct kernel for cblas_sgemm level 3 API 10 months ago
  Martin Kroeker 877d5a5be6
Add -O2 to flang flags when building on WoA in Release mode 7 months ago
  Martin Kroeker 262018f14c
Merge pull request #5092 from XiWeiGu/la64_fixed_cmake 8 months ago
  Martin Kroeker 180ba5e7d0
Merge pull request #5069 from tingboliao/dev_rotm_20250107 8 months ago
  gxw 1ebcbdbab3 LoongArch64: Fixed the issue of using the old-style TARGET in cmake builds 8 months ago
  Martin Kroeker 111c9b0733
Add translations for C_COMPILER and OSNAME 8 months ago
  tingbo.liao 3c8df6358f Further rearranged the rotm kernel for the different architectures. 8 months ago
  Martin Kroeker fbf594b62f
Guard against empty CMAKE_Fortran_COMPILER_ID 9 months ago
  Martin Kroeker d78fbe425c
Assume no underline suffixes on symbols when compiling with ifx on Windows 9 months ago
  Martin Kroeker 30188a55d1
Don't assume underlined symbols for ifx; make cpuid.S inclusion conditional 9 months ago
  Martin Kroeker 32319a33ac
Add options for Intel oneAPI 2025.0 ifx on Windows 9 months ago
  Matthew Thompson c4e8bac5a5 Fix indent 10 months ago
  Matthew Thompson be19966d3b Fixes for NAG CMake 10 months ago
  Matthew Thompson 2eaf285de5 Use F_COMPILER name 10 months ago
  Matthew Thompson a8b1705dbd CMake build has wrong PIC flag for NAG 10 months ago
  Martin Kroeker 57a51d74c9
translate CMAKE_SYSTEM_NAME in compilations on or for IOS 10 months ago
  Martin Kroeker cea9df3643
Update Cray compiler options and calling convention 10 months ago
  Chip Kerchner 36bd3eeddf Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power). 11 months ago
  Martin Kroeker b0346e72f4
update names of loongarch64 targets for cross-compilation 1 year ago
  Martin Kroeker 9c707dc6b9
Update dynamic arch list to new target scheme 1 year ago
  Martin Kroeker b4495a8fb8
Merge branch 'develop' into arm64_cmake_small_matrix_opt 1 year ago
  Martin Kroeker 4f00f02567
Do not add -mabi flags for Loongson when the compiler is flang 1 year ago
  Martin Kroeker de421b7764
Merge pull request #4904 from XiWeiGu/la64_cross_cmake 1 year ago
  Martin Kroeker 0228d36211
move -fopenmp to CFLAGS 1 year ago