118 Commits (a8b1705dbd39f079ecf120622fa889ecdd92ac04)

Author SHA1 Message Date
  Matthew Thompson a8b1705dbd CMake build has wrong PIC flag for NAG 10 months ago
  Chip Kerchner 36bd3eeddf Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power). 11 months ago
  Martin Kroeker b4495a8fb8
Merge branch 'develop' into arm64_cmake_small_matrix_opt 1 year ago
  gxw 7087b0a7d0 ARM64: Enable SMALL_MATRIX_OPT when compiling with CMake 1 year ago
  gxw 30af9278dc LoongArch64: Enable cmake cross-compilation 1 year ago
  psykose 1265eee85c fix cmake typo for power10 cc version check 1 year ago
  Martin Kroeker cc36db643e
Support new LAPACK build option LAPACK_STRLEN 1 year ago
  Martin Kroeker e8bd97ab4b
add RISCV64 entries for DYNAMIC_ARCH 1 year ago
  Martin Kroeker 9eecd0d33b
enable GEMM/GEMV forwarding for riscv and ppc 1 year ago
  Chris Sidebottom b26424c6a2 Allow opt into GEMM -> GEMV forwarding 1 year ago
  yamazaki-mitsufumi 821ef34635 Add A64FX to the list of CPUs supported by DYNAMIC_ARCH 1 year ago
  Jaap Aarts cea4abcac0
Fix compiling on mingw 1 year ago
  Martin Kroeker a3f6b13bc9
remove spurious brace 1 year ago
  Martin Kroeker 668f48f4fc
Use CMAKE_C_COMPILER_VERSION instead of dumpversion calls (#4698) 1 year ago
  Martin Kroeker 3d26837a35
Suppress GCC14 error exit in the f2c-converted LAPACK 1 year ago
  مهدي شينون (Mehdi Chinoune) cda55f2fd2
Don't pass `-exhaustive-register-search` directly to clang compiler 1 year ago
  Martin Kroeker 52b71a1673
Filter out FFLAGS that flang-new from LLVM18 no longer supports (#4569) 1 year ago
  Martin Kroeker a0e3f77e0b
add FIXED_LIBNAME, PREFIX and SUFFIX 1 year ago
  Martin Kroeker 49689fbef7
Add support for compiling SVE kernels with the NVIDIA HPC compiler 2 years ago
  Martin Kroeker ac698cedad
Add compiler options for ARM64 SVE targets in DYNAMIC_ARCH builds 2 years ago
  Martin Kroeker d2144b2981
Add NVHPC 2 years ago
  Martin Kroeker de937b3194
Add clang option to avoid running out of registers in AVX512 assembly 2 years ago
  Martin Kroeker e964ebd0d0
Add compiler option for AVX512-capable Ryzen(4) 2 years ago
  Martin Kroeker a0a4f7c447
Add -mfma to -mavx2 for clang, and add AVX2 declaration for Zen in DYNAMIC_ARCH builds 3 years ago
  Martin Kroeker 85fd3c4279
Support compilation with the Cray C and Fortran compilers (#3712) 3 years ago
  Martin Kroeker 18b19d135b
C_LAPACK: Fixes to make it compile with MSVC (#3605) 3 years ago
  Martin Kroeker b7873605d4
Use f2c translations of LAPACK when no Fortran compiler is available (#3539) 3 years ago
  Rafael Cardoso Fernandes Sousa d38110a5ce Use CMake variables instead of as 3 years ago
  Rafael Cardoso Fernandes Sousa 214fbcee15 Fix cmake for power 3 years ago
  Markus Mützel de2ed66596 cmake: Set SUFFIX64 also for NOFORTRAN 3 years ago
  Wangyang Guo 3dc6052c7e initial support for Sapphire Rapids platform 4 years ago
  Martin Kroeker e02df9fc55
Propagate BUILD_BFLOAT16 to CFLAGS 4 years ago
  Wangyang Guo 76ea8db4da Small Matrix: enable by default for x86_64 arch 4 years ago
  Wangyang Guo fee5abd84b Small Matrix: support cmake build 4 years ago
  Martin Kroeker 30f23be0f9
Rework setting of -mfma to only apply it where necessary 4 years ago
  User User-User 91e2b11d3c add to cmake listings too 4 years ago
  刘雨培 725432efaa pass NO_AVX512 macro def 4 years ago
  Martin Kroeker 33b5670122
Merge pull request #3096 from martin-frbg/fixclangcmake 4 years ago
  Martin Kroeker 95e19e2e23
fix case in compiler name check 4 years ago
  Martin Kroeker 99ac042702
remove spurious lines (probably editor malfunction) 4 years ago
  Martin Kroeker 774b9f8653
handle AppleClang in Cooperlake support condition 4 years ago
  Martin Kroeker eb1d2344f7
Fix compiler version check for Intel Cooperlake support (clang-cl does not accept -dumpversion) 4 years ago
  xoviat b60de4447a add cortex-m platform 4 years ago
  Martin Kroeker 438a8e5624
Fix placement of getarch call and spurious cpu property accumulation in DYNAMIC_ARCH builds 4 years ago
  Martin Kroeker 0155cd53a3
Add -msse3 where needed for DYNAMIC_ARCH builds 4 years ago
  Martin Kroeker b9bc76aec4
Add files via upload 4 years ago
  Martin Kroeker f64243ff57
Add compiler options for sse/sse2/ssse3/sse4.1 5 years ago
  Martin Kroeker e3a29f6b58
Change "HALF" and "sh" to "BFLOAT16" and "sb" 5 years ago
  Martin Kroeker 68e6823d36
Adapt for supporting only a subset of variable types 5 years ago
  Martin Kroeker e1b7123bbe
Merge pull request #2867 from Qiyu8/usimd-floatdot 5 years ago