613 Commits (d96daa220de9e5c9d8d69f332a6fa550181c7f7e)

Author SHA1 Message Date
  Martin Kroeker d96daa220d
Merge pull request #5290 from Srangrang/develop 3 months ago
  Martin Kroeker e541bf68f5
support AmpereOne/OneA as NeoverseN1 3 months ago
  gkdddd 670ec6f757 Added shgemm_kernel_8x8 for RISCV64_ZVL128B and shgemm_kernel_16x8 for RISCV64_ZVL256B 4 months ago
  Martin Kroeker 5141a90993
Fix ARMV9SME target in DYNAMIC_ARCH and add SME query code for MacOS (#5222) 4 months ago
  Ruiyang Wu 02fd1df10b CMake: Pass `OpenMP` compiler and linker flags through CMake targets 6 months ago
  Martin Kroeker 39eb43d441
Improve thread safety of pthreads builds that rely on C11 atomic operations for locking (#5170) 6 months ago
  Martin Kroeker 1533fe49be
Merge pull request #5144 from taoye9/dispatch_neoversve2_to_neoversven2 7 months ago
  Ye Tao f0bea79a6e dispatch NEOVERSEV2 to NEOVERSEN2 under dynamic setting 7 months ago
  Vaisakh K V f66ca05b31
Merge branch 'develop' into topic/sgemm_direct_sme1 7 months ago
  Vaisakh K V d23eb3b93e Support for SME1 based sgemm_direct kernel for cblas_sgemm level 3 API 10 months ago
  Martin Kroeker a182251284
fix typo 9 months ago
  Martin Kroeker ed95791618
fix conflicting variables 9 months ago
  Martin Kroeker 3c3d1c4849
Identify all cores and select the most performant one as TARGET 9 months ago
  Ralf Gommers 765ad8bcd2 Fix guard around `alloc_hugetlb`, fixes compile warning 9 months ago
  Ralf Gommers 48caf2303d Fix build warning about discarding volatile qualifier in memory.c 9 months ago
  Martin Kroeker 4060dd43e3
Add dummy implementations of openblas_get/set_affinity 10 months ago
  Martin Kroeker de421b7764
Merge pull request #4904 from XiWeiGu/la64_cross_cmake 1 year ago
  gxw 30af9278dc LoongArch64: Enable cmake cross-compilation 1 year ago
  gxw 48698b2b1d LoongArch64: Rename core 1 year ago
  Martin Kroeker 3ee9e9d8d0
Merge pull request #4879 from martin-frbg/issue4868-2 1 year ago
  Martin Kroeker a8d6b0219a
Merge pull request #4877 from XiWeiGu/fixed_undefined_blas_set_parameter 1 year ago
  Martin Kroeker d24b3cf393
properly fix buffer allocation and assignment 1 year ago
  gxw fd033467ac Fixed the undefined reference to blas_set_parameter 1 year ago
  Martin Kroeker 23b5d66a86
Ensure a memory buffer has been allocated for each thread before invoking it 1 year ago
  Martin Kroeker 753c7ebe17
Merge pull request #4835 from martin-frbg/revertwin4359 1 year ago
  Martin Kroeker 50397e017a
Merge pull request #4838 from martin-frbg/fix4662-3 1 year ago
  Martin Kroeker 5257f807a9
fix invalid ifdef syntax in HUGETLB handling 1 year ago
  Martin Kroeker 2aed90171a
Add riscv sources for DYNAMIC_ARCH 1 year ago
  Martin Kroeker 6468dc1142
restore the coarse locking of the pre-4359 version 1 year ago
  yamazaki-mitsufumi 821ef34635 Add A64FX to the list of CPUs supported by DYNAMIC_ARCH 1 year ago
  Martin Kroeker a815594fd1
Merge pull request #4801 from markdryan/markdryan/riscv-dynamic-arch 1 year ago
  Martin Kroeker a373d0f107
Improve the error message for thread creation failure 1 year ago
  Mark Ryan 3b715e6162 Add autodetection for riscv64 1 year ago
  Martin Kroeker d0b9948b23
Guard against invalid thread_status.queue 1 year ago
  Martin Kroeker 9b2a0c79cb
Add Zhaoxin KX7000 1 year ago
  Deeksha Goplani 0dc80a5c8d locks improvement 1 year ago
  Martin Kroeker 8da6f7e5f2
Merge pull request #4686 from XiWeiGu/loongarch64_dgemm_kernel_16x6 1 year ago
  gxw 637c650f4f loongarch64: Add buffer offset for target LOONGSON3R5 1 year ago
  Martin Kroeker 5500b4ab26
Merge pull request #4680 from theAeon/develop 1 year ago
  Martin Kroeker f0f1ff7820
fix HUGETLB allocation for TLS mode as well 1 year ago
  Andrew Robbins edfe1aa471
Expose whether locking is enabled in get_config 1 year ago
  Martin Kroeker dc99b61380
sort unwanted interdependencies of alloc_shm and alloc_hugetlb 1 year ago
  Martin Kroeker ddcd7d6fa8
Merge branch 'develop' into Threading_Callback 1 year ago
  gxw d8c4ea8793 loongarch: Optimizing the performance of the GEMM on servers 1 year ago
  shivammonaka 7102367fde Introduced callback to Pthread, Win32 and OpenMP backend 1 year ago
  Mark Seminatore b0ad8a78ff code to fix lost work in case of re-entrant calls to exec_blas_async() 1 year ago
  Martin Kroeker 88b5330ae7
Restore outer loop of blas_buffer_inuse setup 1 year ago
  shivammonaka d49ebc54e1 Merge branch 'shivam-develop' into shivam-Locks 1 year ago
  shivammonaka bc191015e3 Using OpenMP locks with NUM_PARALLEL 1 year ago
  Mark Seminatore b29fd48998
Merge branch 'develop' into win_tidy 1 year ago