Deeksha Goplani
0dc80a5c8d
locks improvement
1 year ago
Martin Kroeker
f0f1ff7820
fix HUGETLB allocation for TLS mode as well
1 year ago
Martin Kroeker
dc99b61380
sort unwanted interdependencies of alloc_shm and alloc_hugetlb
1 year ago
gxw
d8c4ea8793
loongarch: Optimizing the performance of the GEMM on servers
1 year ago
Martin Kroeker
d938aed7fe
reset "mem structure overflowed" state on shutdown
1 year ago
Martin Kroeker
90f890ee67
fix improper function prototypes (empty parentheses) (USE_TLS branch)
2 years ago
Martin Kroeker
c6b1d8e7a3
fix improper function prototypes (empty parentheses)
2 years ago
Martin Kroeker
7e939fb831
Fix handling of additional buffer structures in case of overflow
2 years ago
Tiziano Müller
6a611db560
memory: show correct number of max threads
2 years ago
Martin Kroeker
3326b924b3
remove status variable blas_num_threads_set; initialize openmp thread maximum on startup
2 years ago
Martin Kroeker
e5538a62cb
Add suggestions to NUM_THREADS/auxiliary buffer message
2 years ago
Martin Kroeker
e298d613fa
initialize status variable for openblas_set_num_threads
2 years ago
Martin Kroeker
e38ab079a0
Fix OpenMP thread counting returning places rather than cores
2 years ago
Martin Kroeker
69148ae795
Guard against sysconf returning zero processors
3 years ago
Martin Kroeker
b329e45288
Guard against omp_get_num_places returning zero
3 years ago
Martin Kroeker
c8d05aa7a5
Move the threads overflow flag under the protection of the local blas lock ( #3476 )
* Move accesses to the overflow flag into the scope of the blas lock
3 years ago
Martin Kroeker
4f057bffd6
Fix NULL pointer checks in blas_memory_alloc
3 years ago
Martin Kroeker
efb16fafb0
Fix miscounting of threadpool size on Linux with OMP_PROC_BIND=TRUE ( #3437 )
* return OMP places (if available, or SC_NPROCESSORS_CONF) for maximum thread count when built with OpenMP
3 years ago
Martin Kroeker
dd09f0173e
Remove extraneous qualifiers from struct definition
4 years ago
Martin Kroeker
cd10d1c03b
Fix typo
4 years ago
Martin Kroeker
2db1a99aca
Clean up debug messages
4 years ago
Martin Kroeker
89fc5b8f4f
Fix unmap logic
4 years ago
Martin Kroeker
7fd12a5e69
Add likely() hints for gcc
4 years ago
Martin Kroeker
2ba9a567aa
Fix typo
4 years ago
Martin Kroeker
b4b952eece
Add auxiliary tracking space for thread buffer frees too
4 years ago
Martin Kroeker
7d1becc575
Allocate an auxiliary struct when running out of preconfigured threads
4 years ago
Martin Kroeker
898212efcd
Actually add the message to the TLS section
4 years ago
Martin Kroeker
210a1584c5
Rebase source and edit TLS version of the message as well
4 years ago
Martin Kroeker
f2a7a67f5a
Improve the "tried to allocate too many buffers" error message
4 years ago
Craig Watson
4d7dfe4845
Include Haiku in processor count checks
4 years ago
River Dillon
2f6326a630
Remove <linux/unistd.h>
4 years ago
Martin Kroeker
1a3ad4b670
Fix signatures of the TLS-mode dll_callback and p_process_term functions for Win64
4 years ago
Martin Kroeker
b0bded3f2f
Fix get_num_procs() in the USE_TLS branch for non-glibc systems
4 years ago
Martin Kroeker
0cc36770f1
Merge pull request #3073 from xoviat/embedded
add embedded option
4 years ago
Alex Henrie
113840da12
Fix null pointer check in blas_memory_alloc
4 years ago
xoviat
2e8d6e8690
add functions for embedded
4 years ago
xoviat
b60de4447a
add cortex-m platform
4 years ago
Martin Kroeker
fd7da56965
Move definitions that are neither needed nor supported on SUNOS
5 years ago
Martin Kroeker
ac653c94f3
Merge branch 'develop' into issue2588-cmake
5 years ago
Alexander Grund
3c05f54df8
Avoid out of bounds access on invalid memory free
5 years ago
Alexander Grund
dee7c49938
Fix TABs and trailing space
5 years ago
Martin Kroeker
357bff06b5
Add BUILD_vartype defines
5 years ago
Martin Kroeker
09eb9d2584
Update conditional for atomics to HAVE_C11
5 years ago
Martin Kroeker
f4248af26e
Fix compiler warnings
5 years ago
Martin Kroeker
f41600e66f
Add a read barrier in the traversing of the buffer list
Needed on systems with weak memory ordering - the inferior, partially working fix from #2544 was already removed in #2551
5 years ago
Martin Kroeker
2a28448a96
Add safeguards for sufficient BUFFER_SIZE
5 years ago
Martin Kroeker
69f277f8ee
Add another memory barrier for ARM and a multicore test run on ThunderX to help detect such issues ( #2544 )
* Add another memory barrier in memory.c to prevent races in memory slot allocation
* Add an all-core test on Drone.io's ThunderX platform and modify dgemm_tester to use all 96 cores
5 years ago
Martin Kroeker
78100b8093
Free Windows thread memory with MEM_RELEASE rather than MEM_DECOMMIT
as suggested by hjmndv in #2370
5 years ago
Martin Kroeker
1b90989662
Add NetBSD to the xBSD conditionals
6 years ago
Martin Kroeker
1776ad82c0
Add files via upload
6 years ago