Martin Kroeker
d636b418af
Merge pull request #1504 from ararslan/aa/openbsd
Allow building on OpenBSD
7 years ago
Alex Arslan
a41d241a0e
Add support for DragonFly BSD
7 years ago
Alex Arslan
8da6b6ae52
Allow building on OpenBSD
With this change, OpenBLAS builds and all tests pass on OpenBSD 6.2
using Clang. Tested on x86-64 only, with and without DYNAMIC_ARCH=1.
7 years ago
Martin Kroeker
01c4b82f04
Update memory.c
7 years ago
Martin Kroeker
93db123f7e
Update memory.c
7 years ago
Martin Kroeker
752fdb5dd8
Add workaround for old gcc and clang versions
Old gcc and clang do not handle constructor arguments, finally fix #875 as discussed there, using the fedora patch
7 years ago
Martin Kroeker
7646974227
Limit the additional locking from PRs 1052,1299 to non-OpenMP multithreading
7 years ago
Martin Kroeker
8866e393a2
Revert "Add locks only for non-OPENMP multithreading"
7 years ago
Martin Kroeker
3119b2ab4c
Add locks only for non-OPENMP multithreading
to migitate performance problems caused by #1052 and #1299 as seen in #1461
7 years ago
Erik M. Bray
8f5f614615
On Cygwin use mmap instead of Windows native allocation functions, which are not fork-safe.
7 years ago
Erik M. Bray
f5fc109fbd
Perform blas_thread_shutdown with pthread_atfork() on Cygwin
Even if we're directly using the win32 threading driver and not pthreads,
pthread_atfork still works fine to register a pre-fork handler, and is
necessary to restore the threading server to a pre-initialized state.
7 years ago
Martin Kroeker
e388459a27
Merge pull request #1419 from brada4/develop
Initialize unitialized values for repeated calls
7 years ago
Andrew
e5752ff9b3
take out unused variables
7 years ago
Andrew
8a0b086b28
add missing bracket for old glibc (cppcheck)
7 years ago
Martin Kroeker
42285d8e70
Merge pull request #1410 from brada4/develop
Address warnings #1357
7 years ago
Andrew
8aafa0473c
address last warnings as seen by gcc7
7 years ago
Andrew
11a627c54e
remove surplus parentheses to silence clang5
7 years ago
Martin Kroeker
cc9500db41
Merge pull request #1403 from brada4/develop
Address few more warnings
7 years ago
Andrew
bfc2a88594
remove unused buffer
7 years ago
Martin Kroeker
177b78c8b4
Issue1388 ( #1389 )
* Calculation of chunk range limits was ignoring num_cpu
bug introduced by me in #1262 - should fix #1388
* Calculation of range limits was ignoring num_cpu
bug introduced by me in #1262
* Calculation of chunk range limits was ignoring num_cpu
bug introduced by me in #1262
* Calculation of chunk range limits was ignoring num_cpu
bug introduced by me in #1262
* Calculation of chunk range limits was ignoring num_cpu
bug introduced by me in #1262
* Calculation of chunk range limits was ignoring num_cpu
bug introduced by me in #1262
7 years ago
Andrew
281a2b952f
warning cleanup ( #1380 )
* dead increments in driver/level2
* dead increments in kernel/generic
* part dead increments in kernel/x86_64
7 years ago
Martin Kroeker
c49c6b237d
Merge pull request #1382 from martin-frbg/dtrmv-1332
Work around errors in multithreaded dtrmv
7 years ago
Martin Kroeker
28ae3ca76f
Limit MAX_CPU to 1024 for now
Some Linux distributions (notably SuSE) have raised CPU_SETSIZE to 4096, apparently disregarding API limitations.
From #1348 , the highest value to survive array initialization (on a desktop system) is 3232, and 1024 - which is the
more usual CPU_SETSIZE limit, was demonstrated to work fine on an actual bignuma system.
7 years ago
Martin Kroeker
b414283f48
Disable gemv unrolling
as a (hopefully temporary) workaround for #1332
7 years ago
Andrew
ef95cd471f
elminate unread variable, after reiteration 3 of them (clang4)
7 years ago
Andrew
e14d50d86e
eliminate Wunused-const gcc7 warning
7 years ago
Martin Kroeker
07e7c36dac
Handle shmem init failures in cpu affinity setup code
Failures to obtain or attach shared memory segments would lead to an exit without explanation of the exact cause.
This change introduces a more verbose error message and tries to make the code continue without setting cpu affinity.
Fixes #1351
7 years ago
Martin Kroeker
2a6fef9a55
Try to handle shmget or shmat failing
also replaces one verbatim sched_yield with the YIELDING macro for consistency as suggested in #1351
8 years ago
Martin Kroeker
db72ad8f6a
Merge pull request #1320 from timmoon10/develop
2D thread distribution for multi-threaded GEMMs
8 years ago
Martin Kroeker
514d237257
Merge pull request #1279 from xsacha/develop
CMake improvements
8 years ago
Tim Moon
30486a356c
Reduce number of data partitions in n.
8 years ago
Tim Moon
9de52b489a
Cleaning up and documenting multi-threaded GEMM code.
8 years ago
Tim Moon
860dcfc703
Use 2D thread distribution for small GEMMs.
Allows maximum use of available cores if one of M and N is small and the other is large.
8 years ago
Tim Moon
6aaa107865
Reducing threads for multi-threaded GEMMs on small matrices.
8 years ago
Martin Kroeker
ba1f91f17b
Convert another caller of "allocation" to LOCK_COMMAND
... as the "allocation" code jumped to now does UNLOCK_COMMAND instead of blas_unlock
8 years ago
Martin Kroeker
f460776f0f
Fix thread data races
8 years ago
Martin Kroeker
e882f3d6f3
Fix thread data race in memory.c
8 years ago
Sacha Refshauge
37858d1146
Fix threading usage in CMake: s/SMP/USE_THREAD/
8 years ago
Isuru Fernando
2f12ea017b
No strncasecmp with MSVC
8 years ago
Martin Kroeker
719fcc56b0
Merge pull request #1262 from martin-frbg/xmv_thread-splitting
Make sure that range limit of last thread never exceeds data size
8 years ago
Martin Kroeker
ebb04e3265
Merge pull request #1259 from isuruf/cmake
CMake Improvements
8 years ago
Martin Kroeker
0ba64cee60
Update trmv_thread.c
8 years ago
Martin Kroeker
c4e5ba1bfe
Make sure that range_n of last thread never exceeds the actual data size when splitting the workload
8 years ago
Martin Kroeker
a6f533b248
Revert "Fix calculated range limit exceeding actual data size for last thread"
8 years ago
Isuru Fernando
d245caa49a
Support out-of-source build
8 years ago
Martin Kroeker
e70a6b92bf
Merge pull request #1257 from martin-frbg/cgroups-prereq
Rework __GLIBC_PREREQ checks to avoid breaking non-glibc builds
8 years ago
Martin Kroeker
63cfa32691
Rework __GLIBC_PREREQ checks to avoid breaking non-glibc builds
8 years ago
Martin Kroeker
585c0010a5
Fix range limit exceeding actual data size in last step
8 years ago
Martin Kroeker
857f61bc5d
Fix range limit exceeding data size in last step
8 years ago
Martin Kroeker
9332042d5f
Fix range exceeding actual data size in quick_divide
8 years ago