Maximilien Levesque
770fac92eb
Correct typo /proc/ instead of /pros/
10 years ago
Zhang Xianyi
e95d64333a
Refs #519 . Avoid calling strncpy.
10 years ago
Zhang Xianyi
75c40bcc48
Refs #520 . Fixed ONLY_CBLAS=1 compiling bug on OSX.
10 years ago
Zhang Xianyi
b62f9f4120
Merge pull request #518 from ton/issue-508
Fix issue #508
10 years ago
Ton van den Heuvel
b6438dedea
Fix issue #508
Fix race condition during shutdown causing a crash in
gotoblas_set_affinity().
10 years ago
Zhang Xianyi
cdefdb21cd
Refs #492 . Fixed c/zsyr bug with negative incx.
10 years ago
Zhang Xianyi
ea7f9dacf4
Refs #509 . Fixed geadd building bug with DYNAMIC_ARCH=1.
10 years ago
Zhang Xianyi
bf5dbb7e2a
Refs#509. Merge branch 'grisuthedragon-develop' into develop
10 years ago
Martin Koehler
39cc6b21d3
Add ATLAS-style ?geadd function
10 years ago
Zhang Xianyi
771b18ae9c
Detect the wrong combined flags of USE_OPENMP=1 and USE_THREAD=0.
10 years ago
Zhang Xianyi
cfa9392ffa
Fix openblas_get_num_threads and openblas_get_num_procs bug with single thread.
10 years ago
Zhang Xianyi
1ccd57ce80
Merge pull request #497 from eschnett/develop
Introduce openblas_get_num_threads and openblas_get_num_procs
10 years ago
Erik Schnetter
65a847cd36
Introduce openblas_get_num_threads and openblas_get_num_procs
10 years ago
Zhang Xianyi
07ff001981
Merge pull request #495 from jeromerobert/develop
Fix a segfault in gemv when MAX_STACK_ALLOC is set
10 years ago
Jerome Robert
b17ccb4c5c
Fix a segfault in gemv when MAX_STACK_ALLOC is set
* stack_alloc_size is needed after the implementation call
but it may be overwritten if it's optimized to a register,
because some gemv implementation (ex: dgemv_n.S) do not
restore all register (ex: r10).
* do the same in ger.c for the same reasons even if the bug
has not been observed.
10 years ago
Zhang Xianyi
63c6fcfa0a
Merge pull request #490 from eschnett/develop
Move #include statements outside extern "C" blocks
10 years ago
Erik Schnetter
29cb47fc06
Move #include statements outside extern "C" blocks
10 years ago
Zhang Xianyi
4e6c4046f7
Fix cortex-a15 detecting bug.
10 years ago
Zhang Xianyi
229ce2ccd1
Add cortex-a9 and cortex-a15 targets.
10 years ago
Zhang Xianyi
ef75be0e51
Merge pull request #487 from kortschak/dromtg-test
Add test for drotmg bug fixed by 692b14c
10 years ago
kortschak
5344f335a8
Add test for drotmg bug fixed by 692b14c
Test requested in issue xianyi/OpenBLAS#484 .
Run tests by applying the following change and then make:
diff --git a/Makefile.rule b/Makefile.rule
index bea1fe1..9852ff3 100644
--- a/Makefile.rule
+++ b/Makefile.rule
@@ -140,7 +140,7 @@ NO_AFFINITY = 1
-# UTEST_CHECK = 1
+UTEST_CHECK = 1
10 years ago
Zhang Xianyi
5cb5af9333
Add configuration options.
10 years ago
Zhang Xianyi
41aad0407f
Merge pull request #482 from jeromerobert/develop
Allow to do gemv and ger buffer allocation on the stack
10 years ago
Zhang Xianyi
f8f2e84659
Merge pull request #486 from wernsaar/develop
Optimizations for steamroller
10 years ago
Werner Saar
34633fef01
Merge branch 'develop' of github.com:wernsaar/OpenBLAS into develop
10 years ago
Werner Saar
ddf983d643
added optimizations for steamroller
10 years ago
Zhang Xianyi
17b9db20f1
Merge pull request #483 from wernsaar/develop
added Steamroller as a cpu target
10 years ago
Werner Saar
0dc559ed30
bugfix in dynamic.c
10 years ago
Werner Saar
9566f5fdb0
added Steamroller as a target processor
10 years ago
Werner Saar
4319769b79
added target processor STEAMROLLER
10 years ago
Jerome Robert
e9d9a8eae3
Allow to do gemv and ger buffer allocation on the stack
ger and gemv call blas_memory_alloc/free which in their turn
call blas_lock. blas_lock create thread contention when matrices
are small and the number of thread is high enough. We avoid
call blas_memory_alloc by replacing it with stack allocation.
This can be enabled with:
make -DMAX_STACK_ALLOC=2048
The given size (in byte) must be high enough to avoid thread contention
and small enough to avoid stack overflow.
Fix #478
10 years ago
Zhang Xianyi
cbb3ab80e7
Merge pull request #481 from eschnett/develop
Correct ilaver C declaration
10 years ago
Erik Schnetter
cd9868b1b4
Correct ilaver C declaration
10 years ago
Zhang Xianyi
eb738148fe
Merge pull request #479 from wernsaar/develop
workaround for sandybridge zgemm kernel
10 years ago
Werner Saar
587e16fba3
Ref #458 : Backport, sandybrigde uses nehalem zgemm kernel
10 years ago
Werner Saar
4de7b9ae47
increased NMAX to 128
10 years ago
Werner Saar
887aed634d
modified sources for OS Darwin
10 years ago
Werner Saar
6261342de3
small optimization on dgemm_kernel for N=1
10 years ago
Werner Saar
1e566223ed
added code for the size of n
10 years ago
Werner Saar
113b48ca22
modified makefile for acml6.1
10 years ago
Zhang Xianyi
3e81c99b6b
Fixed installation bug on Mac OSX.
10 years ago
Werner Saar
ec85c4a51d
Increased the Threshold value in sep.in
10 years ago
Werner Saar
97de657d38
added tests to sep.as as workaround for gfortran-4.8.x
10 years ago
Zhang Xianyi
71966eba6c
Merge pull request #475 from xantares/patch-2
add OpenBLAS_VERSION to cmake config file
10 years ago
Zhang Xianyi
a359979e17
Merge pull request #474 from xantares/patch-1
set OPENBLAS_CMAKE_DIR to <prefix>/lib/cmake/<package_name>
10 years ago
xantares
7a6a141bc4
add OpenBLAS_VERSION to cmake config file
10 years ago
xantares
b8ff6892f6
set OPENBLAS_CMAKE_DIR to <prefix>/lib/cmake/<package_name>
usually these files are more often located in this subdir
10 years ago
Zhang Xianyi
8fe7a9ce6f
Merge pull request #473 from wernsaar/develop
changed inline assembler labels to short form
10 years ago
Werner Saar
bc5fff7085
changed inline assembler labels to short form
10 years ago
Zhang Xianyi
1943ea91a8
Merge branch 'develop' of github.com:xianyi/OpenBLAS into develop
10 years ago