Martin Kroeker
376048156b
Use in-place transform shortcut only if matrix is square
8 years ago
Martin Kroeker
d1c5b8f913
Add files via upload
8 years ago
Martin Kroeker
91bde7d315
Exchange rows and cols in final omatcopy with BlasTrans
This is MicMuc's patch from #899
8 years ago
Martin Kroeker
1e06b49854
Update xerbla.c
8 years ago
Martin Kroeker
7f546f54fa
Add cblas_xerbla
8 years ago
Martin Kroeker
a809431e34
Add cblas_xerbla()
8 years ago
Andrew
99880f7906
Address unlikely memleak in zimatcopy interface (#1129)
* fix unlikely memleak in zimatcopy interface
* fix only unlikely memleak in zimatcopy interface
8 years ago
Martin Kroeker
211d2eceb5
Update zdot.c
8 years ago
Martin Kroeker
5813ed095b
Update zdot.c
8 years ago
Martin Kroeker
e44b028fe5
Replace gnu _real_, _imag_ extensions in initializers
8 years ago
Ashwin Sekhar T K
071a830e8b
THUNDERX2T99: Add optimized S/D/C/Z SWAP Implementations
8 years ago
Werner Saar
dd6212e684
updated some level1 functions that are not thread safe
8 years ago
jiahaipeng
84b8170bfb
Adding multi-threading for copy, dot, rot, and asum functions
8 years ago
Werner Saar
ae4ac6f984
removed obj-files, that are moved to lapack 3.7.0
8 years ago
Jerome Robert
d346c533b1
Fix z/ctrmv stack allocation on AMD bulldozer and barcelona target
* Hopefully, because this was found by error and trial (dark magic)
* Ref #786
9 years ago
Werner Saar
f04af36ad0
Merge pull request #898 from wernsaar/develop
added experimental support for optimized lapack fortran functions
9 years ago
Werner Saar
41000c8443
added directory for optimized lapack Fortran codes and added dlaqr5.f
9 years ago
John Biddiscombe
053044ae4d
Replace CMAKE_SOURCE_DIR/CMAKE_BINARY_DIR with PROJECT_SOURCE_DIR/PROJECT_BINARY_DIR
If OpenBLAS is built using add_subdirectory(OpenBlas) as part of another project
then the paths set by CMAKE_XXX_DIR are relative to the parent project
and not the OpenBLAS project.
9 years ago
Jerome Robert
40af513669
Disable multi-threading in swap
* Close #873
9 years ago
Jerome Robert
16ec5323c9
Fix zgemv.c compilation when stack allocation is disabled
9 years ago
Jerome Robert
5fc2203d8a
zgemv: Add a workaround for #746
9 years ago
Jerome Robert
78dcf5c3d5
Improve performance of ztrmv on small matrices
* Use stack allocation
* Disable multi-threading
* Ref #727
9 years ago
Jerome Robert
32f793195f
Use stack allocation in zgemv and zger
For better performance with small matrices
Ref #727
9 years ago
Jerome Robert
1fe3aab047
Use GEMM_MULTITHREAD_THRESHOLD as a number of ops
...not a matrix size. For GEMM_MULTITHREAD_THRESHOLD=4
(the default value) this does not change anything, but
for other values it makes the GEMM and GEMV thresholds
change in the same way.
Close #742
9 years ago
Jerome Robert
1a1935507b
[z]ger: increase multithread threshold
The ones given in 3ae30cd
were far too low because I
mixed up m and m*n in my measurements. Note that the new ones
are close to the [z]gemv ones, which is reassuring evidence
that both are right.
9 years ago
Jerome Robert
66eafb16cf
swap: disable multi-threading for small matrices
Close #731
9 years ago
Jerome Robert
3ae30cd6b9
Disable multi-threading for small matrices in [z]ger
Ref #731
9 years ago
Jerome Robert
87a2ccc37c
Factorize MAX_STACK_ALLOC code to common_stackalloc.h
Ref #727
9 years ago
Jerome Robert
f9890a6452
Fix compilation when MAX_STACK_ALLOC is not set
Close #722
9 years ago
Zhang Xianyi
285d042b10
Fixed rotg bug on ARM.
9 years ago
Zhang Xianyi
640cccc2b1
Refs #697. Fixed gemv bug for Windows.
Thanks to matzeri for the patch.
9 years ago
Ralph Campbell
55a0b27c01
Minor C code fixes in interface/
10 years ago
Zhang Xianyi
2feef49fa8
Merge branch 'develop' into cmake
Conflicts:
driver/others/memory.c
10 years ago
Zhang Xianyi
5a291606ad
Refs #671. The return value of i?max cannot be larger than N.
10 years ago
Zhang Xianyi
8fade093aa
Fixed cmake bug on Visual Studio.
10 years ago
Zhang Xianyi
94b125255f
Merge branch 'develop' into cmake
Conflicts:
driver/others/memory.c
10 years ago
Zhang Xianyi
baec8f5cac
Refs #638. Fixed a compilation bug with clang on Mac OS X.
10 years ago
Martin Koehler
711ca33bc6
Improved Ximatcopy when lda==ldb.
The Ximatcopy functions create a copy of the input matrix
although they seem to work in place. The new routines
XIMATCOPY_K_YY perform the operations in place if the leading
dimension does not change.
10 years ago
Zhang Xianyi
f874465bb8
Use cmake to build OpenBLAS GENERIC Target on MSVC x86 64-bit.
Disable CBLAS and LAPACK.
10 years ago
Zhang Xianyi
dcd5ba4443
Merge branch 'cmake' of https://github.com/hpanderson/OpenBLAS into hpanderson_cmake
10 years ago
Werner Saar
f8f2e261fe
use only 1 thread if m or n < 2*GEMM_MULTITHREAD_THRESHOLD
10 years ago
Jerome Robert
ab567d8443
gemv: Ensure stack buffer is large enough to handle memory alignment
Ref #478
10 years ago
Zhang Xianyi
847e19c04e
Refs #478, #482. Enable stack alloc for s/dgemv_t (revert 9798491).
10 years ago
Zhang Xianyi
fd9fd42936
Refs #478, #482. Fixed a bug in the previous commit.
10 years ago
Zhang Xianyi
9798481979
Refs #478, #482. Fix segfault bug for gemv_t with MAX_STACK_ALLOC flag.
For gemv_t, directly use malloc to create the buffer.
10 years ago
Zhang Xianyi
cdefdb21cd
Refs #492. Fixed c/zsyr bug with negative incx.
10 years ago
Hank Anderson
0d8e227ea7
Changed strategy for setting preprocessor definitions.
Instead of generating separate object files for each permutation of
defines for a source file, GenerateNamedObjects now writes an entirely
new source file and inserts the defines as #define c statements.
This solves a problem I ran into with ar.exe where it was refusing to
link objects that had the same filename despite having different paths.
10 years ago
Hank Anderson
b2284647a3
More complex objects.
10 years ago
Hank Anderson
a6116e5859
Added some more complex-only objects.
10 years ago
Hank Anderson
67e39bd8fb
Added mangled complex filenames to interface and lapack CMakeLists.txt.
10 years ago