Werner Saar
|
f615dc7603
|
added optimized saxpy kernel for steamroller
|
10 years ago |
Werner Saar
|
331c417637
|
optimized saxpy for piledriver
|
10 years ago |
Zhang Xianyi
|
6c3a0b5d46
|
Enable MAX_STACK_ALLOC by default.
|
10 years ago |
Zhang Xianyi
|
fd9fd42936
|
Refs #478, #482. Fixed bug on previous commit.
|
10 years ago |
Zhang Xianyi
|
9798481979
|
Refs #478, #482. Fix segfault bug for gemv_t with MAX_STACK_ALLOC flag.
For gemv_t, directly use malloc to create the buffer.
|
10 years ago |
Werner Saar
|
d7a17ad85d
|
optimized sdot-kernel for piledriver
|
10 years ago |
Werner Saar
|
d35f6c63c2
|
add optimized daxpy-kernel for steamroller
|
10 years ago |
Werner Saar
|
166d76e864
|
added optimized sdot-kernel for steamroller
|
10 years ago |
Werner Saar
|
f9f127d838
|
added optimized ddot kernel for steamroller
|
10 years ago |
wernsaar
|
62231ab337
|
Merge pull request #538 from wernsaar/develop
Added optimized cdot- and zdot-kernels
|
10 years ago |
Werner Saar
|
3119def9a7
|
updated cdot and zdot
|
10 years ago |
Werner Saar
|
33b332372a
|
add optimized cdot- and zdot-kernel for sandybridge
|
10 years ago |
Werner Saar
|
fd838c75bc
|
add optimized cdot- and zdot-kernel for haswell
|
10 years ago |
Werner Saar
|
b57a60dac8
|
updated cdot and zdot for piledriver
|
10 years ago |
Werner Saar
|
5c51163972
|
added optimized cdot- and zdot-kernel for steamroller
|
10 years ago |
Werner Saar
|
9299d8cfd6
|
added optimized cdot- and zdot-kernels for bulldozer
|
10 years ago |
Zhang Xianyi
|
0a3d3b945d
|
Refs #535. Fix the wrong vector instruction in sgemm sandy bridge kernel.
|
10 years ago |
Zhang Xianyi
|
4f680a7d61
|
Merge pull request #534 from wernsaar/develop
Refs #533. added optimized saxpy- and daxpy-kernel for haswell and sandybridge
|
10 years ago |
Werner Saar
|
ba926e807c
|
added cdot- and zdot benchmark
|
10 years ago |
Werner Saar
|
60c6dec6e6
|
updated some lines for bulldozer
|
10 years ago |
Werner Saar
|
47898cca35
|
added optimized saxpy- and daxpy-kernel for sandybridge
|
10 years ago |
Werner Saar
|
53bb924287
|
added optimized saxpy- and daxpy-kernel for haswell
|
10 years ago |
Zhang Xianyi
|
1e80b8b0d3
|
Merge pull request #531 from wernsaar/develop
added optimized sdot- and ddot-kernels for Haswell and Sandybridge
|
10 years ago |
Werner Saar
|
a901b065d3
|
added optimized ddot-kernel for sandybridge
|
10 years ago |
Werner Saar
|
3937e2a0a0
|
add optimized sdot-kernel for sandybridge
|
10 years ago |
Werner Saar
|
9707d608d5
|
removed double definition line
|
10 years ago |
Werner Saar
|
701b9d7556
|
added optimized sdot- and ddot-kernel for HASWELL
|
10 years ago |
Zhang Xianyi
|
8977b3f235
|
Refs #529. Support Intel Broadwell by Haswell kernels.
|
10 years ago |
Zhang Xianyi
|
f6426395ea
|
Merge pull request #527 from xantares/patch-1
fix mingw install
|
10 years ago |
xantares
|
0ac787eefe
|
fix mingw install
|
10 years ago |
Zhang Xianyi
|
e5b96e55a7
|
Fix build bug for ARM64.
|
10 years ago |
Zhang Xianyi
|
a3491e1e88
|
Update the doc for 0.2.14.
|
10 years ago |
Zhang Xianyi
|
e81a5d61e4
|
Merge branch 'develop' of github.com:xianyi/OpenBLAS into develop
|
10 years ago |
Zhang Xianyi
|
c674fa32be
|
Add ARM targets.
|
10 years ago |
Zhang Xianyi
|
e34911a73d
|
Fix compiling bug for ARM with setting BINARY.
|
10 years ago |
Zhang Xianyi
|
76dcaf2281
|
Merge pull request #521 from maxlevesque/patch-1
Correct typo /proc/ instead of /pros/
|
10 years ago |
Maximilien Levesque
|
770fac92eb
|
Correct typo /proc/ instead of /pros/
|
10 years ago |
Zhang Xianyi
|
e95d64333a
|
Refs #519. Avoid calling strncpy.
|
10 years ago |
Zhang Xianyi
|
75c40bcc48
|
Refs #520. Fixed ONLY_CBLAS=1 compiling bug on OSX.
|
10 years ago |
Zhang Xianyi
|
b62f9f4120
|
Merge pull request #518 from ton/issue-508
Fix issue #508
|
10 years ago |
Ton van den Heuvel
|
b6438dedea
|
Fix issue #508
Fix race condition during shutdown causing a crash in
gotoblas_set_affinity().
|
10 years ago |
Zhang Xianyi
|
cdefdb21cd
|
Refs #492. Fixed c/zsyr bug with negative incx.
|
10 years ago |
Zhang Xianyi
|
ea7f9dacf4
|
Refs #509. Fixed geadd building bug with DYNAMIC_ARCH=1.
|
10 years ago |
Zhang Xianyi
|
bf5dbb7e2a
|
Refs #509. Merge branch 'grisuthedragon-develop' into develop
|
10 years ago |
Martin Koehler
|
39cc6b21d3
|
Add ATLAS-style ?geadd function
|
10 years ago |
Zhang Xianyi
|
771b18ae9c
|
Detect the wrong combined flags of USE_OPENMP=1 and USE_THREAD=0.
|
10 years ago |
Zhang Xianyi
|
cfa9392ffa
|
Fix openblas_get_num_threads and openblas_get_num_procs bug with single thread.
|
10 years ago |
Zhang Xianyi
|
1ccd57ce80
|
Merge pull request #497 from eschnett/develop
Introduce openblas_get_num_threads and openblas_get_num_procs
|
10 years ago |
Erik Schnetter
|
65a847cd36
|
Introduce openblas_get_num_threads and openblas_get_num_procs
|
10 years ago |
Zhang Xianyi
|
07ff001981
|
Merge pull request #495 from jeromerobert/develop
Fix a segfault in gemv when MAX_STACK_ALLOC is set
|
10 years ago |