wernsaar
22e5aee2dd
fixed zgemv bug for older AMD Processors
11 years ago
wernsaar
35d37e124f
bugfix for barcelona zgemv-kernel
11 years ago
wernsaar
d8ba46efdb
bugfix for bulldozer cgemm-, zgemm- and zgemv-kernel
11 years ago
wernsaar
a15f22a1f6
bugfix for piledriver cgemm-, zgemm- and zgemv-kernel
11 years ago
wernsaar
b94ea89f52
bugfix for haswell cgemm- and zgemm-kernel
11 years ago
wernsaar
35f668bb14
bugfix for cgemm_kernel_8x2_sandy.S
11 years ago
Timothy Gu
6c2ead30f0
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
11 years ago
wernsaar
365e8de346
added optimized cgemm-kernel for SANDYBRIDGE
11 years ago
wernsaar
578d1b6219
added DSDOT definition and enabled optimized sdot kernel
11 years ago
wernsaar
dabab2b5f4
added new optimized sgemm kernel for SANDYBRIDGE
11 years ago
wernsaar
aa2709c4e0
enabled optimized dgemm kernel for NEHALEM
11 years ago
wernsaar
a13bcc1716
enabled optimized sgemv kernel for barcelona and piledriver
11 years ago
wernsaar
d2c82d7543
enabled optimized sgemv kernel for HASWELL
11 years ago
wernsaar
0517672dd0
enabled optimized sgemv kernels for nehalem, sandybridge and bulldozer
11 years ago
wernsaar
23203d52c1
Ref #380 : lowered stack usage for haswell kernels
11 years ago
wernsaar
73545a79cd
Ref #380 : lowered stack usage for piledriver and bulldozer kernels
11 years ago
wernsaar
ff9cfca24c
Ref #385 : added missing return instruction
11 years ago
wernsaar
cee257f384
Ref #51 : added blas extensions zomatcopy and comatcopy
11 years ago
wernsaar
7bfb3011e8
Ref #51 : added blas extension somatcopy
11 years ago
wernsaar
8c8f596238
Ref #51 : added blas extension domatcopy as a non-optimized reference
11 years ago
wernsaar
faf3ac0aad
Ref #285 : added axpby kernels
11 years ago
Zhang Xianyi
406f5bd22b
Merge branch 'develop' of https://github.com/wernsaar/OpenBLAS into wernsaar-develop
Conflicts:
kernel/arm/KERNEL.ARMV6
11 years ago
wernsaar
aaddb05411
bugfix for ARMV6
11 years ago
wernsaar
e826a5a6af
some modifications regarding lapack test
11 years ago
wernsaar
c38379c9dd
bugfixes for ARM regarding lapack tests
11 years ago
wernsaar
a0b07c1440
bugfixes for ARM regarding lapack tests
11 years ago
wernsaar
43fbdb7a5a
added ARMV5 as reference platform
11 years ago
wernsaar
777cebc8c7
added ZERO check to zscal.c because bug in lapack-testing
11 years ago
wernsaar
aa5c73e20f
added ZERO check to zscal.c because bug in lapack-test
11 years ago
wernsaar
5e5ef28ca0
added ZERO check because bug in lapack-test
11 years ago
wernsaar
650ed34336
added ZERO check because bug in lapack-test
11 years ago
wernsaar
5f3b68b4d4
replaced sgemm and cgemm kernels because lapack bugs
11 years ago
wernsaar
2424af62fd
replaced dgemm-kernel because bug in lapack
11 years ago
wernsaar
793509a3b5
replaced files for sdot, sgemv_n and sgemv_t for bug #348
11 years ago
wernsaar
47b22763f8
reduced stack usage on windows to 16K
11 years ago
wernsaar
9db0fb8b02
bugfix for sdsdot
11 years ago
wernsaar
f9daebba0a
checked in bugfixes for ARM
11 years ago
Zhang Xianyi
9a557e90da
Refs #340 . Fixed SEGFAULT bug of dgemv_n on OSX.
11 years ago
wangqian
2d557eb1e0
Fixed computational error of dgemv_n.
11 years ago
Zhang Xianyi
05bb391c3a
Refs #330 . Fixed the compatibility issue with clang on Mac OSX.
11 years ago
Zhang Xianyi
9b5be29886
Refs #310 . Fixed Segfault bug on nehalem when Julia calls dgeqrt3 on OSX.
Please also check JuliaLang/julia#4099
Julia test script:
A=rand(256, 256)
qrfact(A)
I found this was a bug in kernel/x86_64/dgemm_ncopy_8.S.
However, I cannot use gdb with julia. Thus, this is a workaround fix.
11 years ago
wernsaar
53eaf41901
added support for HASWELL
12 years ago
wernsaar
9423f980f6
modified trsm kernel
12 years ago
wernsaar
c6156b2ef2
added trsm kernels from origin
12 years ago
wernsaar
034a5b2083
modified zsymv
12 years ago
wernsaar
27d4234d4d
merged symv
12 years ago
wernsaar
402d6e91db
Merge remote branch 'origin/develop' into armv7
12 years ago
wernsaar
b3254eecaf
Merge remote branch 'origin/haswell' into develop
12 years ago
wernsaar
d910404f00
Merge remote branch 'origin/piledriver' into develop
12 years ago
wernsaar
ffe70b1fdc
modified Makefile.L3
12 years ago