Jerome Robert
|
53ba1a77c8
|
ztrmv_L.c: no longer need a 4kB buffer
Fix #786
|
9 years ago |
Zhang Xianyi
|
8f758eeff9
|
Refs #786. avoid old assembly c/zgemv kernels.
|
9 years ago |
Zhang Xianyi
|
8577be2a95
|
Modify travis script.
|
9 years ago |
Zhang Xianyi
|
1edf30b790
|
Change Opteron(SSE3) to Opteron_SSE3 at dyanmaic core name.
|
9 years ago |
Zhang Xianyi
|
4fc8c937d4
|
Refs #695 add testcase.
|
9 years ago |
Zhang Xianyi
|
efa4f5c936
|
Refs #695 #783. Replace default x86_64 cgemv_t
asm kernel by C kernel.
|
9 years ago |
Zhang Xianyi
|
17d655fa64
|
Merge pull request #784 from peterph/develop
collected usage notes
|
9 years ago |
Petr Cerny
|
f68141cf1d
|
collected usage notes
|
9 years ago |
Zhang Xianyi
|
6b85dbb6dc
|
Refs #696. Turn off stack limit setting on Linux.
I cannot reproduce SEGFAULT of lapack-test with default stack size
on ARM Linux.
|
9 years ago |
Zhang Xianyi
|
74b0672223
|
Fix c/zaxpyc kernel bug on Cortex-A57.
|
9 years ago |
Zhang Xianyi
|
6e7be06e07
|
Refs JuliaLang/julia#5728. Fix gemv performance bug on Haswell Mac OSX.
On Mac OS X, it should use .align 4 (equal to .align 16 on Linux).
I didn't get the performance benefit from .align. Thus, I deleted it.
|
9 years ago |
Zhang Xianyi
|
a04d0555ba
|
[av skip] Fix utest makefile bug on travis ci.
|
9 years ago |
Zhang Xianyi
|
3761c30ba4
|
Fix makefile bug for utest.
|
9 years ago |
Zhang Xianyi
|
38593cd3a3
|
Fix compiling bug on ARM Cortex-A57.
|
9 years ago |
Zhang Xianyi
|
e3b7781c2b
|
Update readme.
|
9 years ago |
Zhang Xianyi
|
5e6965ea47
|
Run utest when building.
|
9 years ago |
Zhang Xianyi
|
5cc0301fc3
|
Enable utest for appveyor.
|
9 years ago |
Zhang Xianyi
|
19a6dedfd6
|
Add utest for CMake.
|
9 years ago |
Zhang Xianyi
|
0e2b92e216
|
Added mising lapacke files for CMake.
|
9 years ago |
Zhang Xianyi
|
d06b92906a
|
Add gemm3m building for CMake.
|
9 years ago |
Zhang Xianyi
|
8e98478ff3
|
Update ctest.h from github.com:xianyi/ctest.git.
|
9 years ago |
Zhang Xianyi
|
fb8968fb83
|
Refs #707. Bugfix for previous commit.
|
9 years ago |
Zhang Xianyi
|
dae6b82a71
|
Refs #707. Add BUILD_LAPACK_DEPRECATED flag in Makefile.rule.
If you want to build LAPACK deprecated functions since LAPACK 3.6.0
make BUILD_LAPACK_DEPRECATED=1
|
9 years ago |
Zhang Xianyi
|
d73244b825
|
Refs #727. Align stack buffer address on 32-bytes.
|
9 years ago |
Zhang Xianyi
|
233c6b959f
|
Merge pull request #780 from jeromerobert/bug727
Bug727
|
9 years ago |
Jerome Robert
|
16ec5323c9
|
Fix zgemv.c compilation when stack allocation is disabled
|
9 years ago |
Jerome Robert
|
0ad02ef2d6
|
update CONTRIBUTORS.md
|
9 years ago |
Jerome Robert
|
73397faf68
|
Add benchmark/smallscaling.c
* Bench small matrices with multi-threading
* Close #727
|
9 years ago |
Jerome Robert
|
5fc2203d8a
|
zgemv: Add a workaround for #746
|
9 years ago |
Jerome Robert
|
78dcf5c3d5
|
Improve performances of ztrmv on small matrices
* Use stack allocation
* Disable multi-threading
* Ref #727
|
9 years ago |
Jerome Robert
|
32f793195f
|
Use stack allocation in zgemv and zger
For better performance with small matrices
Ref #727
|
9 years ago |
Zhang Xianyi
|
be4e5fcd20
|
Fixed #778. Merge branch 'buffer51-develop' into develop
|
9 years ago |
buffer51
|
855e0cb700
|
Restored LAPACK_COMPLEX_STRUCTURE for Android prior to 21. Refs #682.
|
9 years ago |
buffer51
|
7f7d04dcd2
|
Fixed linking error when compiling ARMv7 for Android (disabled -lpthread and added -Wl,--no-warn-mismatch).
|
9 years ago |
buffer51
|
4e1b521e27
|
Fix lapack complex implementation of lauu2 and potf2 for Android (use FLOAT instead of FLOAT[2] as imaginary part is not used).
|
10 years ago |
Zhang Xianyi
|
a1a96589aa
|
Fixed #773 blas_quickdivide bug on CMake and Visual Studio x86 32-bit.
|
9 years ago |
Zhang Xianyi
|
0e68beb89f
|
Fixed #711, #698. Merge branch 'byzhang-develop' into develop
|
9 years ago |
Zhang Xianyi
|
926ba8b7ca
|
Merge branch 'develop' of https://github.com/byzhang/OpenBLAS into byzhang-develop
|
9 years ago |
Zhang Xianyi
|
9f080c47e1
|
Merge pull request #743 from tkelman/patch-1
re enable Fortran optimization flag on windows
|
9 years ago |
Zhang Xianyi
|
52eba814ce
|
Fixed #769. Merge branch 'martin-frbg-develop' into develop
|
9 years ago |
Martin Kroeker
|
935356c34f
|
Update dynamic.c and cpuid_x86.c for Intel Avoton.
Second part of "support Intel Avoton via Nehalem kernel"
|
9 years ago |
Zhang Xianyi
|
ff9388d625
|
Refs #768. Swap the result of zdot x87 fp kernel.
|
9 years ago |
Martin Kroeker
|
4f05c23673
|
Update cpuid_x86.c
Add recognition of Intel Atom C27xx (Avoton, model code 4D)
|
9 years ago |
Benyu Zhang
|
4a1263f609
|
Fix the source paths
|
9 years ago |
Zhang Xianyi
|
962376664d
|
Refs #768. Swap the result of zdot x87 fp kernel.
|
9 years ago |
Tony Kelman
|
5fef0d1b75
|
re enable Fortran optimization flag on windows
partial revert of 299cdcdc29
from #696, was not explained why that was needed
|
9 years ago |
Zhang Xianyi
|
578f471808
|
Fix utest bug when INTERFACE64=1.
|
9 years ago |
Zhang Xianyi
|
5a8447e97e
|
Use ctest.h for unit test. Enable unit test on travis CI.
|
9 years ago |
Zhang Xianyi
|
be95bdaf47
|
Detect ARMV8 on 32-bit mode by using ARMV7 kernels.
|
9 years ago |
Zhang Xianyi
|
c44ff4d648
|
Refs #714. avoid compiling warnings.
|
9 years ago |