Zhang Xianyi
|
937493bfeb
|
Release 0.2.16 rc1
|
9 years ago |
Zhang Xianyi
|
74b0672223
|
Fix c/zaxpyc kernel bug on Cortex-A57.
|
9 years ago |
Zhang Xianyi
|
6e7be06e07
|
Refs JuliaLang/julia#5728. Fix gemv performance bug on Haswell Mac OSX.
On Mac OS X, it should use .align 4 (equal to .align 16 on Linux).
I didn't get the performance benefit from .align. Thus, I deleted it.
|
9 years ago |
Zhang Xianyi
|
a04d0555ba
|
[av skip] Fix utest makefile bug on travis ci.
|
9 years ago |
Zhang Xianyi
|
3761c30ba4
|
Fix makefile bug for utest.
|
9 years ago |
Zhang Xianyi
|
38593cd3a3
|
Fix compiling bug on ARM Cortex-A57.
|
9 years ago |
Zhang Xianyi
|
e3b7781c2b
|
Update readme.
|
9 years ago |
Zhang Xianyi
|
5e6965ea47
|
Run utest when building.
|
9 years ago |
Zhang Xianyi
|
5cc0301fc3
|
Enable utest for appveyor.
|
9 years ago |
Zhang Xianyi
|
19a6dedfd6
|
Add utest for CMake.
|
9 years ago |
Zhang Xianyi
|
0e2b92e216
|
Added mising lapacke files for CMake.
|
9 years ago |
Zhang Xianyi
|
d06b92906a
|
Add gemm3m building for CMake.
|
9 years ago |
Zhang Xianyi
|
8e98478ff3
|
Update ctest.h from github.com:xianyi/ctest.git.
|
9 years ago |
Zhang Xianyi
|
fb8968fb83
|
Refs #707. Bugfix for previous commit.
|
9 years ago |
Zhang Xianyi
|
dae6b82a71
|
Refs #707. Add BUILD_LAPACK_DEPRECATED flag in Makefile.rule.
If you want to build LAPACK deprecated functions since LAPACK 3.6.0
make BUILD_LAPACK_DEPRECATED=1
|
9 years ago |
Zhang Xianyi
|
d73244b825
|
Refs #727. Align stack buffer address on 32-bytes.
|
9 years ago |
Zhang Xianyi
|
233c6b959f
|
Merge pull request #780 from jeromerobert/bug727
Bug727
|
9 years ago |
Jerome Robert
|
16ec5323c9
|
Fix zgemv.c compilation when stack allocation is disabled
|
9 years ago |
Jerome Robert
|
0ad02ef2d6
|
update CONTRIBUTORS.md
|
9 years ago |
Jerome Robert
|
73397faf68
|
Add benchmark/smallscaling.c
* Bench small matrices with multi-threading
* Close #727
|
9 years ago |
Jerome Robert
|
5fc2203d8a
|
zgemv: Add a workaround for #746
|
9 years ago |
Jerome Robert
|
78dcf5c3d5
|
Improve performances of ztrmv on small matrices
* Use stack allocation
* Disable multi-threading
* Ref #727
|
9 years ago |
Jerome Robert
|
32f793195f
|
Use stack allocation in zgemv and zger
For better performance with small matrices
Ref #727
|
9 years ago |
Zhang Xianyi
|
be4e5fcd20
|
Fixed #778. Merge branch 'buffer51-develop' into develop
|
9 years ago |
buffer51
|
855e0cb700
|
Restored LAPACK_COMPLEX_STRUCTURE for Android prior to 21. Refs #682.
|
9 years ago |
buffer51
|
7f7d04dcd2
|
Fixed linking error when compiling ARMv7 for Android (disabled -lpthread and added -Wl,--no-warn-mismatch).
|
9 years ago |
buffer51
|
4e1b521e27
|
Fix lapack complex implementation of lauu2 and potf2 for Android (use FLOAT instead of FLOAT[2] as imaginary part is not used).
|
10 years ago |
Zhang Xianyi
|
a1a96589aa
|
Fixed #773 blas_quickdivide bug on CMake and Visual Studio x86 32-bit.
|
9 years ago |
Zhang Xianyi
|
0e68beb89f
|
Fixed #711, #698. Merge branch 'byzhang-develop' into develop
|
9 years ago |
Zhang Xianyi
|
926ba8b7ca
|
Merge branch 'develop' of https://github.com/byzhang/OpenBLAS into byzhang-develop
|
9 years ago |
Zhang Xianyi
|
9f080c47e1
|
Merge pull request #743 from tkelman/patch-1
re enable Fortran optimization flag on windows
|
9 years ago |
Zhang Xianyi
|
52eba814ce
|
Fixed #769. Merge branch 'martin-frbg-develop' into develop
|
9 years ago |
Martin Kroeker
|
935356c34f
|
Update dynamic.c and cpuid_x86.c for Intel Avoton.
Second part of "support Intel Avoton via Nehalem kernel"
|
9 years ago |
Zhang Xianyi
|
ff9388d625
|
Refs #768. Swap the result of zdot x87 fp kernel.
|
9 years ago |
Martin Kroeker
|
4f05c23673
|
Update cpuid_x86.c
Add recognition of Intel Atom C27xx (Avoton, model code 4D)
|
9 years ago |
Benyu Zhang
|
4a1263f609
|
Fix the source paths
|
9 years ago |
Zhang Xianyi
|
962376664d
|
Refs #768. Swap the result of zdot x87 fp kernel.
|
9 years ago |
Tony Kelman
|
5fef0d1b75
|
re enable Fortran optimization flag on windows
partial revert of 299cdcdc29
from #696, was not explained why that was needed
|
9 years ago |
Zhang Xianyi
|
578f471808
|
Fix utest bug when INTERFACE64=1.
|
9 years ago |
Zhang Xianyi
|
5a8447e97e
|
Use ctest.h for unit test. Enable unit test on travis CI.
|
9 years ago |
Zhang Xianyi
|
be95bdaf47
|
Detect ARMV8 on 32-bit mode by using ARMV7 kernels.
|
9 years ago |
Zhang Xianyi
|
c44ff4d648
|
Refs #714. avoid compiling warnings.
|
9 years ago |
Zhang Xianyi
|
e003a1294c
|
Merge pull request #764 from martin-frbg/develop
Update Makefile.system to fix awk/nawk issue #763
|
9 years ago |
Martin Kroeker
|
44062517eb
|
Update Makefile.system
Define AWK as "nawk" for SunOS (actually Illumos) only - fixes #763
|
9 years ago |
Zhang Xianyi
|
13f0f8c10e
|
Refs #723. Avoid out of boundary for getf2.
|
9 years ago |
Zhang Xianyi
|
f5df444ceb
|
Merge pull request #762 from jeromerobert/bug760
Let openblas_get_num_threads return the number of active threads
|
9 years ago |
Zhang Xianyi
|
e382713423
|
Merge pull request #759 from jeromerobert/bug742
Bug742
|
9 years ago |
Zhang Xianyi
|
aaa8551c57
|
Merge pull request #749 from lotheac/illumos_fixes
illumos fixes
|
9 years ago |
Jerome Robert
|
0d87c1ffb6
|
Let openblas_get_num_threads return the number of active threads
... not the number of allocated threads.
Close #760
|
9 years ago |
wernsaar
|
0b194426f8
|
Merge pull request #761 from wernsaar/develop
Ref #740: all assembly codes now clear floating point register correctly
|
9 years ago |