Zhang Xianyi
c44ff4d648
Refs #714 . avoid compiling warnings.
9 years ago
Werner Saar
63a7d7fb24
updated gemv_n_vfpv3.S for armv7
9 years ago
Werner Saar
b4ede558a5
updated nrm2 kernel for armv7
9 years ago
Werner Saar
de3e2d4349
updated trmm kernels for armv7
9 years ago
Werner Saar
a0e51e96f1
updated gemm kernels for armv7
9 years ago
Werner Saar
c2891330bc
updated KERNEL.ARMV6
9 years ago
Werner Saar
ceaa931e48
updated gemv kernel for armv6
9 years ago
Werner Saar
eaa63165df
updated cgemv and zgemv kernels for armv6
9 years ago
Werner Saar
c65357c566
updated trmm_kernels for armv6
9 years ago
Werner Saar
e63e9f9f26
updated gemm_kernels for armv6
9 years ago
Werner Saar
aafd3ab60e
updated cdot and zdot on arm
9 years ago
Werner Saar
d2f84c9c8a
Ref #740 : updated nrm2_vfp.S
9 years ago
Werner Saar
ca32253f32
Ref #740 : updated asum_vfp.S and iamax_vfp.S
9 years ago
Werner Saar
9066d1f982
Ref #750 and Ref #740 : bugfix for sdot, dsdot and ddot on arm
9 years ago
Werner Saar
692d9c881c
Ref #740 : simple solution to clear floating point register on arm
9 years ago
Zhang Xianyi
3602a2cd1f
#736 Revert #733 patch to fix bus error on ARM.
9 years ago
Zhang Xianyi
e3e20e2242
Merge pull request #733 from yuyichao/arm-asm
Do not use vsub to clear the register values
9 years ago
Yichao Yu
594b9f4c73
Do not use vsub to clear the register values since it doesn't work with non-normal numbers.
9 years ago
Werner Saar
c8f2c5d636
added optimized trsm_kernels
9 years ago
Ashwin Sekhar T K
318f0949c3
lapack-test fixes in nrm2 kernels for Cortex A57
10 years ago
Ashwin Sekhar T K
98965da2e8
lapack-test fixes for Cortex A57
10 years ago
Ashwin Sekhar T K
c99c43d51e
Optimized trmm kernels for CORTEXA57
10 years ago
Ashwin Sekhar T K
1397b47197
Optimized zgemm kernel for CORTEXA57
10 years ago
Ashwin Sekhar T K
45f78963ac
Optimized cgemm kernel for CORTEXA57
Also, add a generic ztrmm 4x4 kernel
10 years ago
Ashwin Sekhar T K
402443bf9c
Optimized dgemm kernel for CORTEXA57
10 years ago
Ashwin Sekhar T K
19fdbee291
Improve the sgemm kernel for CORTEXA57
10 years ago
Ashwin Sekhar T K
3b0cdfab1e
Optimized gemv kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Ashwin Sekhar T K
46efa6a1da
Optimized swap kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Ashwin Sekhar T K
ea1465cdf8
Optimized scal kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Ashwin Sekhar T K
fb4be3b3eb
Optimized rot kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Ashwin Sekhar T K
6c2f4ddbcd
Optimized nrm2 kernels for CORTEXA57
10 years ago
Ashwin Sekhar T K
870c4d49c0
Optimized dot kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Ashwin Sekhar T K
cd7684097c
Optimized copy kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Ashwin Sekhar T K
2690b71b1f
Optimized axpy kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Ashwin Sekhar T K
3e4acedf0e
Optimized asum kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Ashwin Sekhar T K
2610752dbb
Optimized iamax kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Ashwin Sekhar T K
dbb213655e
Optimized amax kernels for CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Ashwin Sekhar T K
f2f8a0fe8b
Adding arm64 target CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Ralph Campbell
c053559ed9
Minor C code fixes in kernel/arm
10 years ago
Ralph Campbell
55e4332f00
Remove duplicate -D args in kernel/Makefile.L1
10 years ago
Zhang Xianyi
69363622a8
Fix DYNAMIC_ARCH=1 bug.
10 years ago
Zhang Xianyi
53b6023a6c
Fix cmake bug on MSVC 32-bit.
10 years ago
Zhang Xianyi
309875de3c
Fix cmake bug on x86 32-bit.
e.g. Build 32-bit on 64-bit Linux.
cmake -DBINARY=32
10 years ago
Zhang Xianyi
8fade093aa
Fixed cmake bug on Visual Studio.
10 years ago
Zhang Xianyi
96f0bbe067
Fixed cmake bug on haswell.
10 years ago
Zhang Xianyi
d8392c1245
Fixe cmake config bugs.
10 years ago
Zhang Xianyi
94b125255f
Merge branch 'develop' into cmake
Conflicts:
driver/others/memory.c
10 years ago
Martin Koehler
711ca33bc6
Improved Ximatcopy when lda==ldb.
The Ximatcopy functions create a copy of the input matrix
although they seem to work inplace. The new routines
XIMATCOPY_K_YY perform the operations inplace if the leading
dimension does not change.
10 years ago
Zhang Xianyi
7df0820160
Use C kernels for s/dgemv on x86.
10 years ago
Zhang Xianyi
f874465bb8
Use cmake to build OpenBLAS GENERIC Target on MSVC x86 64-bit.
Disable CBLAS and LAPACK.
10 years ago