Ashwin Sekhar T K
1d121852c1
Fix blas_lock for arm64
10 years ago
Ashwin Sekhar T K
39937d15cd
Change BUFFER_SIZE for Cortex A57 to 20 MB
Change the GEMM_P, GEMM_Q, GEMM_R values for Cortex A57
10 years ago
Zhang Xianyi
233ec2a1cc
Use 40 MB buffer for ARM Cortex A57.
10 years ago
Ashwin Sekhar T K
f2f8a0fe8b
Adding arm64 target CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Grazvydas Ignotas
abade3f896
really fix ARM64 locking
10 years ago
Grazvydas Ignotas
6b92204a7c
add fallback blas_lock implementation
to be used on armv5 and new platforms
10 years ago
Grazvydas Ignotas
e12cf1123e
add fallback rpcc implementation
- use on arm, arm64 and any new platform
- use faster integer math instead of double
- use similar scale as rdtsc so that timeouts work
10 years ago
Zhang Xianyi
3f1b57668e
Fix blas lock bug on AArch64.
10 years ago
Werner Saar
19b8fd2aed
smp lock bugfix
10 years ago
Zhang Xianyi
2fb02626da
Update organization info.
11 years ago
Benedikt Huber
58c90d5937
# The first commit's message is:
Optimizations for APM's xgene-1 (aarch64).
1) general system updates to support armv8 better. Make all did not work, one needed to supply TARGET=ARMV8.
2) sgem 4x4 kernel in assembler using SIMD, and configuration changes to use it.
3) strmm 4x4 kernel in C. Since the sgem kernel does 4x4, the trmm kernel must also do 4xN.
Added Dave Nuechterlein to the contributors list.
11 years ago
Timothy Gu
6c2ead30f0
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
11 years ago
wernsaar
fe5f46c330
added experimental support for ARMV8
12 years ago