Martin Kroeker
48f5a89f92
Merge pull request #2282 from martin-frbg/issue2281
Optimize RPCC function on ARM64
6 years ago
Martin Kroeker
b687fba5bc
Disable direct clock register access on IOS and Android
as I find conflicting information on accessibility from non-priviledged processes
6 years ago
Martin Kroeker
5f6206fa2d
Simplify OSX/IOS cross-compilation and add a CI test for it ( #2279 )
* Add automatic fixups for OSX/IOS cross-compilation
* Add OSX/IOS cross-compilation test to Travis CI
* Handle platforms that lack hwcap.h by falling back to ARMV8
* Fix PROLOGUE for OSX/IOS
6 years ago
Martin Kroeker
f2cde2ccfb
Update common_arm64.h
6 years ago
Martin Kroeker
bb5413863f
Rewrite ARM64 PROLOGUE to make it compatible with xcode/ios
6 years ago
Paul Osmialowski
42bbe74791
build: LLVM: Add Flang compiler support and enable OpenMP for Clang
Signed-off-by: Paul Osmialowski <pawel.osmialowski@arm.com>
8 years ago
Ashwin Sekhar T K
1d121852c1
Fix blas_lock for arm64
10 years ago
Ashwin Sekhar T K
39937d15cd
Change BUFFER_SIZE for Cortex A57 to 20 MB
Change the GEMM_P, GEMM_Q, GEMM_R values for Cortex A57
10 years ago
Zhang Xianyi
233ec2a1cc
Use 40 MB buffer for ARM Cortex A57.
10 years ago
Ashwin Sekhar T K
f2f8a0fe8b
Adding arm64 target CORTEXA57
Co-Authored-By: Ralph Campbell <ralph.campbell@broadcom.com>
10 years ago
Grazvydas Ignotas
abade3f896
really fix ARM64 locking
10 years ago
Grazvydas Ignotas
6b92204a7c
add fallback blas_lock implementation
to be used on armv5 and new platforms
10 years ago
Grazvydas Ignotas
e12cf1123e
add fallback rpcc implementation
- use on arm, arm64 and any new platform
- use faster integer math instead of double
- use similar scale as rdtsc so that timeouts work
10 years ago
Zhang Xianyi
3f1b57668e
Fix blas lock bug on AArch64.
10 years ago
Werner Saar
19b8fd2aed
smp lock bugfix
10 years ago
Zhang Xianyi
2fb02626da
Update organization info.
11 years ago
Benedikt Huber
58c90d5937
# The first commit's message is:
Optimizations for APM's xgene-1 (aarch64).
1) general system updates to support armv8 better. Make all did not work, one needed to supply TARGET=ARMV8.
2) sgem 4x4 kernel in assembler using SIMD, and configuration changes to use it.
3) strmm 4x4 kernel in C. Since the sgem kernel does 4x4, the trmm kernel must also do 4xN.
Added Dave Nuechterlein to the contributors list.
11 years ago
Timothy Gu
6c2ead30f0
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
11 years ago
wernsaar
fe5f46c330
added experimental support for ARMV8
12 years ago