Grazvydas Ignotas
|
6b92204a7c
|
add fallback blas_lock implementation
to be used on armv5 and new platforms
|
10 years ago |
Grazvydas Ignotas
|
e12cf1123e
|
add fallback rpcc implementation
- use on arm, arm64 and any new platform
- use faster integer math instead of double
- use similar scale as rdtsc so that timeouts work
|
10 years ago |
Zhang Xianyi
|
2fb02626da
|
Update organization info.
|
11 years ago |
Timothy Gu
|
6c2ead30f0
|
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
|
11 years ago |
Zhang Xianyi
|
16eb780e13
|
Refs #262. Fixed compatibility issues of GNU stack markings with PathScale EKOPath(tm) Compiler Suite: Version 4.0.12.1
|
12 years ago |
Zhang Xianyi
|
a2930664f4
|
Refs #262. Added executable stack markings.
|
12 years ago |
Xianyi Zhang
|
6958c1a1aa
|
Fixed the SEGFAULT bug with Loongcc and Loongson3.
|
12 years ago |
Wang Qian
|
c2dad58ad1
|
Adding n32 multiple threads condition.
|
14 years ago |
Xianyi Zhang
|
285e69e2d1
|
Disable using simple thread level3 to fix a bug on Loongson 3B.
|
14 years ago |
Xianyi Zhang
|
d1baf14a64
|
Enable thread affinity on Loongson 3B. Fixed the bug of reading cycle counter.
In Loongson 3A and 3B, the CPU core increases the counter in every 2 cycles by default.
|
14 years ago |
Xianyi Zhang
|
b95ad4cfaf
|
Support detecting ICT Loongson-3B CPU.
|
14 years ago |
traz
|
7fa3d23dd9
|
Complete cgemm function, but no optimization.
|
14 years ago |
traz
|
cb0214787b
|
Modify compile options.
|
14 years ago |
Xianyi Zhang
|
921e040b15
|
Changed default page size to 16KB on Loongson 3A.
|
14 years ago |
Xianyi Zhang
|
c0b5992fab
|
added axpy kernel with prefetch for Loongson3A. To-Do: tuning prefetch distance & instruction order.
|
14 years ago |
Xianyi Zhang
|
376677452f
|
Modified the unsupported instruction on Loongson3A. Closed #1 OpenBLAS could run on Loongson3A now.
|
14 years ago |
Xianyi Zhang
|
342bbc3871
|
Import GotoBLAS2 1.13 BSD version codes.
|
14 years ago |