Martin Kroeker
0256294921
Fix syntax mixup
4 years ago
Martin Kroeker
60e1fddca7
Ensure that the same (large) BUFFERSIZE is used for all cpus in DYNAMIC_ARCH builds
4 years ago
Martin Kroeker
1d4c96fa0c
Increase BUFFERSIZE further
5 years ago
Martin Kroeker
ee90f30384
Increase BUFFERSIZE for POWER8-10 and use same value for POWER6
to fix overflow warning for PWR8 ZGEMM and PWR9 C/ZGEMM and avoid size mismatches in DYNAMIC_ARCH
5 years ago
Martin Kroeker
c9d32674ea
Add memory barrier to the blas_lock implementation for Linux
as recommended by cparrott73 in #2760
5 years ago
Rajalakshmi Srinivasaraghavan
9fe930f205
powerpc: Add support for future processor
This is the initial patch to support build infrastructure
for POWER10 architecture.
5 years ago
Martin Kroeker
3d4db4d002
Add read barrier definition
5 years ago
Martin Kroeker
1a6ea8ee6d
Merge pull request #2338 from kavanabhat/aix_mod
Changes to build on AIX in POWER8 mode
5 years ago
Kavana Bhat
6baa9b07d7
AIX changes for Power8
5 years ago
Martin Kroeker
6fa89b06a1
Use the two-operand form of DCBT on all PPC970 regardless of OS
There seems to be no advantage to the three-operand form used in the earliest GotoBLAS kernels, and it causes compilation problems on platforms other than the previously special-cased ones as well.
5 years ago
Kavana Bhat
3dc6b26eff
AIX changes for Power8
6 years ago
pkubaj
5a4f1a2118
Fix build for PPC970 on FreeBSD pt. 1
FreeBSD needs DCBT_ARG=0 as well.
6 years ago
Piotr Kubaj
eebfeba768
Fix build on FreeBSD/powerpc64.
Signed-off-by: Piotr Kubaj <pkubaj@anongoth.pl>
6 years ago
Martin Kroeker
7c51cc8527
Merge branch 'develop' into develop
6 years ago
AbdelRauf
853a18bc17
power9 makefile. dgemm based on the power8 kernel with the following changes: 32x unrolled 16x4 kernel and 8x4 kernel using (lxv stxv butterfly rank1 update). Improvement from 17 to 22-23 GFLOPS. dtrmm cases were added into dgemm itself.
6 years ago
Ayappan P
b043a5962e
AIX asm syntax changes needed for shared object creation
6 years ago
ken-cunningham-webuse
f7a06463d9
common_power.h: force DCBT_ARG 0 on PPC970 Darwin
without this, we see
../kernel/power/gemv_n.S:427:Parameter syntax error
and many more similar entries
that relate to this assembly command
dcbt 8, r24, r18
this change sets DCBT_ARG = 0
and OpenBLAS builds through to completion on PowerMac 970
Tests pass
6 years ago
Werner Saar
8310d4d3f7
optimized dgemm for 20 threads
9 years ago
Werner Saar
9276c9012f
Optimized sgemm and dgemm and tested again.
9 years ago
Werner Saar
9c42f0374a
Updated cgemm- and sgemm-kernel for POWER8 SMP
9 years ago
Werner Saar
cc26d888b8
BUGFIX: increased BUFFER_SIZE for POWER8
9 years ago
Werner Saar
b752858d6c
added dgemm-, dtrmm-, zgemm- and ztrmm-kernel for power8
9 years ago
Grazvydas Ignotas
6b92204a7c
add fallback blas_lock implementation
to be used on armv5 and new platforms
10 years ago
Grazvydas Ignotas
e12cf1123e
add fallback rpcc implementation
- use on arm, arm64 and any new platform
- use faster integer math instead of double
- use similar scale as rdtsc so that timeouts work
10 years ago
Matthew Brandyberry
7ba4fe5afb
ppc64le platform support (ELF ABI v2)
10 years ago
Timothy Gu
6c2ead30f0
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
11 years ago
Xianyi Zhang
342bbc3871
Import GotoBLAS2 1.13 BSD version codes.
14 years ago