Zhang Xianyi
f27942a68a
Fixed make TARGET=CORTEXA9 and CORTEXA15 bug.
10 years ago
Zhang Xianyi
0cc2b3de0b
Merge pull request #652 from larsmans/fixes
Tiny fixes
10 years ago
Lars Buitinck
b9534bbd76
git ignore versioned .so files
10 years ago
Lars Buitinck
45c8b5e756
actually remove cblas_noconst.h
This file hasn't been used since 212463dce9
.
10 years ago
Zhang Xianyi
a96a4cb012
Merge pull request #640 from kortschak/dlansy-fix
Fix LAPACK_*lansy routines
10 years ago
Zhang Xianyi
baec8f5cac
Refs #638 . Fixed compiling bug with clang on Mac OS X.
10 years ago
kortschak
d6e8459f20
Fix LAPACK_*lansy routines
Fixes #639 .
10 years ago
Zhang Xianyi
dfe1eef33b
Merge branch 'yuyichao-skylake-id' into develop
10 years ago
Zhang Xianyi
cc7cab8a45
Detect other Intel Skylake cores.
http://users.atw.hu/instlatx64/
10 years ago
Yichao Yu
61ae47eb99
Ref #632 . Support Intel Skylake by Haswell kernels.
10 years ago
Zhang Xianyi
22353b1727
Merge pull request #634 from kortschak/lantr-trans-prep
Fix lantr preparation for row major matrices
10 years ago
kortschak
efffd28739
Fix lantr preparation for row major matrices
10 years ago
Zhang Xianyi
62cabef857
Merge pull request #633 from grisuthedragon/tune_imatcopy
Improved Ximatcopy when lda==ldb.
10 years ago
Martin Koehler
711ca33bc6
Improved Ximatcopy when lda==ldb.
The Ximatcopy functions create a copy of the input matrix
although they seem to work inplace. The new routines
XIMATCOPY_K_YY perform the operations inplace if the leading
dimension does not change.
10 years ago
Zhang Xianyi
40a3fed6b8
Merge pull request #630 from buffer51/develop
Fixed error in common.h for Android compilation introduced by e12cf11
10 years ago
buffer51
2297a2d989
Fixed error in common.h for Android compilation introduced by e12cf1123e
10 years ago
Zhang Xianyi
5408074941
Add notification.
10 years ago
Zhang Xianyi
bbcdf63bb4
Merge branch 'develop' of github.com:xianyi/OpenBLAS into develop
10 years ago
Zhang Xianyi
43eabab62f
Merge pull request #619 from gitter-badger/gitter-badge
Add a Gitter chat badge to README.md
10 years ago
The Gitter Badger
50901943fd
Added Gitter badge
10 years ago
Zhang Xianyi
7df0820160
Use C kernels for s/dgemv on x86.
10 years ago
Zhang Xianyi
4b7381b7a4
Merge pull request #617 from notaz/arm_fixes
really fix ARM64 locking
10 years ago
Grazvydas Ignotas
abade3f896
really fix ARM64 locking
10 years ago
Zhang Xianyi
d1349e7a11
Merge pull request #616 from notaz/arm_fixes
ARM fixes
10 years ago
Grazvydas Ignotas
3efeaed0d8
correct a minor mistake
10 years ago
Grazvydas Ignotas
d38a1ddc7a
use real armv5 support
there is no more requirement for ARMv6 instructions,
and VFP on ARMv5 is uncommon
10 years ago
Grazvydas Ignotas
6b92204a7c
add fallback blas_lock implementation
to be used on armv5 and new platforms
10 years ago
Grazvydas Ignotas
f2ac1a5cee
set ARMV7 for Cortex-A9 and Cortex-A15
otherwise some macros like YIELDING are not defined correctly
10 years ago
Grazvydas Ignotas
e12cf1123e
add fallback rpcc implementation
- use on arm, arm64 and any new platform
- use faster integer math instead of double
- use similar scale as rdtsc so that timeouts work
10 years ago
Grazvydas Ignotas
d3e2f0a1af
add missing barriers
should fix issue #597
10 years ago
Grazvydas Ignotas
c2323dd4d2
really fix ARM locking
- was writing 0 to lock variable, so was ineffective
- only exit loop if both lock was 0 and strex was successful
10 years ago
Zhang Xianyi
bb6e050509
Merge pull request #614 from xantares/cmake_version
install OpenBLASConfigVersion.cmake
10 years ago
xantares
87336b9acf
install OpenBLASConfigVersion.cmake
10 years ago
Zhang Xianyi
c50661e5b7
Merge pull request #613 from fabioperez/develop
Add POWER7/POWER8 as targets
10 years ago
Fábio Perez
b8d64a856a
Add POWER7/POWER8 as targets
10 years ago
Zhang Xianyi
898fc7552a
Merge pull request #612 from ibmsoe/ppc64le
ppc64le platform support (ELF ABI v2)
10 years ago
Zhang Xianyi
1cf2b10224
Use pure C generic target on x86 and x86_64.
make TARGET=GENERIC
?gemm3m is unimplemented on generic target.
10 years ago
Matthew Brandyberry
7ba4fe5afb
ppc64le platform support (ELF ABI v2)
10 years ago
Zhang Xianyi
3f1b57668e
Fix blas lock bug on AArch64.
10 years ago
Zhang Xianyi
d8f18d32c3
Merge pull request #595 from tanderson92/fixTests
Fix test execution when USE_OPENMP=0
10 years ago
wernsaar
bdb5c842fc
Merge pull request #596 from wernsaar/develop
optimizations for haswell
10 years ago
Werner Saar
e7c969e164
added optimized dtrmm_kernel for haswell
10 years ago
Werner Saar
9bd962f655
modified haswell parameter dgemm_unroll_n
10 years ago
Thomas Anderson
4f5691e5c0
Fix test execution when USE_OPENMP=0
The standard way to disable OpenMP support is to set USE_OPENMP=0,
as indicated by other checks to see if USE_OPENMP equals 1. The
problem is obviously then that `ifdef USE_OPENMP` is very much not
what we want to test for. This causes tests to fail when no OpenMP
library is installed.
10 years ago
Zhang Xianyi
29293160a4
Fix #593 . Change MACOSX_DEPLOYMENT_TARGET to 10.6.
10 years ago
wernsaar
3e33afef2e
Merge pull request #592 from wernsaar/develop
added benchmark scripts
10 years ago
Werner Saar
8614057ea9
added benchmark scripts for numpy, octave and R
10 years ago
Werner Saar
7f375f9e8f
updated geev benchmark
10 years ago
wernsaar
69c5169e7d
Merge pull request #589 from wernsaar/develop
small modification of gemm.c
10 years ago
Werner Saar
e19948baa1
small modification of gemm.c
10 years ago