Zhang Xianyi
f5df444ceb
Merge pull request #762 from jeromerobert/bug760
Let openblas_get_num_threads return the number of active threads
9 years ago
Zhang Xianyi
aaa8551c57
Merge pull request #749 from lotheac/illumos_fixes
illumos fixes
9 years ago
Jerome Robert
0d87c1ffb6
Let openblas_get_num_threads return the number of active threads
... not the number of allocated threads.
Close #760
9 years ago
Lauri Tirkkonen
e737e32fd1
RLIMIT_NPROC doesn't exist on illumos
9 years ago
Lauri Tirkkonen
97cd4b8aee
illumos fixes to memory.c
9 years ago
Werner Saar
b07d733a71
added updates for syrk and syr2k
9 years ago
Zhang Xianyi
055b481386
Fixed CMake bug for single core.
9 years ago
Werner Saar
0d22551a6b
increase the stack size limit in the constructor
10 years ago
Ralph Campbell
fbc21266e6
Minor C code fixes in driver/
10 years ago
Zhang Xianyi
839395fc25
Detect AMD Trinity and Richland.
10 years ago
j-bo
6040858b22
Fix #673
Add lacking headers declarations when compiling for Android ARM7
10 years ago
Zhang Xianyi
70642fe4ed
Refs #668 . Raise the signal when pthread_create fails.
Thank James K. Lowden for the patch.
10 years ago
Zhang Xianyi
2feef49fa8
Merge branch 'develop' into cmake
Conflicts:
driver/others/memory.c
10 years ago
Zhang Xianyi
1ce054fcb3
Refs #669 . Fixed the build bug with gcc on Mac OS X.
10 years ago
Zhang Xianyi
d8392c1245
Fixe cmake config bugs.
10 years ago
Zhang Xianyi
94b125255f
Merge branch 'develop' into cmake
Conflicts:
driver/others/memory.c
10 years ago
Zhang Xianyi
11ac4665c8
Fixed #654 . Make sure the gotoblas_init function is run before all other static initializations.
10 years ago
Zhang Xianyi
cc7cab8a45
Detect other Intel Skylake cores.
http://users.atw.hu/instlatx64/
10 years ago
Yichao Yu
61ae47eb99
Ref #632 . Support Intel Skylake by Haswell kernels.
10 years ago
Grazvydas Ignotas
d3e2f0a1af
add missing barriers
should fix issue #597
10 years ago
Zhang Xianyi
f8eba3d548
Fixed cmake build bugs on Linux.
10 years ago
Zhang Xianyi
f874465bb8
Use cmake to build OpenBLAS GENERIC Target on MSVC x86 64-bit.
Disable CBLAS and LAPACK.
10 years ago
Zhang Xianyi
dcd5ba4443
Merge branch 'cmake' of https://github.com/hpanderson/OpenBLAS into hpanderson_cmake
10 years ago
Zhang Xianyi
a11555c715
Support Android NDK armeabi-v7a-hard ABI. (-mfloat-abi=hard)
e.g.
make HOSTCC=gcc CC=arm-linux-androideabi-gcc NO_LAPACK=1 TARGET=ARMV7
In Android NDK, it uses armeabi-v7a-hard ABI.
TARGET_CFLAGS += -mhard-float -D_NDK_MATH_NO_SOFTFP=1
TARGET_LDFLAGS += -Wl,--no-warn-mismatch -lm_hard
For more information, please check hard-float example at
android_ndk/tests/device/hard-float/jni/.
10 years ago
Zhang Xianyi
51ff17d46e
Add AMD Excavator target.
10 years ago
powderluv
ebb9eba987
Fix build with ALLOC_SHM=0 (Android NDK)
Refactor such that you can build with ALLOC_SHM=0. HughTLB
implicity depends on ALLOC_SHM=1. This patch allows
building for Android NDK r10d.
10 years ago
Zhang Xianyi
8e5a1083bb
Refs #532 . Improve gemv paralel with small m and large n case.
Splite the matrix and reduction.
10 years ago
Zhang Xianyi
9798481979
Refs #478 , #482 . Fix segfault bug for gemv_t with MAX_ALLOC_STACK flag.
For gemv_t, directly use malloc to create the buffer.
10 years ago
Zhang Xianyi
8977b3f235
Refs #529 . Support Intel Broadwell by Haswell kernels.
10 years ago
Zhang Xianyi
e95d64333a
Refs #519 . Avoid calling strncpy.
10 years ago
Ton van den Heuvel
b6438dedea
Fix issue #508
Fix race condition during shutdown causing a crash in
gotoblas_set_affinity().
10 years ago
Hank Anderson
5ae8993752
Added intrinsics for MSVC.
10 years ago
Hank Anderson
84d90d6ed8
Fixed some compiler errors/warnings for clang.
10 years ago
Hank Anderson
9eaea02f33
Added additional gemm defines for complex types.
10 years ago
Hank Anderson
ab7043373f
Fixed bug generating trmv complex source names.
10 years ago
Hank Anderson
0553476fba
Added TRANS defines for complex sources in lapack.
10 years ago
Hank Anderson
2416d9dbac
Fixed TRANSA defines for complex sources in driver/level2.
10 years ago
Hank Anderson
0d8e227ea7
Changed strategy for setting preprocessor definitions.
Instead of generating separate object files for each permutation of
defines for a source file, GenerateNamedObjects now writes an entirely
new source file and inserts the defines as #define c statements.
This solves a problem I ran into with ar.exe where it was refusing to
link objects that had the same filename despite having different paths.
10 years ago
Hank Anderson
1b7f427401
Added conj gemv objects for complex build.
10 years ago
Hank Anderson
fb5d5bb971
Added defines for complex trmv.
10 years ago
Hank Anderson
371071d461
Added CONJ defines for trmm/trsm.
10 years ago
Hank Anderson
8a143516e3
Added alternate_name to a couple of the name mangling schemes.
Added zherk_k sources to driver/level3.
10 years ago
Hank Anderson
e5897ecb9b
Added zherk_kernel.c objects to driver/level3.
10 years ago
Hank Anderson
33c5e8db7f
Added a helper function for setting the L1 kernel defaults.
Added loop to build objects with different KERNEL defines.
10 years ago
Hank Anderson
4662a0b13a
Changed generate functions to iterate through a list of float types.
This will generate obj files for SINGLE/DOUBLE/COMPLEX/DOUBLE COMPLEX.
10 years ago
Hank Anderson
e74462a3f5
Moved declarations to start of functions to satisfy MSVC C89 implementation.
10 years ago
Hank Anderson
056ba26755
Changed a number of inline calls to use __inline.
MSVC doesn't inmplement C99, so can't use the inline keyword. __inline
appears to work in MSVC and GCC.
10 years ago
Hank Anderson
c94fe71278
Removed incoming-stack-boundary for MSVC.
Made float type optional for GenerateNamedObjects.
Called GenerateNamedObjects for a couple of driver/others files that
needed NAME/CNAME set.
10 years ago
Hank Anderson
e8c39138c6
Removed return value from GenerateNamedObjects.
It sets DBLAS_OBJS directly to save a bunch of list appending in the
CMakeLists.txt files.
10 years ago
Hank Anderson
7fa5c4e2fd
Fixed some case issues with ARCH.
Added some kernel and driver/others objects.
10 years ago