Zhang Xianyi
034ffa93fa
Provide iaxpy and cblas_iaxpy for integer vectors. make INTEGER_PRECISION=1
10 years ago
Zhang Xianyi
a11555c715
Support Android NDK armeabi-v7a-hard ABI. (-mfloat-abi=hard)
e.g.
make HOSTCC=gcc CC=arm-linux-androideabi-gcc NO_LAPACK=1 TARGET=ARMV7
In Android NDK, it uses armeabi-v7a-hard ABI.
TARGET_CFLAGS += -mhard-float -D_NDK_MATH_NO_SOFTFP=1
TARGET_LDFLAGS += -Wl,--no-warn-mismatch -lm_hard
For more information, please check hard-float example at
android_ndk/tests/device/hard-float/jni/.
10 years ago
Zhang Xianyi
51ff17d46e
Add AMD Excavator target.
10 years ago
powderluv
ebb9eba987
Fix build with ALLOC_SHM=0 (Android NDK)
Refactor so that you can build with ALLOC_SHM=0. HugeTLB
implicitly depends on ALLOC_SHM=1. This patch allows
building for Android NDK r10d.
10 years ago
Zhang Xianyi
8e5a1083bb
Refs #532 . Improve gemv parallelism for the small m and large n case.
Split the matrix and do a reduction.
10 years ago
Zhang Xianyi
9798481979
Refs #478 , #482 . Fix segfault bug for gemv_t with MAX_ALLOC_STACK flag.
For gemv_t, directly use malloc to create the buffer.
10 years ago
Zhang Xianyi
8977b3f235
Refs #529 . Support Intel Broadwell by Haswell kernels.
10 years ago
Zhang Xianyi
e95d64333a
Refs #519 . Avoid calling strncpy.
10 years ago
Ton van den Heuvel
b6438dedea
Fix issue #508
Fix race condition during shutdown causing a crash in
gotoblas_set_affinity().
10 years ago
Zhang Xianyi
cfa9392ffa
Fix openblas_get_num_threads and openblas_get_num_procs bug with single thread.
10 years ago
Erik Schnetter
65a847cd36
Introduce openblas_get_num_threads and openblas_get_num_procs
10 years ago
Werner Saar
0dc559ed30
bugfix in dynamic.c
10 years ago
Werner Saar
4319769b79
added target processor STEAMROLLER
10 years ago
Zhang Xianyi
2fb02626da
Update organization info.
11 years ago
Zhang Xianyi
695e0fa649
#463 fixed a compiling bug on AIX.
11 years ago
wernsaar
7aae4a62e7
enabled use of GEMM3M functions
11 years ago
wernsaar
a64fe9bcc9
added optimized sgemv_n kernel for sandybridge
11 years ago
wernsaar
2021d0f9d6
experimentally removed expensive function calls
11 years ago
Isaac Dunham
f7eb81a846
Fix link error on Linux/musl.
get_nprocs() is a GNU convenience function equivalent to POSIX2008
sysconf(_SC_NPROCESSORS_ONLN); the latter should be available in unistd.h
on any current *nix. (OS X supports this call since 10.5, and FreeBSD
currently supports it. But this commit does not change FreeBSD or OS X
versions.)
11 years ago
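The musl fix above swaps get_nprocs() for sysconf(_SC_NPROCESSORS_ONLN). A minimal sketch of that query follows; the helper name online_cpu_count and the fallback to 1 on failure are assumptions made for illustration, not code taken from the commit.

```c
#include <unistd.h>

/* Hypothetical helper (not the OpenBLAS function): ask the C library how
 * many processors are currently online. sysconf() returns -1 on failure,
 * so fall back to 1 to keep callers safe. */
static int online_cpu_count(void)
{
    long n = sysconf(_SC_NPROCESSORS_ONLN);
    return (n < 1) ? 1 : (int)n;
}
```

A caller would use online_cpu_count() wherever get_nprocs() was used before, which avoids the GNU-only symbol at link time.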
wernsaar
793175be3a
added experimental support for big numa machines
11 years ago
wernsaar
1d33547222
optimized zgemm kernel for haswell
11 years ago
wernsaar
3ea4dadd30
optimizations for trsm
11 years ago
wernsaar
1b10ff129a
optimizations for trmm
11 years ago
wernsaar
125610d23b
allow setting a custom value for ?GEMM_DEFAULT_UNROLL_MN, optimizations for syrk
11 years ago
Zhang Xianyi
c94762bb56
Refs #401 . Added NO_AVX2 flag for old binutils (e.g. RHEL6)
11 years ago
Zhang Xianyi
552119c484
Fixed #407 . Support reporting the CPU core name at runtime.
The user can call char * openblas_get_config() or char * openblas_get_corename().
11 years ago
wernsaar
50e99a52ea
added definitions for PILEDRIVER and HASWELL
11 years ago
wernsaar
be94db096c
disabled *3M functions for x86_64 platforms
11 years ago
Zhang Xianyi
7a8949e0ce
Merge branch 'develop' of https://github.com/TimothyGu/OpenBLAS into TimothyGu-develop
Conflicts:
driver/others/memory.c
11 years ago
Timothy Gu
6c2ead30f0
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
11 years ago
Jameson Nash
f41f03ab83
fix #394 . this cleans up some handles after using them, and doesn't disable ALL process privileges upon success
11 years ago
wernsaar
438002204d
Ref #393 : fix for INTERFACE64=0 and ARCH_X86 in divtable
11 years ago
wernsaar
53bfa51ee0
Ref #385 : fixed warnings in dynamic.c
11 years ago
wernsaar
a86d349a51
Ref #380 : enhancements for dynamic_arch
11 years ago
wernsaar
a35a1a9ae7
changed makefiles for lapack development
11 years ago
Olivier Grisel
2c556f093a
Add cast to function pointer to remove warning
11 years ago
Olivier Grisel
3b027d2528
Do not reference pthread_atfork in non-SMP_SERVER mode
11 years ago
Olivier Grisel
49bd98f410
Do not reference pthread_atfork under windows
11 years ago
Olivier Grisel
138a841390
FIX #294 : make OpenBLAS thread-pool resilient to fork via pthread_atfork
11 years ago
Olivier Grisel
046e4013cb
Revert "Refs #294 . Used pthread_atfork to avoid hang after a Unix fork."
This reverts commit 3617c22a56.
11 years ago
Zhang Xianyi
3617c22a56
Refs #294 . Used pthread_atfork to avoid hang after a Unix fork.
The problem is the mutex we use in blas_server. Thus, we must clear
the mutex before the fork and re-init it in the parent and child processes.
If you use OpenMP, GOMP currently has the same problem. Please try another OpenMP
implementation.
11 years ago
Zhang Xianyi
8c7687b419
Refs #338 . Added OPENBLAS_VERBOSE environment variable at runtime
By default, OpenBLAS doesn't output warning messages. You can set
OPENBLAS_VERBOSE (e.g. export OPENBLAS_VERBOSE=1) to enable warning
messages at runtime.
11 years ago
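One way a library could gate its warnings on a variable such as OPENBLAS_VERBOSE is sketched below. This is an illustration under assumptions, not the OpenBLAS implementation: the helper names are hypothetical, and treating non-numeric or negative values as "off" is a choice made here.

```c
#include <stdlib.h>

/* Hypothetical helper: map an environment value to a verbosity level.
 * Unset, non-numeric, or negative values all mean "quiet" (0). */
static int verbose_level_from(const char *value)
{
    if (value == NULL) {
        return 0;
    }
    int level = atoi(value);
    return (level > 0) ? level : 0;
}

/* Hypothetical helper: read the level from the process environment. */
static int verbose_level(void)
{
    return verbose_level_from(getenv("OPENBLAS_VERBOSE"));
}
```

A warning path would then print only when verbose_level() is at least 1, which matches the default-quiet behavior described in the commit message.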
Zhang Xianyi
ab69443bd4
Refs #332 . Added additional Intel Ivy Bridge and Haswell CPU IDs.
11 years ago
Zhang Xianyi
b263e096af
Refs #307 . Delete debug printf.
11 years ago
wernsaar
8191efc420
changed level3.c
12 years ago
wernsaar
0b6e13b689
Merge remote branch 'origin/develop' into haswell
12 years ago
wernsaar
5c648a8984
Merge remote branch 'origin/develop' into haswell
12 years ago
Zhang Xianyi
5048a80032
Refs #283 . Fixed the incorrect usage of long data type for Windows 64.
12 years ago
Zhang Xianyi
a2942456ef
Refs #307 . Fixed the hang when freeing the OpenBLAS DLL on Windows.
12 years ago
wernsaar
a77c71eaf5
added highly optimized dgemm_kernel for HASWELL
12 years ago