Hank Anderson
6b5d26e07b
Added SMP sources to level2 CMakeLists.txt.
10 years ago
Hank Anderson
627d5e7401
Added SMP objects to driver/level3.
10 years ago
Hank Anderson
943fa2fb58
Fixed object names in level2.
10 years ago
Hank Anderson
461e691127
Codes when define is absent are now a parameter to AllCombinations.
The level3 object names should now be correct.
10 years ago
Hank Anderson
cfaf1c678f
Added option to append define codes with an underscore.
Fixed the code array not getting reset on subsequent AllCombinations
calls.
10 years ago
Hank Anderson
0d7bad1f35
Changed GenerateObjects to append combination codes (e.g. dtrmm_TU).
10 years ago
Hank Anderson
2828f6630c
Added SMP sources to COMMONOBJS.
10 years ago
Erik Schnetter
65a847cd36
Introduce openblas_get_num_threads and openblas_get_num_procs
10 years ago
Hank Anderson
7194424fef
Added missing common objects to the library.
10 years ago
Hank Anderson
d11bde60d0
DOUBLE define for DBLAS objects is now set in main CMakeLists.txt.
Since the objects are the same, could generate SINGLE/COMPLEX/etc here
without having to rewrite all the object enumeration code again.
10 years ago
Hank Anderson
5057a4b4df
Added openblas add_library call that uses DBLAS_OBJS ojbects.
10 years ago
Hank Anderson
3e8ea7a351
Added COMMONOBJS to driver/others CMakeLists.txt.
10 years ago
Hank Anderson
d3dcdddf75
Moved functions into util cmake file.
10 years ago
Hank Anderson
e5e7595bf9
Added paramater to GenerateObjects for defines that affect all sources.
10 years ago
Hank Anderson
7693887d61
Added empty set to the combinations generated by AllCombinations.
10 years ago
Hank Anderson
8d9b196e0d
Moved loop over define combos into a function.
This function takes a set of sources and a set of preprocessor
definitions. It will iterate over the sources and build an object
file for each combination of preprocessor definitions for each
source file.
10 years ago
Hank Anderson
a6cf8aafc0
Updated level3/CMakeLists with correct defines using all combos.
10 years ago
Hank Anderson
dbdca7bf0c
Added first pass at driver/level3 Makefile conversion.
Added a rather convoluted CMake function to find all combinations
of a given list. This will be useful for the object files that are
compiled multiple times with different combinations of preprocessor
definitions.
10 years ago
Hank Anderson
8c23965da3
prebuild.cmake now reads the output from getarch into CMake vars.
10 years ago
Hank Anderson
8ede4a8da4
getarch now compiles and sets config.h defines properly.
Still isn't parsed into CMake variables, and getarch_2 needs to
get the same treatment.
10 years ago
Hank Anderson
1c5b6bb4f7
Added CORE define to config.h in prebuild.cmake (temporarily).
10 years ago
Hank Anderson
9a508abdc7
Added first pass at driver/level2 makefile conversion.
10 years ago
Werner Saar
0dc559ed30
bugfix in dynamic.c
11 years ago
Werner Saar
4319769b79
added target processor STEAMROLLER
11 years ago
Zhang Xianyi
2fb02626da
Update organization info.
11 years ago
Zhang Xianyi
695e0fa649
#463 fixed a compiling bug on AIX.
11 years ago
wernsaar
7aae4a62e7
enabled use of GEMM3M functions
11 years ago
wernsaar
a64fe9bcc9
added optimized sgemv_n kernel for sandybridge
11 years ago
wernsaar
2021d0f9d6
experimentally removed expensive function calls
11 years ago
Isaac Dunham
f7eb81a846
Fix link error on Linux/musl.
get_nprocs() is a GNU convenience function equivalent to POSIX2008
sysconf(_SC_NPROCESSORS_ONLN); the latter should be available in unistd.h
on any current *nix. (OS X supports this call since 10.5, and FreeBSD
currently supports it. But this commit does not change FreeBSD or OS X
versions.)
11 years ago
wernsaar
793175be3a
added experimental support for big numa machines
11 years ago
wernsaar
1d33547222
optimized zgemm kernel for haswell
11 years ago
wernsaar
3ea4dadd30
optimizations for trsm
11 years ago
wernsaar
1b10ff129a
optimizations for trmm
11 years ago
wernsaar
125610d23b
allow to set custom value for ?GEMM_DEFAULT_UNROLL_MN, optimizations for syrk
11 years ago
Zhang Xianyi
c94762bb56
Refs #401 . Added NO_AVX2 flag for old binutils (e.g. RHEL6)
11 years ago
Zhang Xianyi
552119c484
Fixed #407 . Support outputing the CPU corename on runtime.
The user can use char * openblas_get_config() or char * openblas_get_corename().
11 years ago
wernsaar
50e99a52ea
added definitions for PILEDRIVER and HASWELL
11 years ago
wernsaar
be94db096c
disabled *3M functions for x86_64 platforms
11 years ago
Zhang Xianyi
7a8949e0ce
Merge branch 'develop' of https://github.com/TimothyGu/OpenBLAS into TimothyGu-develop
Conflicts:
driver/others/memory.c
11 years ago
Timothy Gu
6c2ead30f0
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
11 years ago
Jameson Nash
f41f03ab83
fix #394 . this cleans up some handles after using them, and doesn't disable ALL process privileges upon success
11 years ago
wernsaar
438002204d
Ref #393 : fix for INTERFACE64=0 and ARCH_X86 in divtable
11 years ago
wernsaar
53bfa51ee0
Ref #385 : fixed warnings in dynamic.c
11 years ago
wernsaar
a86d349a51
Ref #380 : enhancements for dynamic_arch
11 years ago
wernsaar
a35a1a9ae7
changed makefiles for lapack development
11 years ago
Olivier Grisel
2c556f093a
Add cast to function pointer to remove warning
11 years ago
Olivier Grisel
3b027d2528
Do not reference pthread_atfork in non-SMP_SERVER mode
11 years ago
Olivier Grisel
49bd98f410
Do not reference pthread_atfork under windows
11 years ago
Olivier Grisel
138a841390
FIX #294 : make OpenBLAS thread-pool resilient to fork via pthread_atfork
11 years ago