Zhang Xianyi
a69dd3fbc5
OpenBLAS 0.2.11 version.
11 years ago
wernsaar
0a22816e70
Ref #433 : removed obsolete lapack entries from common_interface.h
11 years ago
Zhang Xianyi
c3cd6e7e32
Merge pull request #434 from wernsaar/develop
A lot of performance enhancements
11 years ago
wernsaar
11eab4c019
added optimized cgemv_n for haswell
11 years ago
wernsaar
4568d32b6b
added optimized cgemv_t kernel for haswell
11 years ago
wernsaar
c1a6374c6f
optimized zgemv_n kernel for sandybridge
11 years ago
wernsaar
dc05937313
added additional test values
11 years ago
wernsaar
2470129132
added fast return, if m or n < 1
11 years ago
wernsaar
8c582d362d
optimized zgemv_t_microk_haswell-2.c
11 years ago
wernsaar
11e34ddd1b
bugfix for zgemv_n_microk_haswell-2.c
11 years ago
wernsaar
9528f0d9ee
bugfix in zgemv_n_microk_sandy-2.c
11 years ago
wernsaar
b06550519e
added optimized cgemv_t c-kernel
11 years ago
wernsaar
6093ee5363
bugfix in zgemv_n_microk_haswell-2.c
11 years ago
wernsaar
07c66b1960
modified algorithm for better numerical stability
11 years ago
wernsaar
58b075daef
added optimized zgemv_t kernel for haswell
11 years ago
wernsaar
09fcd3a341
add optimized zgemv_t kernel for bulldozer
11 years ago
wernsaar
726ad085cb
added optimized zgemv_t for haswell
11 years ago
wernsaar
6fe416976d
added optimimized zgemv_t c-kernel
11 years ago
wernsaar
dbc2eff029
disabled optimized haswell zgemv_n kernel for windows ( bad rounding )
11 years ago
wernsaar
462b4885ff
added optimized zgemv_n kernel for haswell
11 years ago
wernsaar
aa54fe064c
added zgemv_n c-function
11 years ago
wernsaar
006ef3ea01
added optimized dgemv_t kernel for haswell
11 years ago
wernsaar
60f17628cc
added optimized dgemv_n kernel for haswell
11 years ago
wernsaar
c9bad1403a
added optimized sgemv_t kernel for sandybridge
11 years ago
wernsaar
2f8927376f
enabled optimized nehalem sgemv_t kernel for windows
11 years ago
wernsaar
d945a2b06d
added optimized sgemv_t kernel for nehalem
11 years ago
wernsaar
ca6c8d06ce
enabled optimized sgemv kernels for windows
11 years ago
wernsaar
7aa43c8928
enabled optimized sgemv kernels for windows
11 years ago
wernsaar
891b960854
added optimized sgemv_t kernel for haswell
11 years ago
wernsaar
95a8caa2f3
added optimized sgemv_t kernel
11 years ago
Zhang Xianyi
5c0d0ecbde
Merge pull request #430 from wernsaar/develop
added a better optimized sgemv_n kernel
11 years ago
wernsaar
8c05b8105b
bugfix in sgemv_n.c
11 years ago
wernsaar
c80084a98f
changed default x86_64 sgemv_n kernel to sgemv_n.c
11 years ago
wernsaar
2bab92961f
enabled optimized sgemv_n kernels for windows
11 years ago
wernsaar
9175b8bd5f
changed long to blaslong for windows compatibility
11 years ago
wernsaar
793f2d43b0
added optimized sgemv_n kernel for nehalem
11 years ago
wernsaar
a4dde45f87
optimized sgemv_n kernel for sandybridge
11 years ago
wernsaar
7fa7ea3e1e
updated haswell optimized sgmv_n kernel
11 years ago
wernsaar
3fbc13eb65
modified sgemv_n for haswell
11 years ago
wernsaar
db6917303f
added a better optimized sgemv_n kernel for bulldozer and piledriver
11 years ago
Zhang Xianyi
c2fdeb6c22
Merge pull request #429 from idunham/numprocs
Fix link error on Linux/musl.
11 years ago
Isaac Dunham
f7eb81a846
Fix link error on Linux/musl.
get_nprocs() is a GNU convenience function equivalent to POSIX2008
sysconf(_SC_NPROCESSORS_ONLN); the latter should be available in unistd.h
on any current *nix. (OS X supports this call since 10.5, and FreeBSD
currently supports it. But this commit does not change FreeBSD or OS X
versions.)
11 years ago
Zhang Xianyi
edc329883c
Merge pull request #427 from wernsaar/develop
added experimental support for big numa machines
11 years ago
wernsaar
793175be3a
added experimental support for big numa machines
11 years ago
Zhang Xianyi
83c4ba8d32
Merge pull request #426 from wernsaar/develop
added benchmark program for lapack ?getri functions
11 years ago
wernsaar
271af406f3
bugfix for linux affinity code
11 years ago
wernsaar
f5f50b3563
added benchmarks for lapack potrf, potrs and potri functions
11 years ago
wernsaar
651dd22d7d
added benchmark program for lapack ?getri functions
11 years ago
Zhang Xianyi
f329f77bd0
Merge pull request #425 from wernsaar/develop
added benchmark for lapack ?geev routines
11 years ago
wernsaar
7c611a2f95
bugfix for zgeev
11 years ago