wernsaar
60f17628cc
added optimized dgemv_n kernel for haswell
11 years ago
wernsaar
c9bad1403a
added optimized sgemv_t kernel for sandybridge
11 years ago
wernsaar
2f8927376f
enabled optimized nehalem sgemv_t kernel for windows
11 years ago
wernsaar
d945a2b06d
added optimized sgemv_t kernel for nehalem
11 years ago
wernsaar
ca6c8d06ce
enabled optimized sgemv kernels for windows
11 years ago
wernsaar
7aa43c8928
enabled optimized sgemv kernels for windows
11 years ago
wernsaar
891b960854
added optimized sgemv_t kernel for haswell
11 years ago
wernsaar
95a8caa2f3
added optimized sgemv_t kernel
11 years ago
Zhang Xianyi
5c0d0ecbde
Merge pull request #430 from wernsaar/develop
added a better optimized sgemv_n kernel
11 years ago
wernsaar
8c05b8105b
bugfix in sgemv_n.c
11 years ago
wernsaar
c80084a98f
changed default x86_64 sgemv_n kernel to sgemv_n.c
11 years ago
wernsaar
2bab92961f
enabled optimized sgemv_n kernels for windows
11 years ago
wernsaar
9175b8bd5f
changed long to blaslong for windows compatibility
11 years ago
wernsaar
793f2d43b0
added optimized sgemv_n kernel for nehalem
11 years ago
wernsaar
a4dde45f87
optimized sgemv_n kernel for sandybridge
11 years ago
wernsaar
7fa7ea3e1e
updated haswell optimized sgmv_n kernel
11 years ago
wernsaar
3fbc13eb65
modified sgemv_n for haswell
11 years ago
wernsaar
db6917303f
added a better optimized sgemv_n kernel for bulldozer and piledriver
11 years ago
Zhang Xianyi
c2fdeb6c22
Merge pull request #429 from idunham/numprocs
Fix link error on Linux/musl.
11 years ago
Isaac Dunham
f7eb81a846
Fix link error on Linux/musl.
get_nprocs() is a GNU convenience function equivalent to POSIX2008
sysconf(_SC_NPROCESSORS_ONLN); the latter should be available in unistd.h
on any current *nix. (OS X supports this call since 10.5, and FreeBSD
currently supports it. But this commit does not change FreeBSD or OS X
versions.)
11 years ago
Zhang Xianyi
edc329883c
Merge pull request #427 from wernsaar/develop
added experimental support for big numa machines
11 years ago
wernsaar
793175be3a
added experimental support for big numa machines
11 years ago
Zhang Xianyi
83c4ba8d32
Merge pull request #426 from wernsaar/develop
added benchmark program for lapack ?getri functions
11 years ago
wernsaar
271af406f3
bugfix for linux affinity code
11 years ago
wernsaar
f5f50b3563
added benchmarks for lapack potrf, potrs and potri functions
11 years ago
wernsaar
651dd22d7d
added benchmark program for lapack ?getri functions
11 years ago
Zhang Xianyi
f329f77bd0
Merge pull request #425 from wernsaar/develop
added benchmark for lapack ?geev routines
11 years ago
wernsaar
7c611a2f95
bugfix for zgeev
11 years ago
wernsaar
296564e369
added lapack geev benchmark
11 years ago
Zhang Xianyi
27af6e35d3
Merge pull request #424 from ihnorton/fix_arm_cpuid
cpuid_arm: fix detection when cpuinfo uses "Processor"
11 years ago
Isaiah Norton
a183ad1df4
cpuid_arm: fix detection when cpuinfo uses "Processor"
instead of "model name"
11 years ago
wernsaar
799a0eabbd
bugfix in cholesky.c
11 years ago
wernsaar
ca63503e61
extented plot-filter.sh for linpack and cholesky benchmarks
11 years ago
Zhang Xianyi
4f83217df6
Merge pull request #422 from wernsaar/develop
optimization of sandybridge cgemm-kernel
11 years ago
wernsaar
5087096711
optimization of sandybridge cgemm-kernel
11 years ago
Zhang Xianyi
21f7768b26
Merge pull request #421 from wernsaar/develop
optimized sgemm- and cgemm-kernel for haswell
11 years ago
wernsaar
46bc4fd50c
optimized cgemm kernel for haswell
11 years ago
wernsaar
1cc02b4337
optimized sgemm kernel for haswell
11 years ago
Zhang Xianyi
6e223db7fc
Merge pull request #420 from wernsaar/develop
Optimizations for HASWELL
11 years ago
wernsaar
1d33547222
optimized zgemm kernel for haswell
11 years ago
wernsaar
3ea4dadd30
optimizations for trsm
11 years ago
wernsaar
1b10ff129a
optimizations for trmm
11 years ago
wernsaar
125610d23b
allow to set custom value for ?GEMM_DEFAULT_UNROLL_MN, optimizations for syrk
11 years ago
wernsaar
e213a42cde
added a sample plot-filter scripts and a header file for gnuplot
11 years ago
wernsaar
e4663be46a
added symv benchmark
11 years ago
wernsaar
11637b6926
add benchmark for ger
11 years ago
Zhang Xianyi
80bf3e6a35
Merge pull request #419 from wernsaar/develop
added optimized sgemv kernels for Sandy Bridge, Haswell, Bullldozer, and Piledriver.
11 years ago
wernsaar
6acbafe45b
added sgemv_n microkernel for haswell
11 years ago
wernsaar
5392d11b04
optimized sgemv_n_microk_sandy.c
11 years ago
wernsaar
c0fe95fb72
added sgemv_n microkernel for sandybridge
11 years ago