wernsaar
|
f0f9b25bb6
|
added test for CGEMM3M function
|
11 years ago |
wernsaar
|
7aae4a62e7
|
enabled use of GEMM3M functions
|
11 years ago |
wernsaar
|
7a911569b8
|
added test for GEMM3M functions
|
11 years ago |
wernsaar
|
466bfb8b86
|
updated README.md
|
11 years ago |
Zhang Xianyi
|
70d1ba09b2
|
Update the doc for target list.
|
11 years ago |
Zhang Xianyi
|
d293b78b64
|
Merge pull request #451 from eshelman/patch-1
Add HASWELL to TargetList.txt
|
11 years ago |
Eliot Eshelman
|
9912dbbcf9
|
Add HASWELL to TargetList.txt
The Intel "Haswell" architecture is missing from the list of build targets.
|
11 years ago |
Zhang Xianyi
|
01bc462e8e
|
Merge pull request #449 from wernsaar/develop
optimized multithreading lower limits
|
11 years ago |
wernsaar
|
3300f5ebff
|
optimized multithreading lower limits
|
11 years ago |
Zhang Xianyi
|
59e2c20557
|
Merge pull request #448 from wernsaar/develop
Optimized cgemv and zgemv kernels
|
11 years ago |
wernsaar
|
b7c9566eea
|
removed obsolete gemv kernel files
|
11 years ago |
wernsaar
|
6df1b0be81
|
optimized zgemv_n_microk_sandy-4.c
|
11 years ago |
wernsaar
|
2ac1e076c1
|
added optimized zgemv_n kernel for sandybridge
|
11 years ago |
wernsaar
|
9908b6031c
|
bugfix in KERNEL.PILEDRIVER
|
11 years ago |
wernsaar
|
8f100a14f2
|
optimized cgemv_t kernel for haswell
|
11 years ago |
wernsaar
|
53b5726b04
|
added optimized cgemv_t kernel for haswell
|
11 years ago |
wernsaar
|
1a352b24e6
|
updated KERNEL.HASWELL
|
11 years ago |
wernsaar
|
5194818d4b
|
updated zgemv_t_4.c
|
11 years ago |
wernsaar
|
8a39cdb1c1
|
added optimized zgemv_t kernel for haswell
|
11 years ago |
wernsaar
|
fd2478c9e2
|
optimized interface/zgemv.c for multithreading
|
11 years ago |
wernsaar
|
0a1390f2d8
|
enabled optimized zgemv_t kernel for bulldozer
|
11 years ago |
wernsaar
|
a8b0812feb
|
optimized zgemv_t for bulldozer
|
11 years ago |
wernsaar
|
a0fb68ab42
|
added optimized zgemv_t kernel for bulldozer
|
11 years ago |
wernsaar
|
44c11165d5
|
bugfix in cgemv_t_4.c
|
11 years ago |
wernsaar
|
564be4eb72
|
added optimized cgemv_t kernel
|
11 years ago |
wernsaar
|
107c3ea7d5
|
added optimized zgemv_t routine
|
11 years ago |
wernsaar
|
bb8d698335
|
optimized zgemv_n_microk_haswell-4.c for small size
|
11 years ago |
wernsaar
|
e0192a6914
|
bugfix in zgemv_n_4.c
|
11 years ago |
wernsaar
|
bced4594bb
|
added optimized zgemv_n kernel
|
11 years ago |
wernsaar
|
cafba99b6b
|
bufix in cgemv_n_microk_haswell-4.c
|
11 years ago |
wernsaar
|
ac8f232b2a
|
more optimizations
|
11 years ago |
wernsaar
|
f98e1244c4
|
optimized cgemv_n_4.c
|
11 years ago |
wernsaar
|
be95700b30
|
added optimized cgemv_kernel for haswell
|
11 years ago |
wernsaar
|
4aa534ae93
|
added cgemv_n kernel, optimized for small sizes
|
11 years ago |
Zhang Xianyi
|
1cba8e7b11
|
Merge pull request #446 from grisuthedragon/cblas_matcopy
Add a CBLAS interface for the BLAS extension s/d/c/z*matcopy routines.
|
11 years ago |
Zhang Xianyi
|
d13e92f07e
|
Merge pull request #445 from wernsaar/develop
A lot of optimizations for gemv kernels
|
11 years ago |
wernsaar
|
baa46e4fba
|
added and tested optimized dgemv_n kernel for haswell
|
11 years ago |
wernsaar
|
faab7a181d
|
added optimized dgemv_n kernel for haswell
|
11 years ago |
wernsaar
|
8109d8232c
|
optimized dgemv_t kernel for haswell
|
11 years ago |
wernsaar
|
debc6d1a05
|
bugfix in KERNEL.HASWELL
|
11 years ago |
wernsaar
|
e73a0113ec
|
added optimized gemv kernels
|
11 years ago |
wernsaar
|
44f2bf9bae
|
added optimized dgemv_t kernel for haswell
|
11 years ago |
Martin Koehler
|
a057e5434d
|
add CBLAS interface for s/d/c/zimatcopy
|
11 years ago |
wernsaar
|
cd34e9701b
|
removed obsolete files
|
11 years ago |
Martin Köhler
|
7794766d3c
|
Add cblas_(s/d/c/z)omatcopy in order to have cblas interface for them.
|
11 years ago |
wernsaar
|
658939faaa
|
optimized dgemv_n kernel for small sizes
|
11 years ago |
wernsaar
|
f511807fc0
|
modified multithreading threshold
|
11 years ago |
wernsaar
|
c4d9d4e5f8
|
added haswell optimized kernel
|
11 years ago |
wernsaar
|
7c0a94ff47
|
bugfix in sgemv_n_microk_haswell-4.c
|
11 years ago |
wernsaar
|
cbbc80aad3
|
added optimized sgemv_t kernel for haswell
|
11 years ago |