Zhang Xianyi
|
898fc7552a
|
Merge pull request #612 from ibmsoe/ppc64le
ppc64le platform support (ELF ABI v2)
|
10 years ago |
Zhang Xianyi
|
1cf2b10224
|
Use pure C generic target on x86 and x86_64.
make TARGET=GENERIC
?gemm3m is unimplemented on generic target.
|
10 years ago |
Matthew Brandyberry
|
7ba4fe5afb
|
ppc64le platform support (ELF ABI v2)
|
10 years ago |
Werner Saar
|
e7c969e164
|
added optimized dtrmm_kernel for haswell
|
10 years ago |
Werner Saar
|
9bd962f655
|
modified haswell parameter dgemm_unroll_n
|
10 years ago |
Werner Saar
|
24f58c8bb1
|
added optimized cscal and zscal kernels for steamroller
|
10 years ago |
Werner Saar
|
95b1faf667
|
added optimized cscal and zscal kernels for steamroller and piledriver
|
10 years ago |
Werner Saar
|
2d9e406050
|
added optimized cscal kernel for sandybridge
|
10 years ago |
Werner Saar
|
59083e3ce1
|
added optimized cscal kernel for bulldozer
|
10 years ago |
wernsaar
|
685be40339
|
Merge pull request #571 from wernsaar/develop
added optimized cscal and zscal functions
|
10 years ago |
Werner Saar
|
31c9e399e9
|
added optimized cscal kernel for haswell
|
10 years ago |
Werner Saar
|
7de6bb9889
|
added optimized zscal kernel for bulldozer
|
10 years ago |
Werner Saar
|
d63034303b
|
added optimized zscal kernel for haswell
|
10 years ago |
Zhang Xianyi
|
51ff17d46e
|
Add AMD Excavator target.
|
10 years ago |
Werner Saar
|
18e90ee2e3
|
bugfix: added static to functions
|
10 years ago |
Werner Saar
|
e00cccc41e
|
added optimized dscal kernel for piledriver
|
10 years ago |
Werner Saar
|
73f09bf64f
|
optimized dscal kernel for increment != 1
|
10 years ago |
Werner Saar
|
02e772c7e4
|
added optimized dscal kernel for haswell
|
10 years ago |
Werner Saar
|
7aee913991
|
added optimized dscal kernel for sandybridge
|
10 years ago |
Werner Saar
|
e50a933037
|
added optimized dscal kernel for bulldozer
|
10 years ago |
Werner Saar
|
133c11a156
|
updated dgemv_n kernel for nehalem
|
10 years ago |
Werner Saar
|
30f52d53df
|
optimized dgemv_n kernel for haswell
|
10 years ago |
Werner Saar
|
5e83d80725
|
optimized dger kernel for sandybridge
|
10 years ago |
Werner Saar
|
b2e1797dc6
|
added optimized sger kernel for sandybridge
|
10 years ago |
Werner Saar
|
e216f686cb
|
optimized saxpy and daxpy for sandybridge
|
10 years ago |
Werner Saar
|
fc0e0391f3
|
bugfixes: replaced int with BLASLONG
|
10 years ago |
Werner Saar
|
c22068c406
|
optimized sdot.c for increments != 1
|
10 years ago |
Werner Saar
|
dee100d0e4
|
optimized saxpy.c for increments != 1
|
10 years ago |
Werner Saar
|
0273966abb
|
optimized daxpy kernel for increments != 1
|
10 years ago |
Werner Saar
|
3a67daa954
|
optimized ddot.c for increments != 1
|
10 years ago |
Werner Saar
|
b4f2153dcd
|
added optimized ssymv kernels for sandybridge
|
10 years ago |
Werner Saar
|
1c4b0eeae3
|
added optimized ssymv kernels for haswell
|
10 years ago |
Werner Saar
|
1bec9abb9a
|
added optimized dsymv kernels for sandybridge
|
10 years ago |
Werner Saar
|
3814bf60d3
|
added optimized dsymv kernels for haswell
|
10 years ago |
Werner Saar
|
6d0db0151f
|
added optimized zaxpy-kernels
|
10 years ago |
Zhang Xianyi
|
37b9033c90
|
Merge pull request #543 from jeromerobert/develop
Fix a buffer overflow with MAX_STACK_ALLOC size in dgemv_t
|
10 years ago |
Werner Saar
|
13889515b3
|
added optimized caxpy-kernel for sandybridge
|
10 years ago |
Werner Saar
|
248c9340c3
|
added optimized caxpy-kernel for haswell
|
10 years ago |
Werner Saar
|
e9f33b4ca7
|
added optimized caxpy-kernel for steamroller
|
10 years ago |
Werner Saar
|
f5d847122a
|
updated caxpy_microk_bulldozer-2.c and caxpy.c
|
10 years ago |
Jerome Robert
|
a4c96eca67
|
Fix a buffer overflow with MAX_STACK_ALLOC size in dgemv_t
Refs #478, #482, 9798481, fd9fd42
|
10 years ago |
Werner Saar
|
baa0363ea2
|
add optimized ddot-kernel for piledriver
|
10 years ago |
Werner Saar
|
34ba66606a
|
add optimized daxpy-kernel for piledriver
|
10 years ago |
Werner Saar
|
f615dc7603
|
added optimized saxpy kernel for steamroller
|
10 years ago |
Werner Saar
|
331c417637
|
optimized saxpy for piledriver
|
10 years ago |
Werner Saar
|
d7a17ad85d
|
optimized sdot-kernel for pilediver
|
10 years ago |
Werner Saar
|
d35f6c63c2
|
add optimized daxpy-kernel for steamroller
|
10 years ago |
Werner Saar
|
166d76e864
|
added optimized sdot-kernel for steamroller
|
10 years ago |
Werner Saar
|
f9f127d838
|
added optimized ddot kernel for steamroller
|
10 years ago |
wernsaar
|
62231ab337
|
Merge pull request #538 from wernsaar/develop
Added optimized cdot- and zdot-kernels
|
10 years ago |