wernsaar
9964ed2f79
bugfix for CORE2
11 years ago
wernsaar
d5b976f92d
fallback to zgemm_kernel_4x2_sse.S
11 years ago
wernsaar
f7267d9b0e
added missing definition for DUNNINGTON
11 years ago
wernsaar
e0c080a28c
removed reference to zgemm_kernel_4x2_sse3.S (bug in lapack-test)
11 years ago
wernsaar
e80b144932
enabled compiling of *3M functions
11 years ago
wernsaar
02a504c0b8
fixed my bug in ger.c
11 years ago
wernsaar
be94db096c
disabled *3M functions for x86_64 platforms
11 years ago
wernsaar
b079df9ef4
added optimized sdot- and dsdot-kernel, written in C
11 years ago
wernsaar
aee61456a4
disabled SMP for sbmv and zsbmv again
11 years ago
wernsaar
01a119abfc
enabled SMP for sbmv and zsbmv, but only for 64bit binaries
11 years ago
wernsaar
1fad2b759f
enabled smp for ger.c and zger.c, but only for 64bit binaries
11 years ago
wernsaar
e1e83a1b71
modification, to run blas-test on Windows
11 years ago
Zhang Xianyi
1127f5a2d7
OpenBLAS 0.2.10 rc1 version.
11 years ago
Zhang Xianyi
0ae4cc2803
Merge branch 'wernsaar-develop' into develop
11 years ago
Zhang Xianyi
99efbbbad5
Fixed #395 . Enable optimized cgemm for Sandybridge. Added optimized sdot kernel.
Fixed c/zgemm, zgemv computational error of haswell, piledriver, bulldozer, and
barcelona on Windows.
Merge branch 'develop' of https://github.com/wernsaar/OpenBLAS into wernsaar-develop
Conflicts:
kernel/Makefile.L1
kernel/x86_64/KERNEL
param.h
11 years ago
wernsaar
22e5aee2dd
fixed zgemv bug for older AMD Processors
11 years ago
Zhang Xianyi
249917700d
Merge branch 'TimothyGu-develop' into develop
Fixed #398 . Remove all trailing whitespace except lapack-netlib.
11 years ago
Zhang Xianyi
7a8949e0ce
Merge branch 'develop' of https://github.com/TimothyGu/OpenBLAS into TimothyGu-develop
Conflicts:
driver/others/memory.c
11 years ago
Zhang Xianyi
b82108f899
Merge pull request #399 from TimothyGu/upstr
Build import libs as .dll.a instead of .lib
11 years ago
Zhang Xianyi
8373ad4ec2
Merge pull request #397 from vtjnash/develop
fix #394
11 years ago
wernsaar
35d37e124f
bugfix for barcelona zgemv-kernel
11 years ago
wernsaar
d8ba46efdb
bugfix for bulldozer cgemm-, zgemm- and zgemv-kernel
11 years ago
wernsaar
a15f22a1f6
bugfix for piledriver cgemm-, zgemm- and zgemv-kernel
11 years ago
wernsaar
b94ea89f52
bugfix for haswell cgemm- and zgemm-kernel
11 years ago
wernsaar
35f668bb14
bugfix for cgemm_kernel_8x2_sandy.S
11 years ago
Timothy Gu
4ebbf758f5
.gitignore: add some more entries concerned with kernel
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
11 years ago
Timothy Gu
8615d6ec87
Build import libs as .dll.a instead of .lib
This is MinGW convention.
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
11 years ago
Timothy Gu
6c2ead30f0
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
11 years ago
Jameson Nash
f41f03ab83
fix #394 . this cleans up some handles after using them, and doesn't disable ALL process privileges upon success
11 years ago
wernsaar
365e8de346
added optimized cgemm-kernel for SANDYBRIDGE
11 years ago
wernsaar
578d1b6219
added DSDOT definition and enabled optimized sdot kernel
11 years ago
wernsaar
a6ae079b17
added blas-test from lapack
11 years ago
Zhang Xianyi
d10db52edb
Merge pull request #390 from wernsaar/develop
Ref #103 : enhancement for small matrix dimensions. Fixed some bugs. Enable sgemm for SNB and dgemm for NEHALEM
11 years ago
wernsaar
dabab2b5f4
added new optimized sgemm kernel for SANDYBRIDGE
11 years ago
wernsaar
aa2709c4e0
enabled optimized dgemm kernel for NEHALEM
11 years ago
wernsaar
9d6f2b594e
Fortran flag -frecursive is disabled by default
11 years ago
wernsaar
a13bcc1716
enabled optimized sgemv kernel for barcelona and piledriver
11 years ago
wernsaar
d2c82d7543
enabled optimized sgemv kernel for HASWELL
11 years ago
wernsaar
0517672dd0
enabled optimized sgemv kernels for nehalem, sandybridge and bulldozer
11 years ago
wernsaar
15d5dfa92c
fixed compiler warnings
11 years ago
wernsaar
d83373db61
added parameter for gemm3m kernels
11 years ago
wernsaar
88b6bf251a
force fallback for x86 32bit
11 years ago
wernsaar
4a2ab7460b
Ref #391 : force fallback for x86 32bit
11 years ago
wernsaar
86d8c8978b
Ref #391 : disabled SMP in ger.c and zger.c
11 years ago
wernsaar
316df0e821
fixed bug for INTERFACE64
11 years ago
wernsaar
438002204d
Ref #393 : fix for INTERFACE64=0 and ARCH_X86 in divtable
11 years ago
wernsaar
23203d52c1
Ref #380 : lowered stack usage for haswell kernels
11 years ago
wernsaar
73545a79cd
Ref #380 : lowered stack usage for piledriver and bulldozer kernels
11 years ago
wernsaar
a19d209005
Ref #103 : enhancement for small matrix dimensions
11 years ago
Zhang Xianyi
8602816536
Merge pull request #387 from davidanthoff/fixbuilderroronwin
Add -lgfortran flag to gcc call in a makefile
11 years ago