wernsaar
b985cea65d
adjust number of threads for sgemv and dgemv
11 years ago
wernsaar
d286daa2ba
adjusted number of threads for small size
11 years ago
wernsaar
bcb115b55b
added benchmark for gemv
11 years ago
wernsaar
7424e2b609
added additional test value
11 years ago
wernsaar
880597b301
segment violation in sgemv kernels
11 years ago
wernsaar
9c835431d0
modified paths to atlas, mkl and acml
11 years ago
wernsaar
1d4ffddf69
added conf option for number of loops
11 years ago
wernsaar
b0e7810a6b
added her2k benchmark
11 years ago
wernsaar
2b92a8c499
added herk benchmark
11 years ago
wernsaar
274b8dc91a
add hemm benchmark
11 years ago
wernsaar
74b237ca22
added syr2k benchmark
11 years ago
wernsaar
c353abd38c
added syrk benchmark
11 years ago
wernsaar
0acce17979
added trsm benchmark
11 years ago
wernsaar
2016a685e6
added trmm benchmark
11 years ago
wernsaar
1b9a6aac30
added benchmark for symm
11 years ago
wernsaar
e27433ab6a
added gemm benchmark and modified Makefile for benchmark
11 years ago
Zhang Xianyi
7961404a40
Merge pull request #411 from wernsaar/develop
Lapack-test on x86 32bit now runs without errors.
11 years ago
wernsaar
cedc1f4b14
Ref #410 : disabled optimized potri functions (single threading bug)
11 years ago
wernsaar
0884b73c69
Lapack-test Windows 32bit now error free
11 years ago
wernsaar
9bd9472ae9
Lapack-test: cleanup of x86 32bit KERNEL file
11 years ago
Zhang Xianyi
2e2473f390
Merge pull request #409 from wernsaar/develop
some fixes for Lapack and ARM platform
11 years ago
wernsaar
c4a423a642
bugfixes for lapack on ARM Platform
11 years ago
Zhang Xianyi
47688e24e9
OpenBLAS 0.2.10 rc2 version.
11 years ago
wernsaar
61ef0c3419
added cross compiler examples for 32bit and 64bit ARM
11 years ago
Zhang Xianyi
698e77dba4
Refs #406 . Fixed utest building bug.
11 years ago
wernsaar
2081f6e8ff
Lapack bug114: replaced cgesvd.f and zgesvd.f
11 years ago
wernsaar
dc6b809f15
Lapack bug117: replaced zstemr.f
11 years ago
wernsaar
0f08684649
Lapack bug118: replaced clanhf.f and zlanhf.f
11 years ago
Zhang Xianyi
552119c484
Fixed #407 . Support outputting the CPU corename at runtime.
The user can use char * openblas_get_config() or char * openblas_get_corename().
11 years ago
Zhang Xianyi
94d3cfaa10
Merge pull request #404 from wernsaar/develop
A lot of fixes for v0.2.10-rc2
11 years ago
wernsaar
13348b2137
removed reference to daxpy_bulldozer kernel (Windows bug in lapack-test)
11 years ago
wernsaar
783a7d2202
bugfix for fortran compiler
11 years ago
wernsaar
50e99a52ea
added definitions for PILEDRIVER and HASWELL
11 years ago
wernsaar
9964ed2f79
bugfix for CORE2
11 years ago
wernsaar
d5b976f92d
fallback to zgemm_kernel_4x2_sse.S
11 years ago
wernsaar
f7267d9b0e
added missing definition for DUNNINGTON
11 years ago
wernsaar
e0c080a28c
removed reference to zgemm_kernel_4x2_sse3.S (bug in lapack-test)
11 years ago
wernsaar
e80b144932
enabled compiling of *3M functions
11 years ago
wernsaar
02a504c0b8
fixed my bug in ger.c
11 years ago
wernsaar
be94db096c
disabled *3M functions for x86_64 platforms
11 years ago
wernsaar
b079df9ef4
added optimized sdot- and dsdot-kernel, written in C
11 years ago
wernsaar
aee61456a4
disabled SMP for sbmv and zsbmv again
11 years ago
wernsaar
01a119abfc
enabled SMP for sbmv and zsbmv, but only for 64bit binaries
11 years ago
wernsaar
1fad2b759f
enabled smp for ger.c and zger.c, but only for 64bit binaries
11 years ago
wernsaar
e1e83a1b71
modification to run blas-test on Windows
11 years ago
Zhang Xianyi
1127f5a2d7
OpenBLAS 0.2.10 rc1 version.
11 years ago
Zhang Xianyi
0ae4cc2803
Merge branch 'wernsaar-develop' into develop
11 years ago
Zhang Xianyi
99efbbbad5
Fixed #395 . Enable optimized cgemm for Sandybridge. Added optimized sdot kernel.
Fixed c/zgemm, zgemv computational error of haswell, piledriver, bulldozer, and
barcelona on Windows.
Merge branch 'develop' of https://github.com/wernsaar/OpenBLAS into wernsaar-develop
Conflicts:
kernel/Makefile.L1
kernel/x86_64/KERNEL
param.h
11 years ago
wernsaar
22e5aee2dd
fixed zgemv bug for older AMD Processors
11 years ago
Zhang Xianyi
249917700d
Merge branch 'TimothyGu-develop' into develop
Fixed #398 . Remove all trailing whitespace except lapack-netlib.
11 years ago