wernsaar
|
9db0fb8b02
|
bugfix for sdsdot
|
11 years ago |
wernsaar
|
692b14cecd
|
rewrote rotmg.c instead of modifying very old code
|
11 years ago |
Zhang Xianyi
|
3e0a7b931c
|
Refs #333. Detect the wrong parameter for zherk/zher2k.
|
11 years ago |
Zhang Xianyi
|
73770e60b8
|
Refs #309. Fixed trtri_U single thread computational bug.
|
12 years ago |
Lars Buitinck
|
3f7b0cd994
|
Merge pull request #290 from larsmans/missing-threshold
check if GEMM_MULTITHREAD_THRESHOLD defined in gemm.c
Set a fallback value.
|
12 years ago |
Zhang Xianyi
|
c92ae012a6
|
Refs #279. Provide ONLY_CBLAS flag. If you only need CBLAS without
a fortran compiler, please try make ONLY_CBLAS=1.
This mode compiles only CBLAS, without the BLAS Fortran interface and LAPACK.
|
12 years ago |
Zhang Xianyi
|
a07cc39571
|
Refs #266. Fixed the compiling bug with Open64 5.0.
|
12 years ago |
Zhang Xianyi
|
b5c2ac4fd6
|
Fixed #264 the memory leak bug in dtrtri_U.
|
12 years ago |
Elliot Saba
|
6f5b395009
|
Fix xianyi/OpenBLAS#256
|
12 years ago |
Zhang Xianyi
|
fd0c388681
|
Refs #191. A workaround for the dtrtri_U single-thread bug.
This function caused the failure of ERKALE serial test.
I replaced it with LAPACK source code.
|
12 years ago |
Jameson Nash
|
d0e731e8b8
|
provide support for passing CFLAGS, FFLAGS, PFLAGS, FPFLAGS to make on the command line
|
13 years ago |
Xianyi Zhang
|
83ecfbb9b3
|
Merge branch 'loongson3a' into release-0.1.0
|
13 years ago |
Xianyi Zhang
|
31c836ac25
|
Ref #79 Added GEMM_MULTITHREAD_THRESHOLD flag to use single thread in gemm function with small matrices.
|
13 years ago |
Xianyi Zhang
|
722dd08703
|
Ref #80. On a P4 CPU with 32-bit Windows XP, Octave crashed with OpenBLAS. Workaround: use the netlib reference gemv instead of our own functions.
For example, make USE_NETLIB_GEMV=1
|
13 years ago |
traz
|
a4292976e9
|
Adding detection of complex situations in symm.c, otherwise the buffer address of sb will overlap the end of sa.
|
14 years ago |
Xianyi Zhang
|
aeed8d6225
|
Fixed #27. Temporarily work around axpy's low performance with small input sizes and multiple threads.
|
14 years ago |
Xianyi Zhang
|
1496383224
|
Print the wall time (cycles) when FUNCTION_PROFILE is enabled.
|
14 years ago |
Xianyi Zhang
|
fcb5ce011b
|
Fixed #28. Convert the result to double precision in MIPS64 dsdot_k kernel.
|
14 years ago |
Xianyi Zhang
|
fa8e4fd879
|
Fixed #26 the wrong result of rotmg. Used fabs() instead of abs().
|
14 years ago |
Xianyi Zhang
|
8f1090d32a
|
Support NO_LAPACK=1 to build the lib without LAPACK functions.
|
14 years ago |
Xianyi Zhang
|
0cfd29a819
|
Fixed #7. 1) Disabled multithreading and 2) modified the kernel code to avoid unrolling in the axpy function when incx==0 or incy==0.
|
14 years ago |
Xianyi Zhang
|
78da0e0a0c
|
Fixed #6. Disable multi-thread swap when incx==0 or incy==0.
|
14 years ago |
Xianyi Zhang
|
342bbc3871
|
Import GotoBLAS2 1.13 BSD version codes.
|
14 years ago |