Sebastien Fabbro
9f0fb6e662
Respect user's LDFLAGS
12 years ago
Zhang Xianyi
63f14189e3
Refs #259 . Fixed missing LAPACK functions in shared library.
12 years ago
Zhang Xianyi
c5437149c0
Merge pull request #257 from staticfloat/develop
Add in return value for `interface/trtri.c`
12 years ago
Elliot Saba
6f5b395009
Fix xianyi/OpenBLAS#256
12 years ago
Zhang Xianyi
d4f9571818
Refs #255 . Didn't use f77 compiler.
12 years ago
Zhang Xianyi
937d838619
Update CONTRIBUTORS.md
12 years ago
Zhang Xianyi
6209c8fc44
Fixed #253 . Update doc for v0.2.7 version.
12 years ago
Zhang Xianyi
238ceb4ac0
Merge branch 'loongson3b' into develop
12 years ago
Zhang Xianyi
77b572fa0b
Merge branch 'loongson3a' into develop
Conflicts:
Makefile.system
12 years ago
Zhang Xianyi
f69f89b846
Fixed #254 . Added the date of changes in contributors file.
12 years ago
Zhang Xianyi
c77032b0cc
create contributor file.
12 years ago
wangqian
1b3b9e841d
Fixed a computational error in zgemm_kernel_4x4_sandy.S file.
12 years ago
Zhang Xianyi
b67252c2e4
Ensure the correct stack alignment on Win32.
12 years ago
Zhang Xianyi
c69e73b868
Fixed typo in generating shared library on x86_64.
12 years ago
Zhang Xianyi
b51e2ba1ee
Modified Makefile to avoid redundant echo.
12 years ago
Zhang Xianyi
9c0a834f98
Modified Makefile.install
12 years ago
Zhang Xianyi
2a7503e563
Refs #225 . Fixed a bug in GEMM OpenMP threading.
12 years ago
Zhang Xianyi
fd0c388681
Refs #191 . A walk around for dtrtri_U single thread bug.
This function caused the failure of ERKALE serial test.
I replaced it with LAPACK source code.
12 years ago
Zhang Xianyi
61a9582987
Changed makefile for lapack.
12 years ago
Zhang Xianyi
b681064c6c
Updated travis.
12 years ago
Zhang Xianyi
e80e285928
Update build matrix for Travis CI.
12 years ago
Zhang Xianyi
2ed0f6ab60
Fixed the typo.
12 years ago
Zhang Xianyi
5448643557
Fixed generating dll bug in last commit.
12 years ago
Zhang Xianyi
824c3c4df3
Fixed #251 . Merge branch 'grisuthedragon-develop' into develop
12 years ago
grisuthedragon
c19a488af2
create openblas_get_parallel to retrieve information which
parallelization model is used by OpenBLAS.
12 years ago
Zhang Xianyi
32d2ca3035
Refs #214 , #221 , #246 . Fixed the getrf overflow bug on Windows.
I used a smaller threshold since the stack size is 1MB on windows.
12 years ago
Zhang Xianyi
6df39ad9e7
Refs #248 . Support LAPACK and LAPACKE with lsbcc.
For LAPACKE, use LAPACK_COMPLEX_STRUCTURE.
The reson is lsbcc didn't define complex I in complex.h.
12 years ago
Zhang Xianyi
3a96e4cbcb
Merge pull request #249 from wernsaar/develop
replaced defined(DOUBLE) by !defined(XDOUBLE)
12 years ago
wernsaar
6f008abcef
replaced defined(DOUBLE) by !defined(XDOUBLE)
12 years ago
Zhang Xianyi
3eb5af1955
Refs #247 . Included lapack source codes. Avoid downloading tar.gz from netlib.org
Based on 3.4.2 version, apply patch.for_lapack-3.4.2.
12 years ago
Zhang Xianyi
fbb75e58b1
Fixed the typo in getarch.c
12 years ago
Zhang Xianyi
f54f5bac9e
Refs #248 . Fixed the LSB compatiable issue for BLAS only.
For example, make CC=lsbcc NO_LAPACK=1.
12 years ago
Zhang Xianyi
5d3312142a
Refs #221 #246 . Fixed the overflowing stack bug in mutlithreading BLAS3.
When NUM_THREADS(MAX_CPU_NUNBERS) is very large ,e.g. 256.
typedef struct {
volatile BLASLONG working[MAX_CPU_NUMBER][CACHE_LINE_SIZE * DIVIDE_RATE];
} job_t;
job_t job[MAX_CPU_NUMBER];
The job array is equal 8MB.
Thus, We use malloc instead of stack allocation.
12 years ago
Zhang Xianyi
886cbaf4e4
Support AMD Piledriver by bulldozer kernels.
12 years ago
Zhang Xianyi
0c4074e10b
Added Travis CI status image.
12 years ago
Zhang Xianyi
cc522aa21d
Use quiet make for Travis CI.
12 years ago
Zhang Xianyi
9c78fad721
Install gfortran in Travis CI.
12 years ago
Zhang Xianyi
6028232ad1
Added travis.yml file.
12 years ago
Zhang Xianyi
feb9a3889a
Improved make clean on Mac OS X.
12 years ago
Zhang Xianyi
32dbeb636d
Refs #221 . Set stack limit to 16MB to prevent a SEGFAULT bug on Mac OS X with DYNAMIC_ARCH=1 & NUM_THREADS=256.
12 years ago
Zhang Xianyi
57944538b6
Use ALIGN_5 instead of .algin 32 in assembly kernel. Added ALIGN_5 for 32-bit OSX.
12 years ago
Zhang Xianyi
3ce2c62b0b
Merge pull request #242 from danluu/readme.haswell
Update README to reflect Haswell support, etc.
12 years ago
Dan Luu
50464997a3
Fix miscellaneous typos
12 years ago
Zhang Xianyi
8e7cad1650
Fixed #217 openblas_config.h bug on Windows 64.
12 years ago
Dan Luu
590e6aeafc
Add Haswell support
12 years ago
Dan Luu
88ef307cef
Refs #241 . Add Haswell support (using sandybridge optimizations)
12 years ago
Zhang Xianyi
6e8501c8a1
Fixed #239 bug in param.h about BARCELONA and BULLDOZER.
12 years ago
Zhang Xianyi
fa916a0fac
Fixed #238 bug in lsame on x86.
12 years ago
Zhang Xianyi
fb298b34ae
Merge pull request #235 from wernsaar/develop
Added ddot, daxpy, dcopy kernels for AMD bulldozer.
12 years ago
wernsaar
16012767f4
added dcopy_bulldozer.S
12 years ago