wernsaar
c26bbee489
enabled abd tested optimized trtri lapack functions
11 years ago
wernsaar
c4ccb3fbb2
removed lapack/getri because it was never used
11 years ago
wernsaar
a748d3a75d
enabled optimized trti2 lapack functions again
11 years ago
wernsaar
dbaeea7b59
enabled lauu2 and lauum lapack functions again
11 years ago
wernsaar
4f98f8c9b3
enabled and tested optimized potrf lapack functions
11 years ago
wernsaar
536875d463
enabled and tested optimized getrs lapack functions
11 years ago
wernsaar
ac029f81b3
enabled and tested optimized dgetrf function
11 years ago
wernsaar
a35a1a9ae7
changed makefiles for lapack development
11 years ago
wernsaar
4be4db590c
Merge remote branch 'origin/develop' into armv7
12 years ago
wernsaar
fe5f46c330
added experimental support for ARMV8
12 years ago
Zhang Xianyi
5048a80032
Refs #283 . Fixed the incorrect usage of long data type for Windows 64.
12 years ago
Zhang Xianyi
73770e60b8
Refs #309 . Fixed trtri_U single thread computational bug.
12 years ago
wernsaar
95aedfa0ff
added missing file arm/Makefile in lapack/laswp
12 years ago
Zhang Xianyi
a07cc39571
Refs #266 . Fixed the compiling bug with Open64 5.0.
12 years ago
Zhang Xianyi
fd0c388681
Refs #191 . A walk around for dtrtri_U single thread bug.
This function caused the failure of ERKALE serial test.
I replaced it with LAPACK source code.
12 years ago
Zhang Xianyi
32d2ca3035
Refs #214 , #221 , #246 . Fixed the getrf overflow bug on Windows.
I used a smaller threshold since the stack size is 1MB on windows.
12 years ago
Zhang Xianyi
5d3312142a
Refs #221 #246 . Fixed the overflowing stack bug in mutlithreading BLAS3.
When NUM_THREADS(MAX_CPU_NUNBERS) is very large ,e.g. 256.
typedef struct {
volatile BLASLONG working[MAX_CPU_NUMBER][CACHE_LINE_SIZE * DIVIDE_RATE];
} job_t;
job_t job[MAX_CPU_NUMBER];
The job array is equal 8MB.
Thus, We use malloc instead of stack allocation.
12 years ago
Zhang Xianyi
4c2123c334
Fixed the overflowing bug in single thread cholesky factorization.
12 years ago
Zhang Xianyi
7bd1834d59
Refs #130 Fixed laswp building bug with DYNAMIC_ARCH=1.
13 years ago
Zhang Xianyi
1b056c5328
Refs #130 Prevent reading ipiv array beyond the bound in ?laswp. Use laswp instead of laswp_oncopy in getrf.
13 years ago
Xianyi Zhang
342bbc3871
Import GotoBLAS2 1.13 BSD version codes.
14 years ago