Martin Kroeker
|
e46971b9d5
|
Merge branch 'develop' into release-0.3.0
|
2 years ago |
Martin Kroeker
|
5736dbae16
|
Merge pull request #3961 from martin-frbg/lapack807
Replace the conditionally defined index INDIBL with a constant (Reference-LAPACK PR807)
|
2 years ago |
Martin Kroeker
|
09ace3cf23
|
Merge pull request #3846 from lilh9598/sbgemm_opt
Improve the performance of sbgemm_tcopy on neoversen2
|
2 years ago |
Martin Kroeker
|
c48bbe93d4
|
Replace the conditionally defined index variable INDIBL with a constant 1
|
2 years ago |
Martin Kroeker
|
2d39e715e2
|
Merge pull request #3960 from martin-frbg/symmsyrk_sp
Add multithreading threshold for SYMM and rework the one for SYRK
|
2 years ago |
Martin Kroeker
|
f2d6b1c70e
|
Add multithreading threshold
|
2 years ago |
Martin Kroeker
|
a495ffc554
|
Rework multithreading threshold
|
2 years ago |
Martin Kroeker
|
1659750861
|
Add ARMV5 to CMAKE prebuild.cmake for cross-compiles (#3959)
* Add ARMV5
|
2 years ago |
Martin Kroeker
|
7f56c31e21
|
Merge pull request #3958 from tttapa/develop
Add ARMv6 support to cmake/prebuild.cmake
|
2 years ago |
Martin Kroeker
|
cf80350b89
|
Merge pull request #3957 from martin-frbg/issue3954
Fix detection of Cray CCE after recent detection changes for gfortran on Cray
|
2 years ago |
Pieter Pas
|
a79ae116e2
|
Add ARMV6 to cmake/prebuild.cmake,
based on cpuid_arm.c and param.h
|
2 years ago |
Martin Kroeker
|
280c8469d4
|
Fix detection of Cray CCE after detection changes for gfortran on Cray
CCE15 no longer produces a .ident line, so look for HPE name in a note section as well
|
2 years ago |
Martin Kroeker
|
c68afeb567
|
Merge pull request #3956 from martin-frbg/issue3955
Update Reference-LAPACK README to 3.11.0
|
2 years ago |
Martin Kroeker
|
c2b01c6f81
|
Update to 3.11.0
|
2 years ago |
Martin Kroeker
|
02920be53a
|
Merge pull request #3953 from martin-frbg/issue3801
Do not use multithreading for small SYMV/SYR2K workloads
|
2 years ago |
Martin Kroeker
|
244147495a
|
Do not use multithreading for small workloads
|
2 years ago |
Martin Kroeker
|
2885e4d898
|
Merge pull request #3951 from martin-frbg/issue3940
Do not change the global OMP_NUM_THREADS
|
2 years ago |
Martin Kroeker
|
e09e494f25
|
Merge pull request #3950 from martin-frbg/issue3856-syr
Re-enable overriding the LAPACK SYMV,SYR,SPMV and SPR implementations
|
2 years ago |
Martin Kroeker
|
dac2d7aef0
|
Merge pull request #3952 from martin-frbg/cleanup3936
Avoid unused variable warning in non-OpenMP builds
|
2 years ago |
Martin Kroeker
|
743ae5ab18
|
move declaration of openmp_nthreads under the related ifdef
|
2 years ago |
Martin Kroeker
|
579bc86671
|
remove call to omp_set_num_threads
|
2 years ago |
Martin Kroeker
|
ab32f832a8
|
fix stray blank on continuation line
|
2 years ago |
Martin Kroeker
|
e359787e28
|
restore C/Z SPMV, SPR, SYR,SYMV
|
2 years ago |
Martin Kroeker
|
6ca6eb7613
|
Re-enable overriding the LAPACK SYMV,SYR,SPMV and SPR implementations
|
2 years ago |
Martin Kroeker
|
2b77f28810
|
Merge pull request #3949 from martin-frbg/lapack750
Improve inline documentation of ?GEJSV (Reference-LAPACK PR 750)
|
2 years ago |
Martin Kroeker
|
164eafd61d
|
Improve inline documentation of ?GEJSV (Reference-LAPACK PR 750)
|
2 years ago |
Martin Kroeker
|
871b730dc5
|
Create SECURITY.md
|
2 years ago |
Martin Kroeker
|
7719dbecde
|
Merge pull request #3946 from martin-frbg/lapack682
Rewrite ?LAQR5 and S/DHGEQZ , add tests for TRECV3 (Reference-LAPACK PR 682)
|
2 years ago |
Martin Kroeker
|
147e2fbf87
|
Add numerical tests for TRECV3 (Reference-LAPACK 682)
|
2 years ago |
Martin Kroeker
|
2a83ec1f79
|
Rewrite to use FMA with Householder reflectors
|
2 years ago |
Martin Kroeker
|
e1c3c34178
|
Standardize coding style (Reference-LAPACK 682)
|
2 years ago |
Martin Kroeker
|
f7b9391119
|
Merge pull request #3945 from ahojnnes/user/joschonb/fix-prebuild-path
Wrap variables in cmake/prebuild in quotes to handle spaces in paths
|
2 years ago |
Johannes Schönberger
|
fbfe474583
|
f
|
2 years ago |
Johannes Schönberger
|
ec4700087e
|
Wrap variables in cmake/prebuild in quotes to handle spaces in paths
|
2 years ago |
Martin Kroeker
|
e8db1fe89b
|
Merge pull request #3943 from martin-frbg/llvm15
Add clang option to avoid running out of registers in AVX512 assembly
|
2 years ago |
Martin Kroeker
|
de937b3194
|
Add clang option to avoid running out of registers in AVX512 assembly
|
2 years ago |
Martin Kroeker
|
f3d21039ce
|
Improve fix from PR3924 (#3941)
* compare denominator against DBL_MIN rather than a somewhat arbitrary small number near it
|
2 years ago |
Martin Kroeker
|
8d6813ec41
|
Merge pull request #3938 from martin-frbg/issue3937
Fix CMAKE-based cross-compilation to CortexA53 (wrong DGEMM setting)
|
2 years ago |
Martin Kroeker
|
19d6795122
|
Merge pull request #3936 from martin-frbg/issue3933
Observe any stricter (OpenMP) thread count limit imposed by openblas_set_num_threads()
|
2 years ago |
Martin Kroeker
|
50c263716e
|
Correct DGEMM_UNROLL_M value for A53 in cross-compile
|
2 years ago |
Martin Kroeker
|
e298d613fa
|
initialize status variable for openblas_set_num_threads
|
2 years ago |
Martin Kroeker
|
05aa88268f
|
add status variable for openblas_set_num_threads
|
2 years ago |
Martin Kroeker
|
9f9d0012a3
|
observe thread limit imposed by openblas_set_num_threads()
|
2 years ago |
Martin Kroeker
|
fe5d3ca8e0
|
Merge pull request #3935 from martin-frbg/omp_place_num
Fix OpenMP thread counting returning places rather than cores
|
2 years ago |
Martin Kroeker
|
e38ab079a0
|
Fix OpenMP thread counting returning places rather than cores
|
2 years ago |
Martin Kroeker
|
9feaaa3f39
|
Merge pull request #3932 from martin-frbg/issue3931
Handle unrecognized ASM compiler (from Arm Compiler 22.1) in CMAKE builds
|
2 years ago |
Martin Kroeker
|
8272dfc552
|
Handle unrecognized ASM compiler (from Arm Compiler 22.1)
|
2 years ago |
Martin Kroeker
|
f616c86404
|
Merge pull request #3930 from sergei-lewis/dot-kernel-early-bail
dot.c early bail fix
|
2 years ago |
Sergei Lewis
|
cb0a70e0e2
|
dot.c early bail fix
|
2 years ago |
Martin Kroeker
|
5925178d03
|
Merge pull request #3924 from martin-frbg/numpy22025
Avoid overflow from division in GETF2 potentially causing NaN
|
2 years ago |