Martin Kroeker
|
b89fb708ca
|
Update version to 0.3.21
|
3 years ago |
Martin Kroeker
|
9a34217cc6
|
Merge pull request #3717 from xianyi/develop
Update from develop for 0.3.21 release
|
3 years ago |
Martin Kroeker
|
79f54f266d
|
Update version to 0.3.21
|
3 years ago |
Martin Kroeker
|
94cba8e3c5
|
Merge pull request #3716 from martin-frbg/0321changes
Update Changelog for 0.3.21
|
3 years ago |
Martin Kroeker
|
25ce2e2a63
|
Update with 0.3.21 changes
|
3 years ago |
Martin Kroeker
|
9f89b62b25
|
Merge pull request #3715 from martin-frbg/issue3648
Increase thresholds for STFSM and CTFSM in the LAPACK testsuite
|
3 years ago |
Martin Kroeker
|
9c4e91a77d
|
Increase threshold
|
3 years ago |
Martin Kroeker
|
1fe06caf49
|
Increase threshold
|
3 years ago |
Martin Kroeker
|
ff58e9a7f1
|
Merge pull request #3609 from martin-frbg/lapack3101
Update LAPACK/LAPACKE to Reference-LAPACK 3.10.1
|
3 years ago |
Martin Kroeker
|
f6a1854ce9
|
resync gensymbol with develop
|
3 years ago |
Martin Kroeker
|
2bee490287
|
Merge pull request #3714 from martin-frbg/crosscmake
Add more x86_64 target definitions for CMAKE cross-compiling
|
3 years ago |
Martin Kroeker
|
85fd3c4279
|
Support compilation with the Cray C and Fortran compilers (#3712)
* Add support for the Cray Fortran compiler
|
3 years ago |
Martin Kroeker
|
3784b3d45c
|
Add more x86_64 target definitions for cross-compiling
|
3 years ago |
Martin Kroeker
|
096ae6f2bd
|
Merge pull request #3709 from nursik/develop
Add TCORE Generic
|
3 years ago |
Martin Kroeker
|
19fefd100e
|
Merge pull request #3703 from martin-frbg/omp_adaptive
Add env variable OMP_ADAPTIVE to control OMP threadpool behaviour
|
3 years ago |
Martin Kroeker
|
2e51a61914
|
Merge pull request #3693 from Mayank-Raj3/Mayank-Raj3-patch-1
Corrected indentation of for and if statements in dgemv_thread_safety.cpp
|
3 years ago |
Nursultan Zarlyk
|
a7ac252fd9
|
Add TCORE Generic in prebuild.cmake
During cross-compilation on an x64 host with MSVC for ARMv8, the
build fails because there are no define directives for the Generic core.
|
3 years ago |
Martin Kroeker
|
648a69a67e
|
Merge pull request #3707 from martin-frbg/getarch_risc
Fix crash in RISCV autodetection when pmodel is not present in /proc/cpuinfo
|
3 years ago |
Martin Kroeker
|
ef9c976a94
|
Really fix compilation; fix crash when pmodel is not present in cpuinfo
|
3 years ago |
Martin Kroeker
|
f727235be4
|
Merge pull request #3706 from martin-frbg/czifunding
Acknowledge past CZI EOSS 1/EOSS 3 funding
|
3 years ago |
Martin Kroeker
|
880bc1d1db
|
Acknowledge past CZI EOSS 1/EOSS 3 funding
|
3 years ago |
Martin Kroeker
|
d0ba257de0
|
Merge pull request #3704 from XiWeiGu/loongarch64_dynamic_arch
LoongArch64: Add DYNAMIC_ARCH support
|
3 years ago |
Martin Kroeker
|
78da6a750a
|
Merge pull request #3705 from RajalakshmiSR/bf16ppc
POWER: Enable bfloat16 kernels by default
|
3 years ago |
Rajalakshmi Srinivasaraghavan
|
1d97405c02
|
POWER: Enable bfloat16 kernels by default
This patch enables bfloat16 kernels by default for POWER processors.
Tested on Linux POWER8, POWER9, POWER10 and AIX POWER10 systems.
|
3 years ago |
gxw
|
fbfe1daf6e
|
LoongArch64: Add DYNAMIC_ARCH support
|
3 years ago |
Martin Kroeker
|
80cdfed7b2
|
Use OMP_ADAPTIVE setting to choose between static and dynamic OMP threadpool size
|
3 years ago |
Martin Kroeker
|
08e3754b39
|
Add environment variable OMP_ADAPTIVE
|
3 years ago |
Martin Kroeker
|
047a279f09
|
Merge pull request #3702 from martin-frbg/issue3687
Add openblas_getaffinity() extension (Linux-only)
|
3 years ago |
Martin Kroeker
|
30473b6a9d
|
add openblas_getaffinity()
|
3 years ago |
Martin Kroeker
|
8668571040
|
add openblas_getaffinity()
|
3 years ago |
Martin Kroeker
|
daca01622b
|
fix detection of Neoverse V1 and user-enforced selection of N2 in ARM64 DYNAMIC_ARCH (#3700)
* fix detection of Neoverse V1 and user-enforced selection of N2
|
3 years ago |
Martin Kroeker
|
c322aab685
|
Merge pull request #3684 from imzhuhl/neoversen2_dynamic_arch
Neoverse N2: DYNAMIC_ARCH
|
3 years ago |
Martin Kroeker
|
cf796aee8c
|
Merge pull request #3699 from martin-frbg/issue3692
Add c_check recognition of Fujitsu fcc for Fugaku A64FX
|
3 years ago |
Martin Kroeker
|
28d40ba60b
|
Merge pull request #3696 from XiWeiGu/loongson2k1000
LoongArch64: Add core LOONGSON2K1000 and LOONGSONGENERIC
|
3 years ago |
Martin Kroeker
|
692848d20c
|
typo fix
|
3 years ago |
Martin Kroeker
|
76ea7739dd
|
Merge pull request #3698 from martin-frbg/issue3697
utest needs to be linked against libm on QNX as well
|
3 years ago |
Martin Kroeker
|
f8c5bdfbab
|
Treat Fujitsu fcc on Fugaku like clang
|
3 years ago |
Martin Kroeker
|
70001e1e9e
|
Add Fujitsu compiler
|
3 years ago |
Martin Kroeker
|
cf37182260
|
Add Fujitsu compiler (fcc)
|
3 years ago |
Martin Kroeker
|
68d86ea150
|
Add Fujitsu compiler
|
3 years ago |
Martin Kroeker
|
7aaa0ce0e8
|
utest needs to be linked against libm on QNX as well
|
3 years ago |
Martin Kroeker
|
cd8e57040c
|
Merge pull request #3691 from martin-frbg/issue3679-sparc
SPARC: fix DNRM2 returning INF instead of zero due to intermediate overflow
|
3 years ago |
gxw
|
3573306a69
|
LoongArch64: Add core LOONGSON2K1000 and LOONGSONGENERIC
|
3 years ago |
Martin Kroeker
|
a4303ae378
|
Merge pull request #3695 from martin-frbg/ppc6nrm2
PPC6: Fix DNRM2 returning INF instead of zero due to intermediate overflow
|
3 years ago |
Martin Kroeker
|
31377d04f0
|
Merge pull request #3694 from martin-frbg/traviswait
Add back travis_wait to keep ppc jobs from getting cancelled
|
3 years ago |
Martin Kroeker
|
6c118b7977
|
Fix DNRM2 returning INF instead of zero due to intermediate overflow
|
3 years ago |
Martin Kroeker
|
b60415a347
|
Add back travis_wait to keep ppc jobs from getting cancelled
|
3 years ago |
Mayank Raj
|
a9939111d7
|
Update dgemv_thread_safety.cpp
|
3 years ago |
Martin Kroeker
|
c43ec53bdd
|
Merge pull request #3690 from RajalakshmiSR/cdotp10
POWER: Fix complex dot function failures
|
3 years ago |
Martin Kroeker
|
b7c65d08cb
|
Merge pull request #3689 from RajalakshmiSR/dgemvgcc10
POWER10: dgemv builtin rename
|
3 years ago |