Martin Kroeker
668f48f4fc
Use CMAKE_C_COMPILER_VERSION instead of dumpversion calls (#4698)
* Use CMAKE_C_COMPILER_VERSION throughout
1 year ago
Martin Kroeker
39c96063fb
Merge pull request #4694 from martin-frbg/issue3660
Add a minimum problem size for multithreading in GBMV
1 year ago
Martin Kroeker
f5c080f083
Fix CMAKE syntax in kernel file parsing of IFNEQ conditionals (#4695)
* Fix syntax in parsing of IFNEQ
1 year ago
Martin Kroeker
9a2a6a2e52
Merge pull request #4696 from frjohnst/restore_second
Revert PRs 4515 and 4520 (restore second, dsecnd)
1 year ago
frjohnst
87026ac1b1
Revert "fix conlict between PR 4515 and AIX shared obj support"
This reverts commit bdaa6705ca.
It turns out that PRs 4515 and 4520 break the tests under
lapack-netlib/TESTING which require SECOND and DSECND. IBM
has decided this is a bigger problem than the conflict
between lapack second_ and the xlf run time.
1 year ago
frjohnst
56d3d1039c
Revert "resolve second_ conflict which breaks xlf timef"
This reverts commit 9b24b31419.
It turns out that PRs 4515 and 4520 break the tests under
lapack-netlib/TESTING which require SECOND and DSECND. IBM
has decided this is a bigger problem than the conflict
between lapack second_ and the xlf run time.
1 year ago
Martin Kroeker
2957281275
Introduce a lower limit for multithreading
1 year ago
Martin Kroeker
5fd871d7ea
Introduce a lower limit for multithreading
1 year ago
Martin Kroeker
6ca9ffa7f5
Merge pull request #4655 from yamazakimitsufumi/update_2d_thread_distribution
Expanding the scope of 2D thread distribution to improve multi-threaded DGEMM performance
1 year ago
Martin Kroeker
b45a78c6e9
fix zdotu argument passing in utest_ext on windows (#4691)
* fix passing of results on windows
1 year ago
Martin Kroeker
1ab9f50561
Merge pull request #4690 from mattip/blasint
use blasint instead of int to quiet warnings
1 year ago
Matti Picus
243640c354
use blasint instead of int to quiet warnings
1 year ago
Martin Kroeker
f0560f906f
Merge pull request #4689 from martin-frbg/issue4684
Fix compilation of the BLAS extension utests for NO_CBLAS=1
1 year ago
Martin Kroeker
e1e0d9a2ae
Merge pull request #4688 from XiWeiGu/loongarch64_fixed_gcc14_compilation
loongarch64: Fixed GCC14 compilation issue
1 year ago
Martin Kroeker
d8baf2f2ea
Support compilation without CBLAS
1 year ago
Martin Kroeker
a6c184d150
forward NO_CFLAGS to the CFLAGS, if set
1 year ago
gxw
ecf8b588a9
loongarch64: Fixed GCC14 compilation issue
1 year ago
Martin Kroeker
8da6f7e5f2
Merge pull request #4686 from XiWeiGu/loongarch64_dgemm_kernel_16x6
Loongarch64: Improving the Performance and Stability of dgemm
1 year ago
gxw
f9a26240a7
loongarch64: Fixed icamax_lsx
1 year ago
gxw
cb0f707409
loongarch64: Fixed utest fork:safety
1 year ago
gxw
637c650f4f
loongarch64: Add buffer offset for target LOONGSON3R5
1 year ago
Martin Kroeker
5d678f1831
Merge pull request #4685 from martin-frbg/issue4660-2
Fix builds for LOONGARCH64 in LSX mode
1 year ago
Martin Kroeker
b45d8e1ab2
remove stray comma
1 year ago
Martin Kroeker
5500b4ab26
Merge pull request #4680 from theAeon/develop
Expose whether locking is enabled in get_config
1 year ago
gxw
6017ad7146
loongarch64: Update dgemm_kernel_16x4 to dgemm_kernel_16x6
1 year ago
Martin Kroeker
d66aa63478
Merge pull request #4681 from martin-frbg/fix4662-2
fix HUGETLB allocation for TLS mode as well
1 year ago
Martin Kroeker
f0f1ff7820
fix HUGETLB allocation for TLS mode as well
1 year ago
Andrew Robbins
edfe1aa471
Expose whether locking is enabled in get_config
1 year ago
Martin Kroeker
edeb5259a1
Merge pull request #4679 from martin-frbg/fix4662
Restore Loongson LA64ARCH handling
1 year ago
Martin Kroeker
4376b6f7d2
Restore Loongson LA64ARCH handling
1 year ago
Martin Kroeker
8735b54fa8
Merge pull request #4662 from martin-frbg/hugetlb-doc
Fix and document the two HUGETLB options for buffer allocation in Makefile.rule
1 year ago
Martin Kroeker
fc10673fd3
Merge branch 'develop' into hugetlb-doc
1 year ago
Martin Kroeker
c20189cc82
Merge pull request #4677 from martin-frbg/issue4676
Add autodetection of Intel Meteor Lake and Emerald Rapids
1 year ago
Martin Kroeker
bbd227ce4a
Add Intel Meteor Lake and Emerald Rapids
1 year ago
Martin Kroeker
f034745ce6
Merge pull request #4675 from martin-frbg/issue4619
Mention LD_LIBRARY_PATH in user documentation
1 year ago
Martin Kroeker
a82ecadc11
mention LD_LIBRARY_PATH
1 year ago
Martin Kroeker
b859f6f191
Merge pull request #4617 from cyk2018/patch-1
[Doc]Update user_manual.md for static linker
1 year ago
Martin Kroeker
dc99b61380
sort unwanted interdependencies of alloc_shm and alloc_hugetlb
1 year ago
Martin Kroeker
9c4e10fbd1
sort hugetlb and shm alloc options
1 year ago
Martin Kroeker
a63d71129c
Merge pull request #4671 from martin-frbg/issue4668
Silence a GCC14 warning/error in the f2c-converted LAPACK
1 year ago
Martin Kroeker
3d26837a35
Suppress GCC14 error exit in the f2c-converted LAPACK
1 year ago
Martin Kroeker
7c915e64ca
Silence a GCC14 warning/error in the f2c-converted LAPACK
1 year ago
Martin Kroeker
edacf9b397
Work around spurious BLAS3 test errors on LOONGSON3R3/4 (#4667)
Force compilation with gfortran to use O0 on older Loongson hardware to avoid spurious test failures
1 year ago
Martin Kroeker
89e3fd0821
Merge pull request #4666 from martin-frbg/issue4633
Fix spurious errors in the extended utest for INTERFACE64=1 on big-endian systems
1 year ago
Martin Kroeker
b1d722fc0c
Fix cast to work with INTERFACE64 (especially on big-endian)
1 year ago
Martin Kroeker
1031d161f6
Merge pull request #4663 from ayappanec/develop
Fix openblas_utest_ext build in AIX
1 year ago
Ayappan P
f4ee0a423b
Fix openblas_utest_ext build in AIX
1 year ago
Martin Kroeker
faf7b3d1bb
Document the two HUGETLB options for buffer allocation
1 year ago
Martin Kroeker
ab5882ebf0
Merge pull request #4661 from martin-frbg/issue4660
Fix CMAKE builds for Loongarch64
1 year ago
Martin Kroeker
69aa93e34f
Fix Loongson compiler flag check
1 year ago