Chip Kerchner
|
c23897f585
|
Add GEMV testing to SBGEMx vs SGEMx testing.
|
1 year ago |
Martin Kroeker
|
6452f7b46d
|
Merge pull request #4873 from ChipKerchner/fixSBGEMMDefaults
[POWER] Problem with multi-threaded SBGEMM
|
1 year ago |
Martin Kroeker
|
ca7777de18
|
Merge pull request #4870 from chenx97/fix-recursive-make-var
Fix recursive variable expansion in Makefiles for LOONGSON3A
|
1 year ago |
Chip Kerchner
|
31226740d6
|
Cleanup of SBGEMM unit test.
|
1 year ago |
Henry Chen
|
20bdb65882
|
Fix recursive variable expansion in Makefiles for LOONGSON3A
|
1 year ago |
Chip Kerchner
|
b1737698db
|
Fix DEFAULTS in SBGEMM for POWER10. Also comparisons for SBGEMM unit test can be exactly due to epilison differences.
|
1 year ago |
Martin Kroeker
|
e5525036e7
|
Merge pull request #4865 from martin-frbg/issue4856
Tweak LAPACK STFSM test threshold a little more to cover POWER10 fma
|
1 year ago |
Martin Kroeker
|
fd52d09490
|
Merge pull request #4864 from martin-frbg/issue4862
Spell out function prototypes in the SYRK calls of potrf_parallel
|
1 year ago |
Martin Kroeker
|
35dd625adf
|
Merge pull request #4859 from martin-frbg/cooper_sb
Address clang array overflow warning in the SBGEMV microkernel for Cooper Lake
|
1 year ago |
Martin Kroeker
|
d8f740791a
|
tweak threshold a little more to cover POWER10 fma
|
1 year ago |
Martin Kroeker
|
73e13b0273
|
flesh out HERK prototype
|
1 year ago |
Martin Kroeker
|
824306baab
|
flesh out HERK prototype
|
1 year ago |
Martin Kroeker
|
7ca835a82c
|
address clang array overflow warning
|
1 year ago |
Martin Kroeker
|
a87c4d26dd
|
Merge pull request #4857 from nekopsykose/ppc
fix cmake typo for power10 cc version check
|
1 year ago |
psykose
|
1265eee85c
|
fix cmake typo for power10 cc version check
fixes 668f48f4fc
|
1 year ago |
Martin Kroeker
|
cd3945b998
|
Update version to 0.3.28.dev
|
1 year ago |
Martin Kroeker
|
cbd321aecb
|
Update versin to 0.3.28.dev
|
1 year ago |
Martin Kroeker
|
cb38d666da
|
Merge pull request #4855 from OpenMathLib/release-0.3.0
Merge release branch back into develop to copy tag
|
1 year ago |
Martin Kroeker
|
5ef8b19646
|
Merge pull request #4854 from OpenMathLib/develop
merge develop in preparation of the 0.3.28 release
|
1 year ago |
Martin Kroeker
|
884a949a0d
|
Merge branch 'release-0.3.0' into develop
|
1 year ago |
Martin Kroeker
|
116bc767d8
|
Update version to 0.3.28
|
1 year ago |
Martin Kroeker
|
91d6722a3d
|
Update version to 0.3.28
|
1 year ago |
Martin Kroeker
|
2c8e001efe
|
Merge pull request #4853 from martin-frbg/changelog0328
Update Changelog.txt for 0.3.28
|
1 year ago |
Martin Kroeker
|
1c2bfea1bb
|
Merge pull request #4852 from martin-frbg/fix4814
Disable forwarding from SBGEMM to SBGEMV for now
|
1 year ago |
Martin Kroeker
|
1df95bb23a
|
Update Changelog.txt for 0.3.28
|
1 year ago |
Martin Kroeker
|
7878976236
|
disable forwarding from SBGEMM to SBGEMV for now
|
1 year ago |
Martin Kroeker
|
d92cc96978
|
Merge pull request #4851 from martin-frbg/test3m
Fix invocation of GEMM3M tests in gmake builds
|
1 year ago |
Martin Kroeker
|
76db713e79
|
fix invocation of GEMM3M tests
|
1 year ago |
Martin Kroeker
|
deae7cf1ec
|
Merge pull request #4850 from martin-frbg/generic_3m
Make the dummy GEMM3M kernel for GENERIC targets forward to regular GEMM for now
|
1 year ago |
Martin Kroeker
|
46e331a917
|
remove the unworkable GEMM3M restriction from GENERIC again
|
1 year ago |
Martin Kroeker
|
ccc23338d7
|
have the dummy GEMM3M kernel at least forward to regular GEMM
|
1 year ago |
Martin Kroeker
|
753c7ebe17
|
Merge pull request #4835 from martin-frbg/revertwin4359
Temporarily revert to the coarse-grained locking in the Windows thread server
|
1 year ago |
Martin Kroeker
|
3b8d7dfdca
|
Merge pull request #4846 from martin-frbg/lapack1025
Make the type used for the "hidden" string length argument configurable (adapted from Reference-LAPACK PR 1025)
|
1 year ago |
Martin Kroeker
|
797ae08dbe
|
Add explanation of LAPACK_STRLEN
|
1 year ago |
Martin Kroeker
|
923b79de47
|
make the type of the hidden arguments configurable via LAPACK_STRLEN (Reference-LAPACK PR 1025)
|
1 year ago |
Martin Kroeker
|
cc36db643e
|
Support new LAPACK build option LAPACK_STRLEN
|
1 year ago |
Martin Kroeker
|
7e8118d94e
|
Support new build option LAPACK_STRLEN
|
1 year ago |
Martin Kroeker
|
5bdd3a05f0
|
Merge pull request #4841 from martin-frbg/lapack1033
Prevent compilers from using FMA that could increase error in ?GEEVX (Reference-LAPACK PR 1033)
|
1 year ago |
Martin Kroeker
|
ae9e0e36c3
|
Merge pull request #4842 from martin-frbg/lapack1030
Fix typos and sytrd boundary workspace (Reference-LAPACK PR 1030)
|
1 year ago |
Martin Kroeker
|
bce48d4a13
|
Fix typos and sytrd boundary workspace (Reference-LAPACK PR 1030)
|
1 year ago |
Martin Kroeker
|
c8b4ceca85
|
prevent compilers from using FMA (Reference-LAPACK PR 1033)
|
1 year ago |
Martin Kroeker
|
14a8a9a43c
|
Merge pull request #4840 from martin-frbg/issue4823
set MACOSX_RPATH to true on Apple
|
1 year ago |
Martin Kroeker
|
a4845fa12d
|
set MACOSX_RPATH to true on Apple
|
1 year ago |
Martin Kroeker
|
19f8a8d61c
|
Merge pull request #4839 from martin-frbg/fix4794
Add proper returns in x86_64 s/dscal kernels
|
1 year ago |
Martin Kroeker
|
cf483d9f64
|
Merge pull request #4836 from martin-frbg/issue4275-3
use TARGET rather than CORE from Makefile.conf_last to fill in pkgconfig
|
1 year ago |
Martin Kroeker
|
50397e017a
|
Merge pull request #4838 from martin-frbg/fix4662-3
fix invalid ifdef syntax in HUGETLB handling
|
1 year ago |
Martin Kroeker
|
ae27b02213
|
Merge pull request #4837 from martin-frbg/dyn_riscv_cmake
Add CMAKE support for RISCV64 DYNAMIC_ARCH
|
1 year ago |
Martin Kroeker
|
f1c9803f9a
|
add proper return statement
|
1 year ago |
Martin Kroeker
|
60abcc3991
|
add proper return statement
|
1 year ago |
Martin Kroeker
|
5257f807a9
|
fix invalid ifdef syntax in HUGETLB handling
|
1 year ago |