gxw
6954845d8d
utest: Add utest for {c/z}scal and {c/z}gemv
8 months ago
gxw
e114880dc4
kernel/generic: Fixed cscal and zscal
8 months ago
Martin Kroeker
7c3a920a81
CI: Update ubuntu-latest runners to fix side effects of switch to 24.04 (#5079)
8 months ago
Martin Kroeker
a7483d181b
Merge pull request #5074 from tingboliao/develop
Optimize the gemm_tcopy_8_rvv to be compatible with the vlens 128 and 256.
8 months ago
tingbo.liao
ef7f54b357
Optimized the gemm_tcopy_8_rvv to be compatible with the vlens 128 and 256.
Signed-off-by: tingbo.liao <tingbo.liao@starfivetech.com>
8 months ago
Martin Kroeker
eba7338484
Merge pull request #5073 from XiWeiGu/la64_update_symv_lsx_version
LoongArch64: Update symv lsx version
8 months ago
gxw
e0a8216554
LoongArch64: Update dsymv LSX version
8 months ago
gxw
a9070ba3f9
LoongArch64: Update ssymv LSX version
8 months ago
Martin Kroeker
9b981035db
Merge pull request #5070 from xry111/xry111/lasx-la664
LoongArch64: Fix dsymv and ssymv LASX version
8 months ago
Martin Kroeker
fee353e63d
Merge pull request #5072 from martin-frbg/azureosx13
Azure CI: update deprecated macos-12 jobs to macos-13 image
8 months ago
Martin Kroeker
0c0112dfef
update deprecated macos-12 jobs to macos-13 image
8 months ago
Xi Ruoyao
af10c132b8
LoongArch64: Fix dsymv and ssymv LASX version
"fmov.d $f2, $f4" leaves all the bits higher than the 63-th bit
unpredictable but it's obvious that the following code uses the value of
those high bits. We actually want to replicate the lower 64 bits here,
so we should use xvreplve0.d instead.
LA464 (Loongson 3[A-Z]-5000) happens to replicate them for us due to
some uarch internal details so the issue was not detected, but for LA664
(Loongson 3[A-Z]-6000) and future uarch we need to do things correctly
or we end up getting a lot of test failures.
Closes: https://bbs.aosc.io/t/topic/302
Signed-off-by: Xi Ruoyao <xry111@xry111.site>
8 months ago
Martin Kroeker
4e817f804c
Update version to 0.3.29.dev
8 months ago
Martin Kroeker
8a316e68a5
Update version to 0.3.29.dev
8 months ago
Martin Kroeker
07756abb3e
Merge pull request #5067 from OpenMathLib/release-0.3.0
merge release 0.3.29 back into develop to copy tag
8 months ago
Martin Kroeker
8795fc7985
set version to 0.3.29
8 months ago
Martin Kroeker
e0c134e1f6
set version to 0.3.29
8 months ago
Martin Kroeker
9207052d85
Merge pull request #5066 from OpenMathLib/develop
Merge changes from develop in preparation of the 0.3.29 release
8 months ago
Martin Kroeker
7f5b703a80
Merge pull request #5065 from martin-frbg/changelog0329
Update the Changelog for version 0.3.29
8 months ago
Martin Kroeker
20f6114e98
add descriptions of build/runtime vars to 0.3.29 improvements
8 months ago
Martin Kroeker
f422845b6d
Merge pull request #5064 from martin-frbg/lapack1080
Replace LAPACK ?LARFT with a recursive implementation (Reference-LAPACK PR 1080)
8 months ago
Martin Kroeker
ce66ffe7bb
Update the Changelog for version 0.3.29
8 months ago
Martin Kroeker
d035e80d33
move the original non-recursive ?LARFT here (Reference-LAPACK PR 1080)
8 months ago
Martin Kroeker
459fa8102b
Create subdirectory for the old non-recursive ?larft
8 months ago
Martin Kroeker
0c4b4cd78c
move the non-recursive original ?larft here (Reference-LAPACK PR 1080)
8 months ago
Martin Kroeker
ed516994d6
replace ?larft with a recursive implementation (Reference-LAPACK PR 1080)
8 months ago
Martin Kroeker
5527eda561
Merge pull request #5063 from martin-frbg/lapack1062
Remove comparison that is always false (Reference-LAPACK PR 1062)
8 months ago
Martin Kroeker
4c1a23673a
Remove comparison that is always false (Reference-LAPACK PR 1062)
8 months ago
Martin Kroeker
d74eb02954
Merge pull request #5057 from martin-frbg/issue5050
Replace while loop in generic C/ZGEMM_BETA to avoid going out of bounds
8 months ago
Martin Kroeker
30f7a4120b
Merge pull request #5056 from tingboliao/dev_omatcopy_20250108
Optimize the omatcopy_cn/zomatcopy_cn kernels with RVV 1.0 intrinsic.
8 months ago
Martin Kroeker
0b9de3ef7d
Merge pull request #5042 from tingboliao/develop
Add the test cases of rot to improve the unit tests for rot_rvv.
8 months ago
Martin Kroeker
c31f148c76
Merge pull request #5061 from XiWeiGu/la64_update_symv
LoongArch64: Update symv
8 months ago
gxw
20a8e48f25
LoongArch64: Update ssymv LASX version
8 months ago
gxw
e0748588b8
LoongArch64: Update dsymv LASX version
8 months ago
Martin Kroeker
d91d4fa6e9
convert the beta=0 branch to a for loop as well
8 months ago
Martin Kroeker
8cc32f5461
Merge branch 'OpenMathLib:develop' into issue5050
8 months ago
Martin Kroeker
a588ea90fd
Merge pull request #5052 from martin-frbg/issue5051
Do not check LDVT in LAPACKE_?gesvd_work if VT is not used anyway
8 months ago
Martin Kroeker
21013fdb66
Merge pull request #5058 from martin-frbg/cirrusbsd3
Cirrus CI: Update FreeBSD jobs to 14.1
8 months ago
Martin Kroeker
b67a963412
gcc remains at 13 even for freebsd-14.1
8 months ago
Martin Kroeker
a9eec233d3
Update FreeBSD jobs to 14.1
8 months ago
Martin Kroeker
09e75f1588
fix absurd typo
8 months ago
Martin Kroeker
2891fd8d6d
Replace while loop with for
8 months ago
tingbo.liao
0a5dbf13d3
Optimize the omatcopy_cn and zomatcopy_cn kernels with RVV 1.0 intrinsic.
Signed-off-by: tingbo.liao <tingbo.liao@starfivetech.com>
8 months ago
Martin Kroeker
69d3541488
Merge pull request #5055 from fleclairTT/develop
remove fastmath from x280 target
8 months ago
Felix LeClair
d0350a08b7
Merge pull request #1 from fleclairTT/fleclairTT-remove-fast-math
Update Makefile.riscv64
8 months ago
Felix LeClair
05dce05c24
Update Makefile.riscv64
remove fast-math
8 months ago
Martin Kroeker
8385e02ae1
Do not check LDVT when VT is not going to be referenced
8 months ago
Martin Kroeker
74be676936
Merge pull request #5049 from rgommers/docs-buildsystem-page
Rewrite the "Build system" documentation page
9 months ago
Martin Kroeker
91d861a0d4
Merge pull request #5048 from barracuda156/powerpc
PPC970: fix scal
9 months ago
Ralf Gommers
f697cfe0d0
docs: improve the rendering of the HarmonyOS build instructions
9 months ago