Martin Kroeker
d15ffb7fdf
Allow negative INCX (API change from version 3.10 of the reference implementation)
2 years ago
Martin Kroeker
a2d867f4d1
Allow negative INCX (API change from version 3.10 of the reference implementation)
2 years ago
Martin Kroeker
9a0e9c8b69
Merge pull request #4171 from boomanaiden154/clang-libomp-fixes
Fix build with some clang installations when openmp is enabled
2 years ago
Martin Kroeker
7af0f41762
Merge pull request #4189 from martin-frbg/issue4186
Prepare the interface for INCX < 0 in the new NRM2 implementation from BLAS 3.10
2 years ago
Martin Kroeker
4cc804c754
Prepare for INCX < 0 in new NRM2 implementation from BLAS 3.10
2 years ago
Martin Kroeker
afdc56a421
Merge pull request #4158 from XiWeiGu/loongarch64_update_dgemm_kernel
LoongArch64: Update dgemm kernel
2 years ago
Martin Kroeker
91e5513f3b
Merge pull request #4184 from XiWeiGu/dgemv
LoongArch64: Add dgemv_t_8_lasx.S and dgemv_n_8_lasx.S V2
2 years ago
gxw
e8b571d245
LoongArch64: Add dgemv_t_8_lasx.S and dgemv_n_8_lasx.S V2
2 years ago
gxw
71fcee6eef
LoongArch64: Update dgemm kernel
2 years ago
Martin Kroeker
0f521ece25
Merge pull request #4183 from martin-frbg/issue4181
Apply USE_TRMM to MIPS64_GENERIC as to GENERIC in gmake builds
2 years ago
Martin Kroeker
232420bdf5
Merge pull request #4182 from xianyi/revert-4153-dgemv
Revert "LoongArch64: Add dgemv_t_8_lasx.S and dgemv_n_8_lasx.S"
2 years ago
Martin Kroeker
41c31bc1d4
Revert "LoongArch64: Add dgemv_t_8_lasx.S and dgemv_n_8_lasx.S"
2 years ago
Martin Kroeker
61d803547a
Apply USE_TRMM to MIPS64_GENERIC as to GENERIC
2 years ago
Martin Kroeker
f8ee309402
Merge pull request #4153 from XiWeiGu/dgemv
LoongArch64: Add dgemv_t_8_lasx.S and dgemv_n_8_lasx.S
2 years ago
Martin Kroeker
12e98482e9
Merge pull request #4179 from martin-frbg/jenkinsfix
Run "make clean" on Jenkins first to remove stale objects
2 years ago
Martin Kroeker
51c218d17a
Update Jenkinsfile
2 years ago
Martin Kroeker
df978c90cd
Update Jenkinsfile.pwr
2 years ago
Martin Kroeker
ef4a7e3fca
Merge pull request #4127 from XiWeiGu/LoongArch64-CI
LoongArch64 CI
2 years ago
Martin Kroeker
b63e4581a3
Merge pull request #4016 from mmuetzel/ci-msys2
Add support for LLVM Flang
2 years ago
Markus Mützel
53378296c8
CI: Build with NO_AVX512 for the runners that use Flang 16.
2 years ago
Markus Mützel
1c3fcaaf42
CI (MSYS2): Re-run failed tests verbosely.
2 years ago
Markus Mützel
f334bd9041
CI (MSYS2): Use LLVM Flang on CLANG64 runners. Add CLANG32 runner.
2 years ago
Markus Mützel
57256623f4
fc.cmake: Add support for LLVM Flang.
2 years ago
gxw
ec1e96aac8
LoongArch64: Add dgemv_t_8_lasx.S and dgemv_n_8_lasx.S
2 years ago
gxw
96bf226bca
gh-actions: Add loongarch64 CI
2 years ago
gxw
db9a42f8c3
LoongArch64: using getauxval to do runtime check
Using the getauxval function can prevent errors
caused by hardware that supports vector instructions
while the kernel does not support them
2 years ago
gxw
d46772e037
LoongArch64: Add compiler feature checks
2 years ago
Martin Kroeker
8a171350db
Merge pull request #4178 from martin-frbg/llvm17
Add (gmake) support for LLVM17's new flang
2 years ago
Martin Kroeker
ef23240ab8
Merge pull request #4177 from martin-frbg/issue4176
Fix ZAXPY calls with INCX=0 on pre-AVX x86_64 and add utest
2 years ago
Martin Kroeker
e8bc8a0ee7
Add support for the new generation flang that comes with LLVM17
2 years ago
Martin Kroeker
f2c9ae9c33
Identify the new generation of flang that comes with LLVM17
2 years ago
Martin Kroeker
862d06ab8a
Add INCX=0,INCY=1 test case for CAXPY
2 years ago
Martin Kroeker
d64fa286f7
add test case for zaxpy with incx=0 incy=1
2 years ago
Martin Kroeker
4664b57e6e
use shortcut only when both incx and incy are zero
2 years ago
Martin Kroeker
c2f4bdbbb4
Merge pull request #4163 from martin-frbg/issue4017
Rework OpenMP thread count limit handling
2 years ago
Martin Kroeker
09131f79a6
Merge pull request #4164 from martin-frbg/issue4162
Enable use of AVX512 microkernels with NVIDIA HPC from version 22.3
2 years ago
Martin Kroeker
6a428b5629
Update casum_microk_skylakex-2.c
2 years ago
Martin Kroeker
ebb447e32e
Update zasum_microk_skylakex-2.c
2 years ago
Martin Kroeker
9f6847583a
nvc currently miscompiles this, hopefully fixed in release 23.09
2 years ago
Martin Kroeker
fe54ee3d15
nvc currently miscompiles this, hopefully fixed in release 23.09
2 years ago
Aiden Grossman
b209915121
Fix build with clang
There are two instances when building the tests where OpenBLAS fails to
build with OpenMP and clang due to library paths getting reset as flags
are set rather than appended. This seems to only affect certain
clang/libomp installations, but if it's already grabbing the correct
library paths we might as well use them.
2 years ago
Felix Yan
f5506b002c
Add 64-bit flag on INTERFACE64 only
2 years ago
Felix Yan
4ed6414c17
Fix 64-bit fortran options for riscv64
64-bit builds are currently broken without this flag.
Makefiles have done this already: 5720fa02c5/Makefile.system (L831)
2 years ago
Felix Yan
007cd834c1
Use defined variable for riscv64 in arch.cmake
It's defined in #4137
2 years ago
Martin Kroeker
5720fa02c5
Merge pull request #4168 from Mousius/sve-zgemm-cgemm
Use SVE zgemm/cgemm on Arm(R) Neoverse(TM) V1 core
2 years ago
Martin Kroeker
b3a5144a74
Merge pull request #4167 from Mousius/sve-zhemm-fix
Fix ZHEMM copy for SVE
2 years ago
Chris Sidebottom
84a268b6ca
Use SVE zgemm/cgemm on Arm(R) Neoverse(TM) V1 core
This patch removes the prefetches from cgemm/zgemm, which improves performance much as the same change did for sgemm/dgemm in #3868, so I'm happy to enable this on any applicable cores.
I also replicated the unrolling of the copies from sgemm and dgemm.
2 years ago
Chris Sidebottom
730ca04b48
Fix ZHEMM copy for SVE
Whilst disambiguating whilelt, I inadvertently used the wrong datatype
for offsets, which can be negative. This rectifies that.
2 years ago
Martin Kroeker
9ba9c8bdc0
Merge pull request #4165 from rgommers/docs-packaging-and-ilp64
Add documentation on redistributing OpenBLAS
2 years ago
Ralf Gommers
ee72575475
Add documentation on redistributing OpenBLAS
This touches on the following:
- build configurations
- naming of symbols, shared/static libraries and other build outputs
like pkg-config and CMake files
- (in more detail) guidance on ILP64 builds
It tries to explain that, while this is only guidance and there may be
reasons to deviate from that, for some build options there are best
practices, and for some others there are choices to make.
It also links to a number of well-maintained build recipes in order
to help packagers of other distros make choices.
Closes gh-3798
[skip ci]
2 years ago