Martin Kroeker
05fe49ddaf
Rename local copy functions to avoid name clash with the standard BLAS ones
9 months ago
Martin Kroeker
64c6c79201
Assume no underline suffixes on symbols when compiling with Intel ifx on Windows
9 months ago
Martin Kroeker
5c9417d306
Assume no underline suffixes on symbols when compiling with ifx on Windows
9 months ago
Martin Kroeker
5d81e514e4
Assume no underline suffixes on symbols when compiling with ifx on Windows
9 months ago
Martin Kroeker
d78fbe425c
Assume no underline suffixes on symbols when compiling with ifx on Windows
9 months ago
Martin Kroeker
30188a55d1
Don't assume underlined symbols for ifx; make cpuid.S inclusion conditional
9 months ago
Martin Kroeker
32319a33ac
Add options for Intel oneAPI 2025.0 ifx on Windows
9 months ago
Martin Kroeker
37a4ca7e46
Merge pull request #5025 from martin-frbg/nvidia_arm64
Add target-specific options to enable ARM64 SVE with the NVIDIA compiler
9 months ago
Martin Kroeker
1c4401ebf1
Add target-specific options to enable SVE with the NVIDIA compiler
9 months ago
Martin Kroeker
f2be482d43
Merge pull request #5024 from martin-frbg/issue5001
Improve the wording of the build instructions for Windows on Arm in the docs
9 months ago
Martin Kroeker
70dddacb9f
Merge pull request #5023 from rgommers/fix-warnings
Fix two compiler warnings in `memory.c`
9 months ago
Martin Kroeker
a93d3db34a
fix formatting of WoA section
9 months ago
Martin Kroeker
e460512685
Update WoA build instructions from rewording in issue #5001
9 months ago
Martin Kroeker
d3cc8c65ed
Merge pull request #5022 from tingboliao/develop
Replace the __riscv_vid_v_i32m2 and __riscv_vid_v_i64m2 with __riscv…_vid_v_u32m2 and __riscv_vid_v_u64m2 for riscv64-unknown-linux-gnu-gcc compiling.
9 months ago
Ralf Gommers
765ad8bcd2
Fix guard around `alloc_hugetlb`, fixes compile warning
The warning was:
```
/home/rgommers/code/pixi-dev-scipystack/openblas/OpenBLAS/driver/others/memory.c: At top level:
/home/rgommers/code/pixi-dev-scipystack/openblas/OpenBLAS/driver/others/memory.c:2565:14: warning: 'alloc_hugetlb' defined but not used [-Wunused-function]
2565 | static void *alloc_hugetlb(void *address){
| ^~~~~~~~~~~~~
```
The added define is the same as is already present in the TLS part of
`memory.c`. This follows up on gh-4681.
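
As an illustration of this class of warning (a minimal sketch, not the actual change in `memory.c`): a `static` function triggers `-Wunused-function` when its definition is compiled in but every caller is compiled out, so the definition and its uses need the same preprocessor guard. The macro `DEMO_USE_HUGEPAGES` and the functions `alloc_plain`, `alloc_huge`, and `count_allocators` are hypothetical names chosen for this example.

```c
#include <stddef.h>

/* Hypothetical feature switch for this sketch; not a macro from memory.c. */
#ifndef DEMO_USE_HUGEPAGES
#define DEMO_USE_HUGEPAGES 0
#endif

/* Always-available allocator stub. */
static void *alloc_plain(void *address) { return address; }

/* The definition is guarded by the same condition as its only use below.
 * Without this guard the function would be "defined but not used"
 * whenever DEMO_USE_HUGEPAGES is 0. */
#if DEMO_USE_HUGEPAGES
static void *alloc_huge(void *address) { return address; }
#endif

typedef void *(*alloc_fn)(void *);

/* Returns how many allocator functions were compiled into the table. */
int count_allocators(void) {
    alloc_fn table[] = {
#if DEMO_USE_HUGEPAGES
        alloc_huge,
#endif
        alloc_plain,
    };
    return (int)(sizeof table / sizeof table[0]);
}
```

With the default setting, `count_allocators()` reports a single entry and the file compiles cleanly under `-Wall`; removing the guard around `alloc_huge` alone would bring back the warning.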
9 months ago
Ralf Gommers
48caf2303d
Fix build warning about discarding volatile qualifier in memory.c
The warning was:
```
[4339/5327] Building C object driver/others/CMakeFiles/driver_others.dir/memory.c.o
/home/rgommers/code/pixi-dev-scipystack/openblas/OpenBLAS/driver/others/memory.c: In function 'blas_shutdown':
/home/rgommers/code/pixi-dev-scipystack/openblas/OpenBLAS/driver/others/memory.c:3257:10: warning: passing argument 1 of 'free' discards 'volatile' qualifier from pointer target type [-Wdiscarded-qualifiers]
3257 | free(newmemory);
| ^~~~~~~~~
In file included from /home/rgommers/code/pixi-dev-scipystack/openblas/OpenBLAS/common.h:83,
from /home/rgommers/code/pixi-dev-scipystack/openblas/OpenBLAS/driver/others/memory.c:74:
/home/rgommers/code/pixi-dev-scipystack/openblas/.pixi/envs/default/x86_64-conda-linux-gnu/sysroot/usr/include/stdlib.h:482:25: note: expected 'void *' but argument is of type 'volatile struct newmemstruct *'
482 | extern void free (void *__ptr) __THROW;
| ~~~~~~^~~~~
```
The use of `volatile` for `newmemstruct` appears to be intentional, and there are
more such constructs in this file. The warning appeared after gh-4451
and is correct. The `free` prototype doesn't expect a volatile pointer,
hence this change adds a cast to silence the warning.
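
The pattern described above can be sketched as follows. This is a simplified illustration under stated assumptions, not the code from `memory.c`: `struct node`, `make_list`, and `release_all` are hypothetical names, and the list is a stand-in for the volatile-qualified structure mentioned in the warning.

```c
#include <stdlib.h>

/* Hypothetical stand-in for a list node reached through a volatile pointer. */
struct node {
    struct node *next;
    int value;
};

/* Builds a list of up to n nodes and returns it through a
 * volatile-qualified pointer, mirroring the situation in the warning. */
volatile struct node *make_list(int n) {
    volatile struct node *head = NULL;
    for (int i = 0; i < n; i++) {
        struct node *fresh = malloc(sizeof *fresh);
        if (fresh == NULL) break;          /* stop early on allocation failure */
        fresh->value = i;
        fresh->next = (struct node *)head; /* drop the qualifier for storage */
        head = fresh;
    }
    return head;
}

/* Frees every node and returns the number of nodes released. */
int release_all(volatile struct node *head) {
    int freed = 0;
    while (head != NULL) {
        volatile struct node *next = head->next;
        /* free() takes 'void *', so passing 'head' directly would discard
         * the 'volatile' qualifier implicitly and trigger
         * -Wdiscarded-qualifiers. The explicit cast states the intent. */
        free((void *)head);
        head = next;
        freed++;
    }
    return freed;
}
```

Calling `release_all(make_list(3))` frees the three nodes without a diagnostic. The cast only affects the type seen by `free`; accesses through `head` before the call keep their volatile semantics.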
9 months ago
tingbo.liao
d00cc400b1
Replaced the __riscv_vid_v_i32m2 and __riscv_vid_v_i64m2 with __riscv_vid_v_u32m2 and __riscv_vid_v_u64m2 for riscv64-unknown-linux-gnu-gcc compiling.
Signed-off-by: tingbo.liao <tingbo.liao@starfivetech.com>
9 months ago
Martin Kroeker
229d8a025e
Merge pull request #4959 from CDAC-Bengaluru/level-1-sve
SVE Implementation for Level-1 BLAS Routines
9 months ago
SushilPratap04
3368a4e697
Update swap_kernel_sve.c
9 months ago
CDAC-SSDG
dd71e4234a
Add updated swap and rot SVE kernels.
9 months ago
CDAC-SSDG
06ffd411a5
Update KERNEL.ARMV8SVE
9 months ago
CDAC-SSDG
41912f9c22
Update CONTRIBUTORS.md
9 months ago
CDAC-SSDG
765850194e
Delete kernel/arm64/swap_kernel_sve.c
9 months ago
CDAC-SSDG
c17c19fbcf
Delete kernel/arm64/swap_kernel_c.c
9 months ago
CDAC-SSDG
f6416c0e37
Delete kernel/arm64/swap.c
9 months ago
CDAC-SSDG
3b7b74664c
Delete kernel/arm64/scal_kernel_sve.c
9 months ago
CDAC-SSDG
95a97012e8
Delete kernel/arm64/scal_kernel_c.c
9 months ago
CDAC-SSDG
5540f2121e
Delete kernel/arm64/scal.c
9 months ago
CDAC-SSDG
f62519cc87
Delete kernel/arm64/rot_kernel_sve.c
9 months ago
CDAC-SSDG
10857c9df4
Delete kernel/arm64/rot_kernel_c.c
9 months ago
CDAC-SSDG
b9f51a5cf7
Delete kernel/arm64/rot.c
9 months ago
Martin Kroeker
89f02ed394
Merge pull request #5014 from martin-frbg/issue5013
Add some missed lapack 3.11+ symbols to gensymbol
9 months ago
Martin Kroeker
61d5aec7c1
remove typo
9 months ago
Martin Kroeker
5aea097df0
add missing lapack 3.11+ symbols
9 months ago
Martin Kroeker
72f7b7011c
Merge pull request #5009 from martin-frbg/pybenchdoc
DOCS, pybench : Add build notes for Windows and flang from gh Discussion 5008
10 months ago
Martin Kroeker
0f8ff82592
Add build notes for Windows and flang from gh Discussion 5008
10 months ago
Martin Kroeker
81666de4ef
Merge pull request #5007 from martin-frbg/issue5006
Revert the NRM2 kernels for NeoverseN2 and ARMV8SVE targets to the generic NEON version
10 months ago
Martin Kroeker
230e665bca
Merge pull request #4996 from iha-taisei/sdgemv_sve_unroll
Loop-unrolled transposed [SD]GEMV kernels for A64FX and Neoverse V1
10 months ago
Martin Kroeker
3345007d8f
retire the thunderx2 NRM2 kernels due to reported inaccuracies and NAN
10 months ago
Martin Kroeker
5fe983db29
retire the thunderx2 nrm2 kernels for now due to NAN and inaccuracies
10 months ago
Martin Kroeker
5dc4d7dd7e
Merge pull request #5005 from martin-frbg/evbarm
Improve support for NetBSD on arm64 (evbarm)
10 months ago
Martin Kroeker
4ba471dd5a
Merge pull request #5003 from mathomp4/bugfix/nag-pic
Fixes for NAG Compiler
10 months ago
Martin Kroeker
a791912cbb
handle uname returning evbarm on NetBSD
10 months ago
Martin Kroeker
1a6ecda398
utilize /proc/cpuinfo on NetBSD too
10 months ago
Matthew Thompson
c4e8bac5a5
Fix indent
10 months ago
Matthew Thompson
d3b2036d49
Move to use ERROR STOP instead of ABORT
10 months ago
Matthew Thompson
35334ed2ea
Fixes for Fortran Standards violations for lapack-netlib
10 months ago
Matthew Thompson
be19966d3b
Fixes for NAG CMake
10 months ago
Martin Kroeker
9c5d20187b
Merge pull request #4999 from dg0yt/macro-failed
Fix redefinition of FAILED
10 months ago
Matthew Thompson
2eaf285de5
Use F_COMPILER name
10 months ago