Chris Sidebottom
f95e7b0e32
Add infrastructure for BGEMM
Setting up all the infrastructure for BGEMM support in OpenBLAS, hopefully I found all the right places.
Derived mostly from the previous work done in https://github.com/OpenMathLib/OpenBLAS/pull/5287
Co-authored-by: Ye Tao <ye.tao@arm.com>
3 months ago
Martin Kroeker
1c5d0d5539
move libomp to extralib
5 months ago
Henry Chen
20bdb65882
Fix recursive variable expansion in Makefiles for LOONGSON3A
1 year ago
Martin Kroeker
76db713e79
fix invocation of GEMM3M tests
1 year ago
Vladimir Nikolić
56e1782ffb
Add another missing parenthesis
1 year ago
Martin Kroeker
edacf9b397
Work around spurious BLAS3 test errors on LOONGSON3R3/4 ( #4667 )
Force compilation with gfortran to use O0 on older Loongson hardware to avoid spurious test failures
1 year ago
Martin Kroeker
ba201c1939
Enable GEMM3M tests on supported platforms
1 year ago
Chip Kerchner
61c8e19f95
Fix Makefile to support OpenMP on AIX for xlc (clang) with xlf.
1 year ago
Isuru Fernando
6b2651ece3
Fix building test_sbgemm
1 year ago
Martin Kroeker
2a9981a244
Add -lgomp when IBM xlf is combined with gcc in OPENMP builds
2 years ago
Martin Kroeker
44e6e5479b
Use the C compiler for the C SBGEMM test source
2 years ago
Aiden Grossman
b209915121
Fix build with clang
There are two instances when building the tests where OpenBLAS fails to
build with OpenMP and clang due to library paths getting reset as flags
are set rather than appended. This seems to only affect certain
clang/libomp installations, but if it's already grabbing the correct
library paths we might as well use them.
2 years ago
Martin Kroeker
5dc6aa74f0
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
4 years ago
Martin Kroeker
3cfdb1770c
Remove code that disabled EXTRALIB on RISCV C910V
4 years ago
Martin Kroeker
6ae7af78a3
Support compilation with nagfor
4 years ago
Martin Kroeker
2b114c3f30
Restore proper Makefile
4 years ago
Martin Kroeker
65eb7afaf4
Change ifndef CROSS to ifneq
4 years ago
damonyu
ef8e7d0279
Add the support for RISC-V Vector.
Change-Id: Iae7800a32f5af3903c330882cdf6f292d885f266
5 years ago
Martin Kroeker
c1f4f5d4e7
Replace Makefile with simplified version again
5 years ago
Martin Kroeker
bc5c7f9578
Cleanup
5 years ago
Rajalakshmi Srinivasaraghavan
b5d30b390d
Fix build issues with bfloat16
This patch fixes compilation errors due to recent renaming from SH to SB
with BUILD_BFLOAT16.
5 years ago
Martin Kroeker
5800758b43
Rename "HALF" and "sh" to "BFLOAT16" and "sb"
5 years ago
Martin Kroeker
5f23bdf437
remove debug output
5 years ago
Martin Kroeker
08f4749eb4
Adapt tests to having only a subset of types in the build
5 years ago
Rajalakshmi Srinivasaraghavan
564b0d39ef
Add test for shgemm
This patch has Makefile changes to add test for shgemm which
compares sgemm and shgemm result.
5 years ago
Martin Kroeker
2d8781b0dc
Do not attempt to run test without fortran
5 years ago
Martin Kroeker
6a5ab083b7
Handle special case of gfortran+clang+OpenMP
7 years ago
Aleksey Kuleshov
3d50ccdc0d
allow building tests when CROSS compiling but don't run them
9 years ago
wernsaar
f0f9b25bb6
added test for CGEMM3M function
11 years ago
wernsaar
7a911569b8
added test for GEMM3M functions
11 years ago
Timothy Gu
6c2ead30f0
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
11 years ago
Sebastien Fabbro
9f0fb6e662
Respect user's LDFLAGS
12 years ago
Xianyi Zhang
57658a8c14
ref #62 . Added the user friendly message with USE_OPENMP=1. The users should use OMP_NUM_THREADS.
When OpenBLAS is compiled with USE_OPENMP=1, it ignores OPENBLAS_NUM_THREADS and GOTO_NUM_THREADS flags.Therefore, you should use OMP_NUM_THREADS.
Without setting OMP_NUM_THREADS, a process will use maximal number of threads on a computing node. Thus, if there are 2 processes on the computing node, the thread will contend against other threads on CPU cores. As a result, the application will hang.
14 years ago
Xianyi Zhang
066465af5b
Used the environment variable OPENBLAS_NUM_THREADS to set the number of threads in test.
14 years ago
Xianyi Zhang
342bbc3871
Import GotoBLAS2 1.13 BSD version codes.
14 years ago