Martin Kroeker
28f151808e
Avoid overriding the global USE_GEMM3M
1 year ago
Martin Kroeker
ba201c1939
Enable GEMM3M tests on supported platforms
1 year ago
Martin Kroeker
4adfe4d531
Avoid linking both libgomp and libomp in mixed clang/gfortran builds
1 year ago
Martin Kroeker
e9f480111e
fix sbgemm bfloat16 conversion errors introduced in PR 4488
1 year ago
Martin Kroeker
fb99fc2e6e
fix type conversion warnings
1 year ago
Chip Kerchner
61c8e19f95
Fix Makefile to support OpenMP on AIX for xlc (clang) with xlf.
1 year ago
Isuru Fernando
6b2651ece3
Fix building test_sbgemm
1 year ago
Chip-Kerchner
d46eba06a7
Pack structure only on AIX.
2 years ago
Chip-Kerchner
e98e3c4783
Fix float32_bits union so that it always the sizeof float.
2 years ago
Chip-Kerchner
97a61d0577
Fix bfloat16_bits union so that it always the sizeof unsigned short.
2 years ago
Martin Kroeker
2a9981a244
Add -lgomp when IBM xlf is combined with gcc in OPENMP builds
2 years ago
Martin Kroeker
44e6e5479b
Use the C compiler for the C SBGEMM test source
2 years ago
Aiden Grossman
b209915121
Fix build with clang
There are two instances when building the tests where OpenBLAS fails to
build with OpenMP and clang due to library paths getting reset as flags
are set rather than appended. This seems to only affect certain
clang/libomp installations, but if it's already grabbing the correct
library paths we might as well use them.
2 years ago
Martin Kroeker
3d338b57de
remove spurious loops
3 years ago
Martin Kroeker
d9dc015cfc
Use blasint for INTERFACE64 compatibility
3 years ago
Rajalakshmi Srinivasaraghavan
1d97405c02
POWER: Enable bfloat16 kernels by default
This patch enables bfloat16 kernels by default for POWER processors.
Tested on Linux POWER8, POWER9, POWER10 and AIX POWER10 systems.
3 years ago
Martin Kroeker
74dba0c90a
Fix missing braces from previous commit (PR3600)
3 years ago
Martin Kroeker
bef3871833
Disable flang (over-)optimizations in BLAS tests ( #3600 )
* limit flang optimizations to -O2
3 years ago
Martin Kroeker
145ccf8943
Prevent powershell from adding a BOM to test input ( #3595 )
* Prevent addition of a BOM to test input (which would distort the names of output files)
3 years ago
Martin Kroeker
2408315d10
Skip tests if Windows powershell added a BOM
3 years ago
Martin Kroeker
5dc6aa74f0
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
4 years ago
Martin Kroeker
b4f4ed378b
Disable gfortran tree vectorizer to avoid gcc11+ miscompilation at O3
4 years ago
Martin Kroeker
3cfdb1770c
Remove code that disabled EXTRALIB on RISCV C910V
4 years ago
Martin Kroeker
30d835168a
Merge pull request #3088 from xoviat/msvc
add misc fixes.
4 years ago
Martin Kroeker
6ae7af78a3
Support compilation with nagfor
4 years ago
xoviat
3165c915b6
fix test helpers
4 years ago
xoviat
457ccc42c9
Merge branch 'develop' into msvc
4 years ago
Martin Kroeker
2b114c3f30
Restore proper Makefile
4 years ago
Martin Kroeker
65eb7afaf4
Change ifndef CROSS to ifneq
4 years ago
damonyu
ef8e7d0279
Add the support for RISC-V Vector.
Change-Id: Iae7800a32f5af3903c330882cdf6f292d885f266
5 years ago
Martin Kroeker
c1f4f5d4e7
Replace Makefile with simplified version again
5 years ago
Martin Kroeker
bc5c7f9578
Cleanup
5 years ago
Rajalakshmi Srinivasaraghavan
b5d30b390d
Fix build issues with bfloat16
This patch fixes compilation errors due to recent renaming from SH to SB
with BUILD_BFLOAT16.
5 years ago
Martin Kroeker
5800758b43
Rename "HALF" and "sh" to "BFLOAT16" and "sb"
5 years ago
Martin Kroeker
924fd806d0
Rename "HALF" and "sh" to "BFLOAT16" and "sb"
5 years ago
Martin Kroeker
4db09c6cec
Rename compare_sgemm_shgemm.c to compare_sgemm_sbgemm.c
5 years ago
Martin Kroeker
5f23bdf437
remove debug output
5 years ago
Martin Kroeker
08f4749eb4
Adapt tests to having only a subset of types in the build
5 years ago
Martin Kroeker
0eaae30e8c
Adapt tests to having only a subset of types in the build
5 years ago
Martin Kroeker
593ce9e237
Make building individual tests depend on BUILD_SINGLE etc defines
5 years ago
Martin Kroeker
18a11137f1
Update BLAS tests to correspond to Reference-LAPACK 3.9.0
replaces calculation of machine precision with call to epsilon intrinsic and removes the requirement for previous output files to be removed before rerunning tests
5 years ago
Martin Kroeker
13c28889a2
Update "cosmetic fixes for non-C99 compilers"
5 years ago
Martin Kroeker
28915eed72
Cosmetic fixes for non-C99 compilers
5 years ago
Rajalakshmi Srinivasaraghavan
8efba9b7c0
Improve shgemm test
This patch adds another check to test shgemm results.
5 years ago
Rajalakshmi Srinivasaraghavan
564b0d39ef
Add test for shgemm
This patch has Makefile changes to add test for shgemm which
compares sgemm and shgemm result.
5 years ago
Rajalakshmi Srinivasaraghavan
7eb55504b1
RFC : Add half precision gemm for bfloat16 in OpenBLAS
This patch adds support for bfloat16 data type matrix multiplication kernel.
For architectures that don't support bfloat16, it is defined as unsigned short
(2 bytes). Default unroll sizes can be changed as per architecture as done for
SGEMM and for now 8 and 4 are used for M and N. Size of ncopy/tcopy can be
changed as per architecture requirement and for now, size 2 is used.
Added shgemm in kernel/power/KERNEL.POWER9 and tested in powerpc64le and
powerpc64. For reference, added a small test compare_sgemm_shgemm.c to compare
sgemm and shgemm output.
This patch does not cover OpenBLAS test, benchmark and lapack tests for shgemm.
Complex type implementation can be discussed and added once this is approved.
5 years ago
Martin Kroeker
2d8781b0dc
Do not attempt to run test without fortran
5 years ago
xoviat
dbf9ad1f3d
tests: add windows compatibility
6 years ago
luz.paz
daf2fec12d
Misc. typo fixes
Found via `codespell -q 3 -w -L ith,als,dum,nd,amin,nto,wis,ba -S ./relapack,./kernel,./lapack-netlib`
6 years ago
Martin Kroeker
6a5ab083b7
Handle special case of gfortran+clang+OpenMP
7 years ago