Sergei Lewis
3ffd6868d7
Merge branch 'develop' into dev/slewis/merge-from-riscv
1 year ago
Martin Kroeker
1a6fdb0353
Add prototypes for extensions ?AMIN/?AMAX and CAXPYC/ZAXPYC
1 year ago
Sergei Lewis
1093def0d1
Merge branch 'risc-v' into develop
1 year ago
kseniyazaytseva
ff41cf5c49
Fix BLAS, BLAS-like functions and Generic RISC-V kernels
* Fixed gemmt, imatcopy, zimatcopy_cnc functions
* Fixed cblas_cscal testing in ctest
* Removed rotmg unreacheble code
* Added zero size checks
2 years ago
Martin Kroeker
2e2e538b7c
Add openblas_set_num_threads_local() and use of blas_omp_threads_local in OMP parallel regions
1 year ago
Martin Kroeker
c2078b2356
Mark xerbla's arguments as const
2 years ago
Martin Kroeker
8668571040
add openblas_getaffinity()
3 years ago
Wangyang Guo
2e44ca0136
sbgemm: add missing cblas_sbgemm definition
4 years ago
Martin Kroeker
25c986db5a
Add prototypes for CBLAS_CROTG and CBLAS_ZROTG
4 years ago
Martin Kroeker
9ccb12b031
Add prototypes for cblas_csrot and cblas_zdrot
4 years ago
Chen, Guobing
a7b1f9b1bb
Implementation of BF16 based gemv
1. Add a new API -- sbgemv to support bfloat16 based gemv
2. Implement a generic kernel for sbgemv
3. Implement an avx512-bf16 based kernel for sbgemv
Signed-off-by: Chen, Guobing <guobing.chen@intel.com>
5 years ago
Rajalakshmi Srinivasaraghavan
b5d30b390d
Fix build issues with bfloat16
This patch fixes compilation errors due to recent renaming from SH to SB
with BUILD_BFLOAT16.
5 years ago
Martin Kroeker
ca31c32693
Rename "HALF" and "sh" to "BFLOAT16" and "sb"
5 years ago
Chen, Guobing
deaeb6c5b8
Add bfloat16 based dot and conversion with single/double
1. Added bfloat16 based dot as new API: shdot
2. Implemented generic kernel and cooperlake-specific (AVX512-BF16) kernel for shdot
3. Added 4 conversion APIs for bfloat16 data type <=> single/double: shstobf16 shdtobf16 sbf16tos dbf16tod
shstobf16 -- convert single float array to bfloat16 array
shdtobf16 -- convert double float array to bfloat16 array
sbf16tos -- convert bfloat16 array to single float array
dbf16tod -- convert bfloat16 array to double float array
4. Implemented generic kernels for all 4 conversion APIs, and cooperlake-specific kernel for shstobf16 and shdtobf16
5. Update level1 thread facilitate functions and macros to support multi-threading for these new APIs
6. Fix Cooperlake platform detection/specify issue when under dynamic-arch building
7. Change the typedef of bfloat16 from unsigned short to more strict uint16_t
Signed-off-by: Chen, Guobing <guobing.chen@intel.com>
5 years ago
Sharvil Nanavati
7b4773b24d
Add API to set thread affinity on Linux.
Issue: #2545
5 years ago
Martin Kroeker
5c42287c4f
Add declarations for ?sum and cblas_?sum
6 years ago
Martin Kroeker
4f9d3e4b28
Expose CBLAS interfaces for I?MIN and I?MAX
6 years ago
Martin Kroeker
b57af93792
just make CBLAS_LAYOUT an alias of the existing CBLAS_ORDER
to avoid having to change all instances of enum CBLAS_ORDER in this file
7 years ago
Martin Kroeker
8aeab0601e
Follow netlib renaming/aliasing CBLAS_ORDER to CBLAS_LAYOUT
fixes #1754
7 years ago
Martin Kroeker
eb71d61c7c
Expose CBLAS interface to BLAS extensions iXamin
7 years ago
Martin Kroeker
1c9f4422b5
Fix declaration of cblas_Xdotc_sub and cblas_Xdotu_sub
last parameter of cblas_(c,z)dotc_sub and cblas_(c,z)dotu_sub should be void* as well
7 years ago
Martin Kroeker
66ac898f64
Change prototypes of all complex functions to use void*
Change prototypes of complex functions to use void pointers like the other implementations of CBLAS
8 years ago
Martin Koehler
39cc6b21d3
Add ATLAS-style ?geadd function
10 years ago
Erik Schnetter
65a847cd36
Introduce openblas_get_num_threads and openblas_get_num_procs
10 years ago
wernsaar
f446d2368a
updated cblas.h and cblas_noconst.h
11 years ago
wernsaar
7f234f8ed1
added GEMM3M tests
11 years ago
Martin Koehler
a057e5434d
add CBLAS interface for s/d/c/zimatcopy
11 years ago
Martin Köhler
7794766d3c
Add cblas_(s/d/c/z)omatcopy in order to have cblas interface for them.
11 years ago
Zhang Xianyi
552119c484
Fixed #407 . Support outputing the CPU corename on runtime.
The user can use char * openblas_get_config() or char * openblas_get_corename().
11 years ago
Timothy Gu
6c2ead30f0
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
11 years ago
wernsaar
faf3ac0aad
Ref #285 : added axpby kernels
11 years ago
Lars Buitinck
212463dce9
get rid of the generated cblas_noconst.h file
12 years ago
grisuthedragon
c19a488af2
create openblas_get_parallel to retrieve information which
parallelization model is used by OpenBLAS.
12 years ago
Zaheer Chothia
875d520ccf
Refs #193 . cblas: move #include out of extern "C" block.
Standard headers may contain C++ templates which are not permitted inside an
extern "C" block. This might be the case when we include <complex.h>.
12 years ago
Zaheer Chothia
4db6660de4
Refs #185 . Add missing 'const' to declarations in <cblas.h>. Thanks to Dan Povey!
The 'const' modifications were done automatically using this scripts:
https://kaldi.svn.sourceforge.net/svnroot/kaldi/sandbox/dan/tools/for_openblas
12 years ago
Zaheer Chothia
200e4acf15
cblas: typedef enums for improved compatibility with Intel MKL.
Netlib style:
enum CBLAS_XYZ {X=1, Y=2, Z=3};
Intel MKL style:
typedef enum {X=1, Y=2, Z=3} CBLAS_XYZ;
With this hybrid style, code written in the latter form won't need any
modifications to be built with OpenBLAS. This change should not affect existing
code, although a warning may be emitted for C code which does the following
(does not occur with C++):
typedef enum CBLAS_XYZ CBLAS_XYZ;
warning: redefinition of typedef 'CBLAS_XYZ' [-pedantic]
13 years ago
Zhang Xianyi
bdf8d9411e
Refs #163 . Obtain the build configure on runtime.
openblas_get_config function returns the configure string.
So far, it supports USE64BITINT, NO_CBLAS, NO_LAPACK, NO_LAPACKE,
DYNAMIC_ARCH, NO_AFFINITY.
Example:
#include <stdio.h>
extern char * openblas_get_config();
void main()
{
printf("%s\n",openblas_get_config());
return;
}
12 years ago
Zhang Xianyi
422359d09a
Export openblas_set_num_threads in shared library.
13 years ago
Zaheer Chothia
f93318a6c8
Refs #95 cblas: compatibility for compilers without C99 complex number support (e.g. Visual Studio)
13 years ago
Xianyi Zhang
0a45e5495f
Fixed #21 . Added extern C to support C++. Thank Tasio for the patch.
14 years ago
Xianyi Zhang
552f31dbbd
Fixed #13 . Fixed blasint undefined bug in <cblas.h> file.
14 years ago
Xianyi Zhang
342bbc3871
Import GotoBLAS2 1.13 BSD version codes.
14 years ago