martin
3be5c3d343
Update LAPACK to 3.8.0
7 years ago
Martin Kroeker
b18730f9e1
Merge pull request #1365 from xoviat/patch-1
[appveyor] use cmake from conda forge
7 years ago
xoviat
be9e63c021
[appveyor] use cmake from conda forge
7 years ago
Martin Kroeker
21844238ca
Merge pull request #1364 from martin-frbg/shmem-init
Handle shmem init failures in cpu affinity setup code
7 years ago
Martin Kroeker
07e7c36dac
Handle shmem init failures in cpu affinity setup code
Failures to obtain or attach shared memory segments would lead to an exit without explanation of the exact cause.
This change introduces a more verbose error message and tries to make the code continue without setting cpu affinity.
Fixes #1351
7 years ago
Martin Kroeker
9251a2efde
Merge pull request #1359 from brada4/develop
Eliminate mode variable where not needed in syrk interface
7 years ago
Martin Kroeker
06adbf2e41
Merge pull request #1347 from martin-frbg/issue1322
Change CBLAS complex functions to take void pointers
7 years ago
Martin Kroeker
b46e2b57cc
Make return parameter of cblas_Xdotc_sub, cblas_Xdotu_sub a void pointer as well
7 years ago
Martin Kroeker
3ce401f51b
Make last parameter of cblas_Xdotc_sub/cblas_Xdotu_sub a void pointer as well
7 years ago
Martin Kroeker
1c9f4422b5
Fix declaration of cblas_Xdotc_sub and cblas_Xdotu_sub
last parameter of cblas_(c,z)dotc_sub and cblas_(c,z)dotu_sub should be void* as well
7 years ago
Andrew
27575d200a
Eliminate mode variable where not needed
7 years ago
Martin Kroeker
6157d0902a
Merge pull request #1358 from martin-frbg/unused_vars
Clean up spurious unused variables in the kernels
7 years ago
Martin Kroeker
3fea849bbf
Remove unused variables from Haswell dtrmm and Bulldozer dtrsm
7 years ago
Martin Kroeker
8f177621bc
Remove unused variables at0...at3 from ?symv_U
7 years ago
Martin Kroeker
5f402b7759
Remove unused (loop?) variable j from the gemv_n_4 implementations
7 years ago
Martin Kroeker
65bf0a343c
Remove unused variable btpr
7 years ago
Martin Kroeker
acf3d34bc5
Silence an unused variable warning with a cast
l2 cache size is not universally needed to assign default unrolling limits, but neither putting its declaration inside an ifdef nor cloning it into all ifdef sections that need it really makes sense here.
7 years ago
Martin Kroeker
8e75f7dcb4
Merge pull request #1353 from xoviat/patch-1
[appveyor] use flang from conda-forge
8 years ago
Martin Kroeker
bd3546704c
Merge pull request #1356 from martin-frbg/lapack-issue196
Break out of potentially infinite rescaling loop after 1000 iterations
8 years ago
Martin Kroeker
2df1e3372d
Break out of potentially infinite rescaling loop after 1000 iterations
Inf values in the input vector will survive rescaling, causing an infinite loop. The value of 1000 is arbitrarily chosen as a large but finite value with the intention to never interfere with regular calculations.
8 years ago
Martin Kroeker
4271b2b158
Merge pull request #1354 from martin-frbg/shmem
Try to handle shmget or shmat failing
8 years ago
Martin Kroeker
148493df89
Merge branch 'develop' into shmem
8 years ago
Martin Kroeker
415555a9c1
Merge branch 'develop' into shmem
8 years ago
Martin Kroeker
2a6fef9a55
Try to handle shmget or shmat failing
also replaces one verbatim sched_yield with the YIELDING macro for consistency as suggested in #1351
8 years ago
xoviat
307305aeb5
[appeyor] use flang from conda-forge
This flang will be updated in the future. We leave cmake because it's
not yet released with fortran support
8 years ago
Martin Kroeker
cc26cdce0c
Merge pull request #1352 from martin-frbg/issue1351
Output an error message when shmat() fails
8 years ago
Martin Kroeker
d8576826c4
Output an error message when shmat() fails
Observed in #1351 with SELinux as the likely culprit. Without the message, the user saw a segfault with no apparent reason
8 years ago
Martin Kroeker
c6968edec4
Merge pull request #1350 from insertinterestingnamehere/flang
WIP: Support for Flang on Windows
8 years ago
Isuru Fernando
9268314290
Fix gensymbol script
8 years ago
Ian Henriksen
3ace0fda3f
Merge pull request #1 from xoviat/patch-1
[appveyor] fixes
8 years ago
xoviat
3cfc64404a
[appveyor] fixes
8 years ago
Ian Henriksen
72956e8950
Build MATGEN LAPACK routines by default when building with CMake.
8 years ago
Ian Henriksen
505dc08635
Update lapacke.cmake with routines added in LAPACK 3.7.0.
8 years ago
Ian Henriksen
61587b0670
Update lapack.cmake with additional routines from LAPACK version 3.7.0.
8 years ago
Ian Henriksen
632fc75d77
Allow using compilers other than gfortran in conjunction with
MSVC or clang-cl.
8 years ago
Martin Kroeker
2c222f1faa
Modify complex CBLAS functions to take void pointers
Modify complex CBLAS functions to take void pointers instead of float or double arguments (to bring the prototypes in line with netlib and other implementations' cblas.h)
8 years ago
Martin Kroeker
66ac898f64
Change prototypes of all complex functions to use void*
Change prototypes of complex functions to use void pointers like the other implementations of CBLAS
8 years ago
Martin Kroeker
ab87ee6b48
Merge pull request #1329 from martin-frbg/dsdot
(Trivial) optimized dsdot implementation for HASWELL
8 years ago
Martin Kroeker
a07807caac
Eliminate loop code when called as/from dsdot
8 years ago
Martin Kroeker
b71f4fe681
Merge pull request #1334 from ashwinyes/develop_aarch64_20171024_addlocallabels
ARM64: Convert all labels to local labels
8 years ago
Ashwin Sekhar T K
a0128aa489
ARM64: Convert all labels to local labels
While debugging/profiling applications using perf or other tools, the
kernels appear scattered in the profile reports. This is because the labels
within the kernels are not local and each label is shown as a separate
function.
To avoid this, all the labels within the kernels are changed to local
labels.
8 years ago
Martin Kroeker
627133f9ad
Merge pull request #1333 from martin-frbg/haswell32
Fix 32bit HASWELL builds
8 years ago
Martin Kroeker
0e2cf102e1
Fix 32bit HASWELL
8 years ago
Martin Kroeker
5e3e91d0fc
Split the microkernel workload into chunks of 32 floats for dsdot mode to limit loss of precision
8 years ago
Martin Kroeker
28c3fa8950
Add dsdot
8 years ago
Martin Kroeker
8ac87c1cb6
Implement DSDOT with unchanged sdot microkernels
8 years ago
Martin Kroeker
b7cee00455
Merge pull request #1327 from martin-frbg/cmake-relapack
Make ReLAPACK available in cmake builds
8 years ago
Martin Kroeker
962b20a9bb
Optionally add ReLAPACK to LIB_COMPONENTS
8 years ago
Martin Kroeker
fbf83f4833
Add cmake build list file for ReLAPACK
8 years ago
Martin Kroeker
78cec6209c
Add ReLAPACK option
8 years ago