Andrew
e5cc3d72c0
core.IdenticalExpr clang501 checker
7 years ago
Andrew
4938faa822
core.IdenticalExpr clang501 checker
7 years ago
Andrew
9fa986337d
add missing brackets to silence indentation warnings gcc721
7 years ago
Andrew
3eed97f6b9
Initialize values to silence cppcheck
7 years ago
Andrew
13e137fbc9
Initialize uninitialized variables (cppcheck)
7 years ago
Martin Kroeker
3d23f45107
Merge pull request #1415 from quickwritereader/develop
(Z systems Z13) small fixes, some (i(dz)amin,i(dz)amax,(dz)dot,(dz)asum) mikrokernels…
7 years ago
Abdelrauf
87669d1c0a
small fixes, some (i(dz)amin,i(dz)amax,(dz)dot,(dz)asum) mikrokernels can be inlined
8 years ago
Martin Kroeker
42285d8e70
Merge pull request #1410 from brada4/develop
Address warnings #1357
7 years ago
Andrew
d602b99386
LAPACK helpers in C that need care too
8 years ago
Andrew
4d0b005e5b
Eliminate remaining unused results in kernels (clang5 analyzer)
8 years ago
Martin Kroeker
b81656936f
Merge pull request #1409 from martin-frbg/issue1292-2
Tag %1 and %2 as both input and output operands
8 years ago
Martin Kroeker
b973990df2
Tag %1 and %2 as both input and output operands
fix from #1292 extended to the other gemv microkernels
8 years ago
Martin Kroeker
1e31124eb0
Merge pull request #1406 from martin-frbg/issue1292
Tag %1 and %2 as both input and output
8 years ago
Martin Kroeker
cc9500db41
Merge pull request #1403 from brada4/develop
Address few more warnings
8 years ago
Martin Kroeker
723f396a20
Tag %1 and %2 as both input and output
The inline assembly modifies its input operands, so mark them as output to avoid surprises with optimization. Fixes #1292
8 years ago
Andrew
03e5ff0687
initialize potentially unitialized variables (clang5)
8 years ago
Andrew
47deec2c1a
fix couple of dead assignment warnings
8 years ago
Martin Kroeker
43c0622e7b
Retire Piledriver/Steamroller/Excavator daxpy microkernels as well
related to issue #1332
8 years ago
Martin Kroeker
0623636c98
Use Sandybridge daxpy kernel on Haswell and Zen for now
The testcase from #1332 exposes a problem in daxpy_microk_haswell-2.c that is not seen with
any of the other Intel x86_64 microkernels.
8 years ago
Andrew
281a2b952f
warning cleanup ( #1380 )
* dead increments in driver/level2
* dead increments in kernel/generic
* part dead increments in kernel/x86_64
8 years ago
Martin Kroeker
8213385ab8
Work around compiler warnings for unused variables in the generic zgemm3m_Xcopy kernels
8 years ago
Martin Kroeker
db00a51e6b
Merge pull request #1371 from martin-frbg/develop
Add trivially optimized DSDOT for POWER8
8 years ago
martin
7a4b3cfbf8
Add trivially optimized DSDOT for POWER8
8 years ago
Martin Kroeker
6c77b5f267
Merge pull request #1369 from martin-frbg/dsdot
Add optimized dsdot to all other x86_64 kernels that use sdot.c
8 years ago
Andrew
441a9c8385
more dead increments clang4 scan-build deadcode.deadstores
8 years ago
Andrew
1236dbe5a6
Eliminate 2-8 dead increments code
8 years ago
Martin Kroeker
c92cd6d162
Add trivially optimized dsdot based on sdot
8 years ago
Martin Kroeker
cae5d9a20b
Add trivially optimized dsdot based on sdot
8 years ago
Martin Kroeker
3d891c3106
Add trivially optimized dsdot based on sdot
8 years ago
Martin Kroeker
4fbdcfa823
Add trivially optimized dsdot based on sdot
8 years ago
Martin Kroeker
1bb6a96ebc
Add trivially optimized dsdot based on sdot
8 years ago
Martin Kroeker
6bd163f37a
Add trivially optimized dsdot based on sdot
8 years ago
Martin Kroeker
f0333333d1
Add trivially optimized dsdot based on sdot
8 years ago
Andrew
e89b979b2c
fix spurious compiler warning fix (no code change)
8 years ago
Andrew
7e9b29b9b8
fix spurious compiler warning (no code change)
8 years ago
Martin Kroeker
6157d0902a
Merge pull request #1358 from martin-frbg/unused_vars
Clean up spurious unused variables in the kernels
8 years ago
Martin Kroeker
3fea849bbf
Remove unused variables from Haswell dtrmm and Bulldozer dtrsm
8 years ago
Martin Kroeker
8f177621bc
Remove unused variables at0...at3 from ?symv_U
8 years ago
Martin Kroeker
5f402b7759
Remove unused (loop?) variable j from the gemv_n_4 implementations
8 years ago
Martin Kroeker
65bf0a343c
Remove unused variable btpr
8 years ago
Martin Kroeker
acf3d34bc5
Silence an unused variable warning with a cast
l2 cache size is not universally needed to assign default unrolling limits, but neither putting its declaration inside an ifdef nor cloning it into all ifdef sections that need it really makes sense here.
8 years ago
Martin Kroeker
ab87ee6b48
Merge pull request #1329 from martin-frbg/dsdot
(Trivial) optimized dsdot implementation for HASWELL
8 years ago
Martin Kroeker
a07807caac
Eliminate loop code when called as/from dsdot
8 years ago
Ashwin Sekhar T K
a0128aa489
ARM64: Convert all labels to local labels
While debugging/profiling applications using perf or other tools, the
kernels appear scattered in the profile reports. This is because the labels
within the kernels are not local and each label is shown as a separate
function.
To avoid this, all the labels within the kernels are changed to local
labels.
8 years ago
Martin Kroeker
0e2cf102e1
Fix 32bit HASWELL
8 years ago
Martin Kroeker
5e3e91d0fc
Split the microkernel workload into chunks of 32 floats for dsdot mode to limit loss of precision
8 years ago
Martin Kroeker
28c3fa8950
Add dsdot
8 years ago
Martin Kroeker
8ac87c1cb6
Implement DSDOT with unchanged sdot microkernels
8 years ago
Martin Kroeker
c7a8512d12
Cmake fixes for DYNAMIC_ARCH builds and whitespace in path names ( #1323 )
* prebuild.cmake: Put quotes around path names that may contain whitespace
(Copied from alexkaratakis' PR #1295 )
* kernel/CMakeLists.txt: Fix common_lapack header inclusion and DYNAMIC_ARCH generation of ?neg_tcopy and ?laswp_ncopy files
* lapack/CMakeLists.txt: Use correct template for ?laswp_(plus,minus) functions
8 years ago
Martin Kroeker
97ecd4996a
Merge pull request #1319 from martin-frbg/issue601
Fix out-of-bounds memory accesses exposed by xccblat3 testcase
8 years ago