Martin Kroeker
5d643929dd
Merge pull request #2948 from martin-frbg/issue2947
Expressly enable neon for use with intrinsics if available
5 years ago
Martin Kroeker
e8cbf0fc50
Output predefined HAVE_ entries to Makefile.conf for ARM with specified TARGET
5 years ago
Martin Kroeker
b937d78a6d
Try to read cpu information from /sys/devices/system/cpu/cpu0 if HWCAP_CPUID fails
5 years ago
Martin Kroeker
e2f9005db8
Merge pull request #2950 from RajalakshmiSR/saxpy
Optimize saxpy for POWER10
5 years ago
Martin Kroeker
6a1f3e40af
Remove debug printout of object list
5 years ago
Martin Kroeker
878b6d1f41
Remove spurious expr in flang version check
5 years ago
Rajalakshmi Srinivasaraghavan
c24ba8b1dd
Optimize saxpy for POWER10
This patch makes use of new POWER10 vector pair instructions for
loads and stores.
5 years ago
Qiyu8
f917c26e83
Refractoring remaining benchmark cases.
5 years ago
Martin Kroeker
76203e2120
Merge pull request #2946 from martin-frbg/issue2945
Move definitions that are neither needed nor supported on Solaris
5 years ago
Martin Kroeker
eec517af0e
Expressly enable neon for use with intrinsics if available
5 years ago
Martin Kroeker
fd7da56965
Move definitions that are neither needed nor supported on SUNOS
5 years ago
Martin Kroeker
2f9fc9be30
Update version to 0.3.12.dev
5 years ago
Martin Kroeker
81fcfd5ed3
Update version to 0.3.12.dev
5 years ago
Martin Kroeker
addf7593ae
Merge pull request #2944 from xianyi/release-0.3.0
Merge back 0.3.12 tag (and Changelog typo fixes) from release
5 years ago
Martin Kroeker
c5f280a7f0
Fix typos
5 years ago
Martin Kroeker
6e3a05f2c9
Merge pull request #2943 from xianyi/develop
Merge from develop for 0.3.12 release
5 years ago
Martin Kroeker
89db73569b
Update Changelog with 0.3.12 changes
5 years ago
Martin Kroeker
e1c18e4eeb
Update version to 0.3.12 for release
5 years ago
Martin Kroeker
26f658c9d2
Update version to 0.3.12 for release
5 years ago
Martin Kroeker
dc35477317
Merge pull request #2942 from martin-frbg/makebuildtypes
Comment out BUILD_SINGLE etc. in Makefile.rule and add a short explanation
5 years ago
Martin Kroeker
365f28787c
Comment out BUILD_SINGLE etc. and add a short explanation
5 years ago
Martin Kroeker
2f2e9ddb65
Merge pull request #2941 from martin-frbg/exportsfix
Fix grouping of sladiv1/dladiv1/ilaenv2stage in gensymbol
5 years ago
Martin Kroeker
0d140e61ac
Fix wrong grouping of dcombssq
5 years ago
Martin Kroeker
4c45cd6294
fix missing split of sladiv1/dladiv/ilaenv2stage by build type
5 years ago
Martin Kroeker
680f744abf
Merge pull request #108 from xianyi/develop
rebase
5 years ago
Martin Kroeker
6f9460f0f6
Merge pull request #2937 from martin-frbg/pwr-buffersz
Increase and unify BUFFERSIZE on POWER;fix gcc inline warning
5 years ago
Qiyu8
dd6ebdfdab
Refactor the performance measurement system
5 years ago
Guillaume Horel
1917a4e7b8
reuse variables defined in Makefile.system
5 years ago
Martin Kroeker
6c970fa998
Merge pull request #2938 from martin-frbg/2934-3
Fix twisted spelling that broke the gfortran version test again
5 years ago
Martin Kroeker
b23cb05231
Fix twisted spelling that broke the gfortran version test again
5 years ago
Martin Kroeker
1d4c96fa0c
Increase BUFFERSIZE further
5 years ago
Martin Kroeker
34c3c407ef
label always_inline function as inline to silence a gcc warning
5 years ago
Martin Kroeker
3f84a9ca15
Merge pull request #2936 from martin-frbg/issue2934-2
Fix compiler version check for -mavx2 support (DYNAMIC_ARCH case)
5 years ago
Martin Kroeker
7e265c50bf
Merge pull request #2935 from martin-frbg/lapack458
Fix macro used in argument conversion (LAPACK PR 458)
5 years ago
Martin Kroeker
ee90f30384
Increase BUFFERSIZE for POWER8-10 and use same value for POWER6
to fix overflow warning for PWR8 ZGEMM and PWR9 C/ZGEMM and avoid size mismatches in DYNAMIC_ARCH
5 years ago
Martin Kroeker
2e48d560ba
Fix compiler version check
5 years ago
Martin Kroeker
ab7f466467
Merge pull request #106 from xianyi/develop
rebase
5 years ago
Martin Kroeker
f95031204e
Fix macro used in argument conversion (LAPACK PR 458)
5 years ago
Martin Kroeker
909068facf
Merge pull request #2932 from RajalakshmiSR/copyp10
Optimize scopy/ccopy for POWER10
5 years ago
Martin Kroeker
5b7438fdde
Merge pull request #2934 from thrasibule/improve_version_check
actually check that version is greater than 4.7
5 years ago
Guillaume Horel
47696b43e9
actually check that version is greater than 4.7
5 years ago
Rajalakshmi Srinivasaraghavan
ad745c0bae
Optimize scopy/ccopy for POWER10
This patch makes use of new POWER10 vector pair instructions for
loads and stores. Also reorganized all variants of copy functions
to make use of same kernel.
5 years ago
Martin Kroeker
17c46bf06a
Merge pull request #2930 from ismail/fix-no-return
Fix build with -Werror=return-type
5 years ago
Martin Kroeker
28242096cd
Merge pull request #2928 from martin-frbg/issue2917
Enable -mavx2 for flang as well where supported
5 years ago
İsmail Dönmez
4a1d00f589
Fix build with -Werror=return-type
dgemm_tcopy_16_skylakex.c CNAME function should return an int, add a
return 0 similar to other files.
5 years ago
Martin Kroeker
00813363be
Enable -mavx2 for flang as well
5 years ago
Martin Kroeker
336e35469a
Merge pull request #105 from xianyi/develop
rebase
5 years ago
Martin Kroeker
29668458f7
Merge pull request #2925 from martin-frbg/issue2911-2
Add binutils version check as prerequisite for POWER10 in DYNAMIC_ARCH build
5 years ago
Martin Kroeker
ee83e29046
Merge pull request #2926 from bartoldeman/vzeroupper-clobber-all
x86_64: clobber all xmm registers after vzeroupper
5 years ago
Martin Kroeker
1a0f57c8f0
Fix missing backquotes
5 years ago