Martin Kroeker
|
18014b04c8
|
Merge pull request #4979 from martin-frbg/issue4978-2
Remove any optimization flags from DEBUG builds on POWER architecture
|
10 months ago |
Martin Kroeker
|
9db51f790a
|
Remove any optimization flags from DEBUG builds on POWER architecture
|
10 months ago |
Martin Kroeker
|
e334b79b47
|
Merge pull request #4977 from martin-frbg/issue4973
Add dummy implementations of openblas_get/set_affinity for OpenMP builds
|
10 months ago |
Martin Kroeker
|
4060dd43e3
|
Add dummy implementations of openblas_get/set_affinity
|
10 months ago |
Martin Kroeker
|
2e2f952bfb
|
Merge pull request #4975 from martin-frbg/fixup4974
Update Cray compiler options and calling convention in CMake
|
10 months ago |
Martin Kroeker
|
cea9df3643
|
Update Cray compiler options and calling convention
|
10 months ago |
Martin Kroeker
|
3e7e312d7d
|
Merge pull request #4974 from cenewcombe/develop
Corrections for Cray and Nvidia Fortran compiler calling conventions
|
10 months ago |
Caroline Newcombe
|
10cf06dce1
|
Merge branch 'OpenMathLib:develop' into develop
|
10 months ago |
Caroline Newcombe
|
760bf7aa37
|
Update Fortran return for complex data types (Cray and Nvidia compilers)
|
10 months ago |
Martin Kroeker
|
2057cd75fd
|
Merge pull request #4971 from martin-frbg/m4init
Add initial autodetection support for Apple M4 as VORTEX with HAVE_SME
|
10 months ago |
Martin Kroeker
|
50da5c2b34
|
Add Apple M4 as VORTEX with HAVE_SME
|
10 months ago |
Martin Kroeker
|
24c5152fbd
|
Merge pull request #4966 from martin-frbg/getrf_indent
Fix misleading indentation warning in getrf
|
10 months ago |
Martin Kroeker
|
2332ea7e7a
|
fix misleading indentation
|
10 months ago |
Martin Kroeker
|
794f75a8f7
|
Merge pull request #4965 from martin-frbg/homebrewjob-2
CI: Update upload-artifacts version in the homebrew workflow again
|
10 months ago |
Martin Kroeker
|
87a18154ce
|
Update version of upload-artifacts again
|
11 months ago |
Martin Kroeker
|
453b9e4886
|
Merge pull request #4961 from h-vetinari/flang
explicitly link to OpenMP
|
11 months ago |
H. Vetinari
|
d3272e51eb
|
explicitly link to OpenMP
|
1 year ago |
Martin Kroeker
|
c634114c8a
|
Merge pull request #4960 from martin-frbg/gemmtr
Provide copies of GEMMT under its new name GEMMTR adopted by the Reference BLAS
|
11 months ago |
Martin Kroeker
|
598bd21991
|
Merge pull request #4958 from XiWeiGu/x86_64_opt_somatcopy_ct_avx
x86_64: opt somatcopy_ct with AVX
|
11 months ago |
Martin Kroeker
|
c9d1a3b912
|
Merge pull request #4954 from XiWeiGu/la64_update_gh_actions
gh-actions: Test LoongArch64 with gcc14 from Ubuntu 24.04
|
11 months ago |
Martin Kroeker
|
c3e7d08fb4
|
Copy GEMMT to its new name GEMMTR
|
11 months ago |
Martin Kroeker
|
0cf656fd3e
|
Add copies of GEMMT under its new name GEMMTR
|
11 months ago |
Martin Kroeker
|
24b5ccaf4b
|
Merge pull request #4202 from TiborGY/inlines_pt2
Remove some unused inline macro definitions
|
11 months ago |
gxw
|
73c6a28073
|
x86_64: opt somatcopy_ct with AVX
|
11 months ago |
Martin Kroeker
|
f66e6d32c2
|
Merge pull request #4953 from NickelWenzel/fix_trtrs_return_types
fix: return types of *trtrs routines
|
11 months ago |
Martin Kroeker
|
a8bb105ed6
|
Merge pull request #4848 from haampie/fix/cmake-min-version
cmake: set `CMP0042` to `NEW`
|
11 months ago |
Martin Kroeker
|
0e6a2cc93c
|
bump the minimum_required version instead
|
11 months ago |
TGY
|
815cb24944
|
remove unused INLINE macro definitions
|
2 years ago |
Martin Kroeker
|
ac736820d7
|
Merge pull request #4955 from cdaley/optimize_gemv_forwarding
Optimize gemv forwarding on ARM64 systems
|
11 months ago |
gxw
|
8f595382c4
|
gh-actions: Test LoongArch64 with gcc14 from Ubuntu 24.04
|
11 months ago |
Chris Daley
|
cb48505251
|
optimize gemv forwarding on ARM64 systems
|
11 months ago |
nickel
|
79f4bbd4cd
|
fix: return types of *trtrs routines
|
11 months ago |
Martin Kroeker
|
72461f1c8c
|
Merge pull request #4950 from ayappanec/fix-aix-build
Fix AIX build
|
11 months ago |
Ayappan Perumal
|
020cce1068
|
Fix build issues with gcc compiler as well
|
11 months ago |
Ayappan Perumal
|
b6ec73e77c
|
Fix AIX build
|
11 months ago |
Martin Kroeker
|
8a0cd5fcef
|
Merge pull request #4949 from martin-frbg/mingw32-14.2
work around mingw32-gfortran 14.2 miscompiling CBLAS1 tests
|
11 months ago |
Martin Kroeker
|
4dba6ce6ea
|
work around mingw32-gfortran 14.2 miscompiling CBLAS1 tests
|
11 months ago |
Martin Kroeker
|
a93ec74e95
|
Merge pull request #4948 from martin-frbg/fixhavesve
Properly report HAVE_SVE in ARM64 autodetection where applicable
|
11 months ago |
Martin Kroeker
|
c4bb4e74fc
|
NeoverseN2 has SVE too
|
11 months ago |
Martin Kroeker
|
86720778ef
|
write HAVE_SVE to config where applicable
|
11 months ago |
Martin Kroeker
|
016bdb9b0b
|
Merge pull request #4946 from XiWeiGu/la64_omatcopy_lasx
LoongArch64: Opt somatcopy with LASX
|
11 months ago |
gxw
|
ffaa5765a4
|
Bench: Add omatcopy
|
11 months ago |
Martin Kroeker
|
a93897276b
|
Merge pull request #4943 from martin-frbg/update_readme
Update README.md
|
11 months ago |
Martin Kroeker
|
3fc1225dd6
|
Merge branch 'OpenMathLib:develop' into update_readme
|
11 months ago |
Martin Kroeker
|
33078d11e4
|
stress importance of TARGET setting in DYNAMIC_ARCH builds
|
11 months ago |
Martin Kroeker
|
15a57598f5
|
Merge pull request #4944 from ChipKerchner/vectorizeBF16GEMV
[POWER] Vectorize BF16 GEMV
|
11 months ago |
Chip Kerchner
|
ab71a1edf2
|
Better VSX.
|
11 months ago |
gxw
|
bb31bbef52
|
LoongArch64: Opt somatcopy_ct with LASX
|
11 months ago |
gxw
|
b37129341b
|
LoongArch64: Opt somatcopy_cn with LASX
|
11 months ago |
gxw
|
acf6cab304
|
LoongArch64: Opt somatcopy_rn with LASX
|
11 months ago |