TGY
|
b5ba95a6c0
|
Modernize obsolete inline order
|
2 years ago |
H.J. Lu
|
53ee0b76bb
|
x86: Enable Intel CET
When Intel CET is enabled, we need to include <cet.h> in assembly code
to mark Intel CET support and place _CET_ENDBR at the function entry.
|
4 years ago |
Martin Kroeker
|
db3226a646
|
Add (empty) read barrier definition
|
5 years ago |
luz.paz
|
daf2fec12d
|
Misc. typo fixes
Found via `codespell -q 3 -w -L ith,als,dum,nd,amin,nto,wis,ba -S ./relapack,./kernel,./lapack-netlib`
|
6 years ago |
Martin Kroeker
|
b55c586fac
|
Fix missing clobber in x86/x86_64 blas_quickdivide inline assembly function (#2017)
* Fix missing clobber in blas_quickdivide assembly
|
6 years ago |
Martin Kroeker
|
88e224f4c0
|
Merge pull request #1542 from martin-frbg/quickdiv64
Avoid out-of-bounds accesses in blas_quickdivide on big X86 systems
|
7 years ago |
Martin Kroeker
|
e93355e5e1
|
Omit the table overflow check when building for small systems
|
7 years ago |
Martin Kroeker
|
8145ecd70b
|
Avoid out-of-bounds reads from blas_quick_divide_table on big systems
|
7 years ago |
Alex Arslan
|
8da6b6ae52
|
Allow building on OpenBSD
With this change, OpenBLAS builds and all tests pass on OpenBSD 6.2
using Clang. Tested on x86-64 only, with and without DYNAMIC_ARCH=1.
|
7 years ago |
theoractice
|
fc3a558515
|
Fix a minor compiler error in VisualStudio with CMake
|
9 years ago |
Zhang Xianyi
|
a1a96589aa
|
Fixed #773 blas_quickdivide bug on CMake and Visual Studio x86 32-bit.
|
9 years ago |
Zhang Xianyi
|
94b125255f
|
Merge branch 'develop' into cmake
Conflicts:
driver/others/memory.c
|
10 years ago |
Grazvydas Ignotas
|
6b92204a7c
|
add fallback blas_lock implementation
to be used on armv5 and new platforms
|
10 years ago |
Grazvydas Ignotas
|
e12cf1123e
|
add fallback rpcc implementation
- use on arm, arm64 and any new platform
- use faster integer math instead of double
- use similar scale as rdtsc so that timeouts work
|
10 years ago |
Zhang Xianyi
|
dcd5ba4443
|
Merge branch 'cmake' of https://github.com/hpanderson/OpenBLAS into hpanderson_cmake
|
10 years ago |
Zhang Xianyi
|
51ff17d46e
|
Add AMD Excavator target.
|
10 years ago |
Hank Anderson
|
3649cfbd7b
|
Fixed EPILOGUE for clang.
|
10 years ago |
Hank Anderson
|
5ae8993752
|
Added intrinsics for MSVC.
|
10 years ago |
Hank Anderson
|
056ba26755
|
Changed a number of inline calls to use __inline.
MSVC doesn't implement C99, so can't use the inline keyword. __inline
appears to work in MSVC and GCC.
|
10 years ago |
Werner Saar
|
4319769b79
|
added target processor STEAMROLLER
|
10 years ago |
Timothy Gu
|
6c2ead30f0
|
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
|
11 years ago |
Zhang Xianyi
|
16eb780e13
|
Refs #262. Fixed compatibility issues of GNU stack markings with PathScale EKOPath(tm) Compiler Suite: Version 4.0.12.1
|
12 years ago |
Zhang Xianyi
|
a2930664f4
|
Refs #262. Added executable stack markings.
|
12 years ago |
Zhang Xianyi
|
886cbaf4e4
|
Support AMD Piledriver by bulldozer kernels.
|
12 years ago |
Zhang Xianyi
|
57944538b6
|
Use ALIGN_5 instead of .align 32 in assembly kernel. Added ALIGN_5 for 32-bit OSX.
|
12 years ago |
Zhang Xianyi
|
37edae1c90
|
Refs #75. Check ffreep macro before the define.
|
13 years ago |
Xianyi Zhang
|
a4daa34db7
|
Refs #75. Use ffreep opcode directly. Please check out http://www.sandpile.org/x86/opc_fpu.htm .
|
13 years ago |
Zaheer Chothia
|
5199809bba
|
Fix typo: OS_CYGWIN -> OS_CYGWIN_NT, OS_INERIX -> OS_INTERIX
|
13 years ago |
Zaheer Chothia
|
a431042475
|
Fix inconsistent case for OS_* macros (Refs pull request #111)
|
13 years ago |
Mike Nolta
|
4e29b6ffc0
|
FreeBSD: fix OS_FreeBSD -> OS_FREEBSD typos
|
13 years ago |
Xianyi Zhang
|
342bbc3871
|
Import GotoBLAS2 1.13 BSD version codes.
|
14 years ago |