Martin Kroeker
83b5c6b92d
Fix compilation with NO_AVX=1 set
fixes #1974
6 years ago
caiyu
29dc72889f
Add support for Hygon Dhyana
6 years ago
Martin Kroeker
00401489c2
Fix missing braces in support_avx()
6 years ago
Martin Kroeker
68eb3146ce
Add xcr0 (os support) check
6 years ago
Martin Kroeker
0afaae4b23
Query AVX2 and AVX512VL capability in x86 cpu detection
6 years ago
TiborGY
211120c508
Fix typo in UNKNOWN core name
Should be of no consequence, right?
6 years ago
Martin Kroeker
64ca44873b
Fix detection of Ryzen2 (missing CORE_ZEN)
7 years ago
Martin Kroeker
3f73e8b8cf
Add cpuid for AMD Ryzen 2
for #1664
7 years ago
Martin Kroeker
2d8cc7193a
Support upcoming Intel Cannon Lake CPUs as Skylake X ( #1621 )
* Support upcoming Cannon Lake as Skylake X
7 years ago
Martin Kroeker
dc9fe05ab5
Update cpuid_x86.c
7 years ago
Martin Kroeker
5a92b311e0
Separate Skylake X from Skylake
7 years ago
Arjan van de Ven
99c7bba8e4
Initial support for SkylakeX / AVX512
This patch adds the basic infrastructure for adding the SkylakeX (Intel Skylake server)
target. The SkylakeX target will use the AVX512 (AVX512VL level) instruction set,
which brings 2 basic things:
1) 512 bit wide SIMD (2x width of AVX2)
2) 32 SIMD registers (2x the number on AVX2)
This initial patch only contains a trivial transofrmation of the Haswell SGEMM kernel
to AVX512VL; more will follow later but this patch aims to get the infrastructure
in place for this "later".
Full performance tuning has not been done yet; with more registers and wider SIMD
it's in theory possible to retune the kernels but even without that there's an
interesting enough performance increase (30-40% range) with just this change.
7 years ago
Martin Kroeker
aece65ea29
Fix coretype detection for Bay Trail Atom
My earlier PR #982 appears to have been incomplete in this regard - fixes #1285
8 years ago
Martin Kroeker
00774b1105
Add dummy implementation of cpuid_count for the CPUIDEMU case
8 years ago
Martin Kroeker
6497aae57c
Use cpuid 4 with subleafs to query L1 cache size on Intel processors
8 years ago
Gian-Carlo Pascutto
9c884986ad
Add an extra familiy/model combination used by AMD Steamrolller (Godavari).
8 years ago
Johannes Buchner
b4071d0d16
Autodetect AMD A8-6410 as BARCELONA
8 years ago
Denis Steckelmacher
c9ff735da6
Add ZEN support (tested for auto-detected static backend)
8 years ago
Martin Kroeker
688267edf3
Fix core detection for Kaby Lake without AVX (G4560)
Should fix #1109 )
8 years ago
Elliot Saba
04b2b06665
CPUID mappings for Core i5-7600K (Kaby Lake)
8 years ago
Martin Koehler
76c6e33e54
Enable EXCAVATOR kernels for A12-9800
8 years ago
Martin Kroeker
60816c9259
Add files via upload
9 years ago
Martin Kroeker
3409bccb21
Update cpuid_x86.c
Add Bay Trail "Pentium N3520" atom cpu
9 years ago
Martin Kroeker
154729908e
Update cpuid_x86.c
9 years ago
Martin Kroeker
97bd1e42c8
Update cpuid_x86.c
9 years ago
Martin Kroeker
935356c34f
Update dynamic.c and cpuid_x86.c for Intel Avoton.
Second part of "support Intel Avoton via Nehalem kernel"
9 years ago
Martin Kroeker
4f05c23673
Update cpuid_x86.c
Add recognition of Intel Atom C27xx (Avoton, model code 4D)
9 years ago
Jerome Robert
76398c3233
Fix detection of AMD E2-3200
9 years ago
Zhang Xianyi
839395fc25
Detect AMD Trinity and Richland.
10 years ago
Zhang Xianyi
94b125255f
Merge branch 'develop' into cmake
Conflicts:
driver/others/memory.c
10 years ago
Zhang Xianyi
cc7cab8a45
Detect other Intel Skylake cores.
http://users.atw.hu/instlatx64/
10 years ago
Yichao Yu
61ae47eb99
Ref #632 . Support Intel Skylake by Haswell kernels.
10 years ago
Zhang Xianyi
dcd5ba4443
Merge branch 'cmake' of https://github.com/hpanderson/OpenBLAS into hpanderson_cmake
10 years ago
Zhang Xianyi
51ff17d46e
Add AMD Excavator target.
10 years ago
Zhang Xianyi
8977b3f235
Refs #529 . Support Intel Broadwell by Haswell kernels.
10 years ago
Hank Anderson
e19bf3a28b
Removed MSVC cpuid func when using clang.
10 years ago
Hank Anderson
84d90d6ed8
Fixed some compiler errors/warnings for clang.
10 years ago
Hank Anderson
92cdac5f87
Added MSVC functions to cpuid_x86.c to replace gcc-specific ASM.
10 years ago
Werner Saar
4319769b79
added target processor STEAMROLLER
10 years ago
Zhang Xianyi
2987bc7b40
refs #464 . Fixed the bug of detecting L2 associative on x86.
11 years ago
Isaac Dunham
db7e6366cd
Workaround PIC limitations in cpuid.
cpuid uses register ebx, but ebx is reserved in PIC.
So save ebx, swap ebx & edi, and return edi.
Copied from Igor Pavlov's equivalent fix for 7zip (in CpuArch.c),
which is public domain and thus OK license-wise.
11 years ago
Zhang Xianyi
c94762bb56
Refs #401 . Added NO_AVX2 flag for old binutils (e.g. RHEL6)
11 years ago
Timothy Gu
6c2ead30f0
Remove all trailing whitespace except lapack-netlib
Signed-off-by: Timothy Gu <timothygu99@gmail.com>
11 years ago
wernsaar
88b6bf251a
force fallback for x86 32bit
11 years ago
Zhang Xianyi
7b8604ea29
Refs #335 . Added the fallback of L2 size detection for some virtual machines.
11 years ago
Zhang Xianyi
ab69443bd4
Refs #332 . Added addtional Intel Ivy Bridge and Haswell CPU-id.
11 years ago
Zhang Xianyi
2638370844
Init code base for Intel Haswell.
12 years ago
Zhang Xianyi
23186d9f21
Fixed the FMA3 detection bug.
12 years ago
Zhang Xianyi
886cbaf4e4
Support AMD Piledriver by bulldozer kernels.
12 years ago
Dan Luu
88ef307cef
Refs #241 . Add Haswell support (using sandybridge optimizations)
12 years ago