You can not select more than 25 topics Topics must start with a chinese character,a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
Martin Kroeker e1b7123bbe
Merge pull request #2867 from Qiyu8/usimd-floatdot
5 years ago
..
KERNEL Fix paths to C kernels for nrm2 7 years ago
KERNEL.ARMV8 Adapt ARM architect 5 years ago
KERNEL.CORTEXA53 Adapt ARM architect 5 years ago
KERNEL.CORTEXA57 Adapt ARM architect 5 years ago
KERNEL.CORTEXA72 Simplifying ARMv8 build parameters 6 years ago
KERNEL.CORTEXA73 Simplifying ARMv8 build parameters 6 years ago
KERNEL.EMAG8180 Add preliminary support for EMAG8180 5 years ago
KERNEL.FALKOR Simplifying ARMv8 build parameters 6 years ago
KERNEL.NEOVERSEN1 Add Neoverse-N1 core 5 years ago
KERNEL.THUNDERX ARM64: Enable DYNAMIC_ARCH 7 years ago
KERNEL.THUNDERX2T99 ARM64: Enable DYNAMIC_ARCH 7 years ago
KERNEL.THUNDERX3T110 ARM64: Add THUNDERX3T110 Target 5 years ago
KERNEL.TSV110 update 5 years ago
KERNEL.VORTEX Rename KERNEL.SILICON to KERNEL.VORTEX 5 years ago
Makefile added experimental support for ARMV8 12 years ago
amax.S ARM64: Convert all labels to local labels 8 years ago
asum.S ARM64: Convert all labels to local labels 8 years ago
axpy.S ARM64: Convert all labels to local labels 8 years ago
casum.S ARM64: Convert all labels to local labels 8 years ago
casum_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 7 years ago
cgemm_kernel_4x4.S ARM64: Convert all labels to local labels 8 years ago
cgemm_kernel_8x4.S ARM64: Convert all labels to local labels 8 years ago
cgemm_kernel_8x4_thunderx2t99.S ARM64: Convert all labels to local labels 8 years ago
copy.S ARM64: Convert all labels to local labels 8 years ago
copy_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 7 years ago
csum.S Add ARM64 implementations of ?sum 6 years ago
ctrmm_kernel_4x4.S ARM64: Convert all labels to local labels 8 years ago
ctrmm_kernel_8x4.S ARM64: Convert all labels to local labels 8 years ago
dasum_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 7 years ago
daxpy_thunderx.c aarch64 fix std=c18 compilation 5 years ago
daxpy_thunderx2t99.S ARM64: Improve DAXPY for ThunderX2 5 years ago
ddot_thunderx.c ARM64: Rename kernel files to have consistent naming 8 years ago
dgemm_beta.S Fix zero initialization for beta=0 case 5 years ago
dgemm_kernel_4x4.S ARM64: Convert all labels to local labels 8 years ago
dgemm_kernel_4x8.S ARM64: Convert all labels to local labels 8 years ago
dgemm_kernel_8x4.S ARM64: Convert all labels to local labels 8 years ago
dgemm_kernel_8x4_thunderx2t99.S ARM64: Move parameters from parameter.c to param.h 7 years ago
dgemm_ncopy_4.S ARM64: Convert all labels to local labels 8 years ago
dgemm_ncopy_8.S ARM64: Convert all labels to local labels 8 years ago
dgemm_tcopy_4.S ARM64: Convert all labels to local labels 8 years ago
dgemm_tcopy_8.S ARM64: Convert all labels to local labels 8 years ago
dot.S ARM64: Fix utest dsdot errors 7 years ago
dot_thunderx.c ARM64: Rename kernel files to have consistent naming 8 years ago
dot_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 7 years ago
dtrmm_kernel_4x4.S ARM64: Convert all labels to local labels 8 years ago
dtrmm_kernel_4x8.S ARM64: Convert all labels to local labels 8 years ago
dtrmm_kernel_8x4.S ARM64: Convert all labels to local labels 8 years ago
dznrm2_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 7 years ago
dznrm2_thunderx2t99_fast.c Fixed a few more unnecessary calls to num_cpu_avail. 7 years ago
gemv_n.S ARM64: Convert all labels to local labels 8 years ago
gemv_t.S ARM64: Convert all labels to local labels 8 years ago
iamax.S ARM64: Convert all labels to local labels 8 years ago
iamax_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 7 years ago
izamax.S ARM64: Convert all labels to local labels 8 years ago
izamax_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 7 years ago
nrm2.S Fix accidental duplication of jump instruction 6 years ago
rot.S ARM64: Convert all labels to local labels 8 years ago
sasum_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 7 years ago
scal.S ARM64: Convert all labels to local labels 8 years ago
scnrm2_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 7 years ago
sgemm_beta.S fix initialization to zero in the NEON SGEMM_BETA kernel as well 5 years ago
sgemm_kernel_4x4.S ARM64: Convert all labels to local labels 8 years ago
sgemm_kernel_8x8.S ARM64: Convert all labels to local labels 8 years ago
sgemm_kernel_8x8_cortexa53.S fix INIT8x4 5 years ago
sgemm_kernel_16x4.S ARM64: Convert all labels to local labels 8 years ago
sgemm_kernel_16x4_thunderx2t99.S ARM64: Convert all labels to local labels 8 years ago
sgemm_ncopy_4.S Use arm neon instructions to optimize ncopy operation 5 years ago
sgemm_ncopy_8.S sgemm copy source init 5 years ago
sgemm_tcopy_8.S sgemm copy source init 5 years ago
sgemm_tcopy_16.S [WIP] Use arm neon instructions to optimize tcopy operation 5 years ago
strmm_kernel_4x4.S ARM64: Convert all labels to local labels 8 years ago
strmm_kernel_8x8.S ARM64: Convert all labels to local labels 8 years ago
strmm_kernel_8x8_cortexa53.S use general register to speedup 5 years ago
strmm_kernel_16x4.S ARM64: Convert all labels to local labels 8 years ago
sum.S Add ARM64 implementations of ?sum 6 years ago
swap.S ARM64: Convert all labels to local labels 8 years ago
swap_thunderx2t99.S THUNDERX2T99: Add optimized S/D/C/Z SWAP Implementations 8 years ago
zamax.S Fix the functional bugs for zamax. 5 years ago
zasum.S ARM64: Convert all labels to local labels 8 years ago
zasum_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 7 years ago
zaxpy.S ARM64: Convert all labels to local labels 8 years ago
zdot.S ARM64: Convert all labels to local labels 8 years ago
zdot_thunderx2t99.c Fixed a few more unnecessary calls to num_cpu_avail. 7 years ago
zgemm_kernel_4x4.S ARM64: Convert all labels to local labels 8 years ago
zgemm_kernel_4x4_thunderx2t99.S ARM64: Convert all labels to local labels 8 years ago
zgemv_n.S ARM64: Convert all labels to local labels 8 years ago
zgemv_t.S ARM64: Convert all labels to local labels 8 years ago
znrm2.S Remove automatic label postfixes from macro included only once 6 years ago
zrot.S ARM64: Convert all labels to local labels 8 years ago
zscal.S ARM64: Convert all labels to local labels 8 years ago
zsum.S Add ARM64 implementations of ?sum 6 years ago
ztrmm_kernel_4x4.S ARM64: Convert all labels to local labels 8 years ago