OSchip / OpenBLAS
Merge pull request #898 from wernsaar/develop
added experimental support for optimized lapack fortran functions
tags/v0.2.19^2
Werner Saar
9 years ago
parent 2daad2bcb5 41000c8443
commit f04af36ad0
3 changed files with 1458 additions and 342 deletions
+1083 -0 interface/lapack/fortran/dlaqr5.f
+5 -8 kernel/power/dtrsm_logic_LT_16x4_power8.S
+370 -334 kernel/power/dtrsm_macros_LT_16x4_power8.S
interface/lapack/fortran/dlaqr5.f (+1083 -0)
File diff suppressed because it is too large
kernel/power/dtrsm_logic_LT_16x4_power8.S (+5 -8)
@@ -54,29 +54,26 @@ DSTRM_LT_L4x16_LOOP_START:
 addic. L, KK, 0
 ble- DSTRM_LT_L4x16_SAVE
+mtctr L
 DSTRM_LT_L4x16_LOOP:
 dcbt AO, PRE
 dcbt BO, PRE
 KERNEL_16x4
-addic. L, L, -1
-ble- DSTRM_LT_L4x16_SAVE
+bdz- DSTRM_LT_L4x16_SAVE
 dcbt AO, PRE
 KERNEL_16x4
-addic. L, L, -1
-ble- DSTRM_LT_L4x16_SAVE
+bdz- DSTRM_LT_L4x16_SAVE
 dcbt AO, PRE
 KERNEL_16x4
-addic. L, L, -1
-ble- DSTRM_LT_L4x16_SAVE
+bdz- DSTRM_LT_L4x16_SAVE
 dcbt AO, PRE
 KERNEL_16x4
-addic. L, L, -1
-bgt+ DSTRM_LT_L4x16_LOOP
+bdnz+ DSTRM_LT_L4x16_LOOP
 DSTRM_LT_L4x16_SAVE:
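Judging by the +5/-8 line counts, this hunk appears to swap the `addic.`/`ble-`/`bgt+` loop control for count-register branches (`mtctr`, `bdz-`, `bdnz+`). The Python sketch below is only a rough model of that control flow, not code from the commit: the function names are hypothetical, the `dcbt` prefetch hints are omitted, and it assumes `KERNEL_16x4` leaves both `L` and the count register untouched, which the hunk itself does not show.

```python
def kernel_calls_cr0(kk):
    """Model of the removed scheme: decrement L and branch on the CR0 result."""
    calls = 0
    l = kk                    # addic. L, KK, 0
    if l <= 0:                # ble- DSTRM_LT_L4x16_SAVE
        return calls
    while True:               # DSTRM_LT_L4x16_LOOP:
        for _ in range(4):    # body is unrolled four times
            calls += 1        # KERNEL_16x4
            l -= 1            # addic. L, L, -1
            if l <= 0:        # ble- SAVE, or fall through when bgt+ LOOP is not taken
                return calls


def kernel_calls_ctr(kk):
    """Model of the added scheme: load CTR once, then use bdz-/bdnz+."""
    calls = 0
    if kk <= 0:               # addic. L, KK, 0 ; ble- DSTRM_LT_L4x16_SAVE
        return calls
    ctr = kk                  # mtctr L
    while True:               # DSTRM_LT_L4x16_LOOP:
        for _ in range(4):    # body is unrolled four times
            calls += 1        # KERNEL_16x4
            ctr -= 1          # bdz-/bdnz+ decrement CTR before testing it
            if ctr == 0:      # bdz- SAVE taken, or bdnz+ LOOP not taken
                return calls


if __name__ == "__main__":
    # Both schemes should run the kernel the same number of times.
    for kk in range(-2, 20):
        assert kernel_calls_cr0(kk) == kernel_calls_ctr(kk)
    print("trip counts match")
```

Under these assumptions both variants run the kernel once per unit of `KK` when `KK` is positive and not at all otherwise, so the change reads as a reduction in instructions per iteration (one branch instead of an add plus a branch) rather than a change in behavior.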
kernel/power/dtrsm_macros_LT_16x4_power8.S (+370 -334)
File diff suppressed because it is too large