OSchip / OpenBLAS
Fixed the Windows x86_64 ABI bug in s/daxpy kernels.
Tag: tags/v0.2.7
Author: Zhang Xianyi, 13 years ago
Parent: 2c9a203bd1
Commit: 724ae159ce
2 changed files with 9 additions and 2 deletions
kernel/x86_64/axpy_sse.S  | +5 -1
kernel/x86_64/axpy_sse2.S | +4 -1
kernel/x86_64/axpy_sse.S
@@ -69,7 +69,7 @@
 #endif
 movaps %xmm0, ALPHA
 #else
-movaps %xmm3, ALPHA
 movq 40(%rsp), X
 movq 48(%rsp), INCX
@@ -79,6 +79,10 @@
 SAVEREGISTERS
+#ifdef WINDOWS_ABI
+movaps %xmm3, ALPHA
+#endif
 shufps $0, ALPHA, ALPHA
 leaq (, INCX, SIZE), INCX
kernel/x86_64/axpy_sse2.S
@@ -69,7 +69,6 @@
 #endif
 movaps %xmm0, ALPHA
 #else
-movaps %xmm3, ALPHA
 movq 40(%rsp), X
 movq 48(%rsp), INCX
@@ -79,6 +78,10 @@
 SAVEREGISTERS
+#ifdef WINDOWS_ABI
+movaps %xmm3, ALPHA
+#endif
 unpcklpd ALPHA, ALPHA
 leaq (, INCX, SIZE), INCX
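
Note on the change: both hunks move the load of ALPHA from %xmm3 so that it happens after SAVEREGISTERS instead of before it. Under the Windows x64 calling convention, the fourth floating-point argument arrives in %xmm3 and %xmm6 through %xmm15 are callee-saved. So if ALPHA aliases one of those callee-saved registers (an assumption, since the macro definition is not shown in this diff), loading it before the save would overwrite the caller's value. For readers unfamiliar with the kernels, the operation they implement is y := alpha*x + y. The C sketch below shows only that reference semantics, for non-negative strides. The name `ref_daxpy` is hypothetical and this is not OpenBLAS code.

```c
#include <stddef.h>

/* Hypothetical reference routine (not part of OpenBLAS):
 * computes y[i*incy] += alpha * x[i*incx] for i = 0 .. n-1.
 * Assumes non-negative strides and non-overlapping x and y. */
static void ref_daxpy(size_t n, double alpha,
                      const double *x, size_t incx,
                      double *y, size_t incy)
{
    for (size_t i = 0; i < n; i++) {
        y[i * incy] += alpha * x[i * incx];
    }
}
```

A plain loop like this is a convenient oracle for checking the assembly kernels on a Windows build, for example by comparing results element by element after a call.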