OSchip / OpenBLAS
Merge pull request #2049 from Celelibi/fix_crash_sgemm_sse_x64

Fix crash in sgemm SSE/nano kernel on x86_64
tags/v0.3.6^2
Author: Martin Kroeker; committer: GitHub; 7 years ago
parent db3dc9e282 b7f59da42d
commit 8d3d29e4d7
No known key found for this signature in database (GPG Key ID: 4AEE18F83AFDEB23)
2 changed files with 2 additions and 2 deletions
+1 -1  kernel/x86_64/gemm_kernel_4x8_nano.S
+1 -1  kernel/x86_64/gemm_kernel_8x4_sse.S

kernel/x86_64/gemm_kernel_4x8_nano.S (+1 -1)
@@ -135,7 +135,7 @@
 #endif
     movq    %rsp, %rbx    # save old stack
-    subq    $128 + LOCAL_BUFFER_SIZE, %rsp
+    subq    $256 + LOCAL_BUFFER_SIZE, %rsp
     andq    $-4096, %rsp    # align stack
     STACK_TOUCHING

kernel/x86_64/gemm_kernel_8x4_sse.S (+1 -1)
@@ -383,7 +383,7 @@
     EMMS
     movq    %rsp, %rbx    # save old stack
-    subq    $128 + LOCAL_BUFFER_SIZE, %rsp
+    subq    $256 + LOCAL_BUFFER_SIZE, %rsp
     andq    $-4096, %rsp    # align stack
     STACK_TOUCHING
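
For readers less familiar with the prologue, the arithmetic in the two changed instructions (subtract a fixed reserve plus LOCAL_BUFFER_SIZE from %rsp, then round down to a 4096-byte boundary) can be modelled in C. This is only a sketch: the value of LOCAL_BUFFER_SIZE below is a made-up placeholder (the real one is defined in the kernel sources and is not shown in this diff), and `adjust_stack` is a hypothetical helper name, not something from OpenBLAS.

```c
#include <assert.h>
#include <stdint.h>

/* Placeholder value for illustration only; the real LOCAL_BUFFER_SIZE
   is defined in the kernel sources and does not appear in this diff. */
#define LOCAL_BUFFER_SIZE 1024u

/* Hypothetical helper modelling the two instructions:
 *   subq $reserve + LOCAL_BUFFER_SIZE, %rsp
 *   andq $-4096, %rsp
 * Returns the stack pointer value after both steps. */
static uint64_t adjust_stack(uint64_t rsp, uint64_t reserve)
{
    uint64_t total = reserve + LOCAL_BUFFER_SIZE;
    uint64_t aligned = (rsp - total) & ~(uint64_t)4095; /* same mask as -4096 */

    /* Rounding down can only grow the gap, so at least `total` bytes
       lie between the new and the old stack pointer. */
    assert(aligned <= rsp - total);
    return aligned;
}
```

With the placeholder size, `adjust_stack(0x7ffd12345480, 128)` lands on `0x7ffd12345000`, while the same call with a reserve of 256 lands one page lower, on `0x7ffd12344000`. For other starting values the rounding absorbs the extra 128 bytes and both reserves give the same aligned address, so the change does not always move the final stack pointer.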