
!8038 [MSLITE][Develop] fix bug of arm32 fp32 cpu depthwise

Merge pull request !8038 from yangruoqi713/lite
tags/v1.1.0
mindspore-ci-bot Gitee 5 years ago
parent
commit
b3c6bdf6db
1 changed files with 5 additions and 5 deletions
mindspore/lite/nnacl/assembly/arm32/ConvDwFp32Row.S (+5, -5)

@@ -92,11 +92,11 @@ ConvDwFp32Row:
     beq Loop16LineEnd
 
 LoopDepth0:
-    vld1.32 {s0}, [r6]!
-    vld1.32 {s1}, [r8]!
-    vld1.32 {s2}, [r0]!
-    vmla.f32 s2, s0, s1
-    vst1.32 {s2}, [r11]!
+    vld1.32 d0[0], [r6]!
+    vld1.32 d2[0], [r8]!
+    vld1.32 d4[0], [r0]!
+    vmla.f32 s8, s0, s4
+    vst1.32 d4[0], [r11]!
     subs r10, r10, #1
     bne LoopDepth0
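
For readers who do not read ARM32 NEON, the LoopDepth0 tail loop can be sketched in C. On ARM32, d0[0], d2[0] and d4[0] alias s0, s4 and s8, so the new vmla.f32 s8, s0, s4 multiplies the two loaded values and accumulates into the third. The pointer roles below are assumptions read off the register usage (r6 as input, r8 as filter weights, r0/r11 as the read and write cursors over one output buffer, r10 as the remaining count). The function name conv_dw_tail is hypothetical and does not appear in the repository.

```c
#include <stddef.h>

/* Hypothetical scalar equivalent of the LoopDepth0 tail loop.
 * Assumed mapping: in <- r6, w <- r8, out <- r0 (load) and r11 (store),
 * n <- r10. Each iteration handles one float, as each lane load does. */
void conv_dw_tail(float *out, const float *in, const float *w, size_t n) {
    for (size_t i = 0; i < n; ++i) {
        /* vmla.f32 s8, s0, s4: accumulate in * w into the loaded output */
        out[i] += in[i] * w[i];
    }
}
```

The assembly loop is do-while shaped (subs then bne), so it relies on an earlier check to skip the loop when no elements remain; the for loop above folds that check into its condition.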



