fuzhiye
|
b43cdc5df9
|
1.change int8 conv which not support sdot scheme (using matmul)
2.free reduandent memory in fp16 kernels
|
5 years ago |
mindspore-ci-bot
|
7e8dad791a
|
!7526 [MSLITE][Develop] optimize conv dw arm cpu int8 op: add 3x3
Merge pull request !7526 from yangruoqi713/conv_dw
|
5 years ago |
yangruoqi713
|
161ecc4ed0
|
[MSLITE][Develop] optimize conv dw arm cpu int8 op: add 3x3
|
5 years ago |
liuzhongkai
|
3f2eadee58
|
int pack asm optimize
|
5 years ago |
fuzhiye
|
68b8c7e84c
|
replace int8 common gemm with int8 matmul
|
5 years ago |
fuzhiye
|
2d00b74de2
|
optimize winograd input transform func
|
5 years ago |
liuzhongkai
|
d994d3430f
|
windgrad init optimize
|
5 years ago |
fuzhiye
|
2b056b7a28
|
replace 8x8 block with 12x8 block in common conv
|
5 years ago |
fuzhiye
|
06142a330b
|
optimize fp16 common conv preprocess
|
5 years ago |
fuzhiye
|
e14ffdc8f4
|
optimize conv pre process
|
5 years ago |
mindspore-ci-bot
|
966c52b7f6
|
!6194 [MSLITE][Develop]Conv1x1 public preTrasn neon code -> .S
Merge pull request !6194 from ling/conv1x1
|
5 years ago |
ling
|
96d01f17ec
|
[MSLITE][Develop]Conv1x1 preTrasn neon code -> .S
|
5 years ago |
fuzhiye
|
66219788cb
|
optimize init func
|
5 years ago |
ling
|
c6a8848700
|
[MS][LITE][Develop] conv1x1 per oc arm64
|
5 years ago |
mindspore-ci-bot
|
16eda726b7
|
!6114 [MSLITE][Develop]conv1x1 arm32 filter peroc
Merge pull request !6114 from ling/arm32
|
5 years ago |
ling
|
236c8de5da
|
[MSLITE][Develop]conv1x1 arm32 filter peroc
|
5 years ago |
lixian
|
902f08be82
|
add matmul fp32 kernel on arm32
|
5 years ago |
mindspore-ci-bot
|
8097d6c278
|
!6038 [MSLITE][Develop] arm cpu int8 conv depthwise support activation per channel
Merge pull request !6038 from yangruoqi713/act_per_channel
|
5 years ago |
yangruoqi713
|
7175e1921e
|
[MSLITE][Develop] arm cpu int8 conv depthwise support activation per channel
|
5 years ago |
mindspore-ci-bot
|
55adbbeae8
|
!5818 [MSLITE][Develop] int8 conv 1x1 support per weight output-channel on x86
Merge pull request !5818 from ling/sr
|
5 years ago |
ling
|
0db75b70d7
|
[MS][LITE][Develop] int8 conv 1x1 support weight per output-channel on x86
|
5 years ago |
fuzhiye
|
6a9eead482
|
1.change pack && rewrite winograd
2.rewrite fp16 winograd
3.remove useless code
|
5 years ago |
ling
|
efe805655e
|
[MSLITE][Develop] comm conv int8 inputsum bug
|
5 years ago |
hangq
|
f1637f1157
|
1. clean review bot
2. fix nan bug
|
5 years ago |
lixian
|
5dea062915
|
optimization for int8
|
5 years ago |
ling
|
25f29dcf50
|
[MS][LITE][Develop] conv1x1 int8 weight transpose bug
|
5 years ago |
fuzhiye
|
a4114a6a6c
|
malloc using memory pool
|
5 years ago |
yangruoqi713
|
1177c8958b
|
[MS][LITE] optimize arm cpu fp32 op: conv depthwise
|
5 years ago |
fuzhiye
|
ac3905c268
|
extract post process func
|
5 years ago |
fuzhiye
|
d34c620dce
|
optimize winograd
|
5 years ago |
ling
|
6cfcdaab3d
|
[MS][LITE][Develop]conv1x1 int8
|
5 years ago |
chenjianping
|
d88a98658c
|
move nnacl to lite/
|
5 years ago |