zhanyuan
|
488147dcbd
|
[MSLITE] Optimize depthwise conv 3x3 arm64
|
5 years ago |
mindspore-ci-bot
|
624f6b1607
|
!7715 [MSLITE][Develop] optimize arm cpu int8 op conv dw 3x3, add border assembly
Merge pull request !7715 from yangruoqi713/conv_dw
|
5 years ago |
yangruoqi713
|
9e274b6468
|
[MSLITE][Develop] optimize arm cpu int8 op conv dw 3x3, add border assembly
|
5 years ago |
fuzhiye
|
8485d8c89c
|
fix deDepthwise int8 bug
|
5 years ago |
mindspore-ci-bot
|
17764803ef
|
!7648 [MSLITE] deconv winograd fp16 neon
Merge pull request !7648 from ling/sr
|
5 years ago |
ling
|
d8b928b7f8
|
[MSLITE] deconv winograd fp16 neon
|
5 years ago |
yangruoqi713
|
b1dbfa643b
|
[MSLITE][Develop] optimize arm cpu int8 op conv dw 3x3: add assembly arm32
|
5 years ago |
zhanyuan
|
4b810f5ee3
|
Optimize depthwise convolution 3x3 for arm64 platform
|
5 years ago |
mindspore-ci-bot
|
80bcd0acdb
|
!7563 [MSLITE][Develop] optimize arm cpu int8 op conv dw 3x3: add assembly
Merge pull request !7563 from yangruoqi713/lite
|
5 years ago |
ling
|
51fced3767
|
[MSLITE] deconv winograd fp16 neon
|
5 years ago |
yangruoqi713
|
89e83b92d0
|
[MSLITE][Develop] optimize arm cpu int8 op conv dw 3x3: add assembly
|
5 years ago |
mindspore-ci-bot
|
9b2b062642
|
!7527 [MS][LITE][Develop]add fp32 deconv kernels
Merge pull request !7527 from lixian/master
|
5 years ago |
lixian
|
aa94e5a91e
|
add fp32 deconv kernels
|
5 years ago |
mindspore-ci-bot
|
7747f4c471
|
!7457 [MSLITE] Support Fp32 Matrix-Vector Multiplication for FC/MATMUL Ops
Merge pull request !7457 from zhanyuan/tmp
|
5 years ago |
zhanyuan
|
cccaab4fdc
|
Support FP32 Matrix-Vector Multiplication
|
5 years ago |
ling
|
7d97c1b903
|
[MSLITE][Develop]deconv winograd input pack and output bias
|
5 years ago |
mindspore-ci-bot
|
89e11f6b2b
|
!7436 [MS][LITE][Develop]fix fp32 deconv relu_type bug
Merge pull request !7436 from lixian/master
|
5 years ago |
lixian
|
01140a29c0
|
fix fp32 deconv relu parameter bug
|
5 years ago |
mindspore-ci-bot
|
57ebdb4545
|
!7393 [MSLITE] Add matrix-vector multiplication for fp16 fullconnection
Merge pull request !7393 from zhanyuan/tmp
|
5 years ago |
zhanyuan
|
2635dc0f97
|
Optimize fullconnection kernel for vector input
|
5 years ago |
lixian
|
c7c045ccf0
|
post deconv assembly for arm32
|
5 years ago |
mindspore-ci-bot
|
257570b3b2
|
!7291 [MS][LITE][Develop]optimization for fp32 winograd on arm32
Merge pull request !7291 from lixian/master
|
5 years ago |
lixian
|
7f3582d0f5
|
optimization for fp32 winograd on arm32
|
5 years ago |
liuzhongkai
|
9457409314
|
fix winograd init bug
|
5 years ago |
mindspore-ci-bot
|
beb8bf5d65
|
!7059 [MS][LITE][Develop]fix fp16 matmul kernel write bug
Merge pull request !7059 from lixian/master
|
5 years ago |
lixian
|
d573a1180d
|
fix fp16 matmul bug
|
5 years ago |
liuzhongkai
|
f6f9d3915c
|
fp16 winograd init optimize
|
5 years ago |
liuzhongkai
|
8f678accf3
|
arm32 winograd init optimize
|
5 years ago |
mengyuanli
|
d56eb90044
|
sync op_base act type with schema
|
5 years ago |
lixian
|
869bffe976
|
optimization for fp16 matmul kernel
|
5 years ago |
mindspore-ci-bot
|
f2b1392c7c
|
!6984 [LITE][CPU][DEVELOP]optimize arm64 model init time
Merge pull request !6984 from liuzhongkai/winograd_optimize
|
5 years ago |
liuzhongkai
|
d994d3430f
|
winograd init optimize
|
5 years ago |
mindspore-ci-bot
|
e7eafb38a0
|
!6967 [MS][LITE][Develop]merge border handling for matmul
Merge pull request !6967 from lixian/master
|
5 years ago |
lixian
|
895efc6e2f
|
merge border handling for matmul
|
5 years ago |
mindspore-ci-bot
|
dcc4bb1d5c
|
!6960 [MS][LITE][Develop]optimization for fp32 matmul kernel on arm64
Merge pull request !6960 from lixian/master
|
5 years ago |
lixian
|
cf9d13b24e
|
optimization for fp32 matmul kernel on arm64
|
5 years ago |
zhanyuan
|
638fa0c32f
|
Fix the bug of fp16 matmul asm
|
5 years ago |
zhangxuetong
|
d92a3eeeed
|
fix ConvDwFp16Center bug
|
5 years ago |
lixian
|
939ccf7b58
|
optimization for fp32 matmul kernel on arm32
|
5 years ago |
yangruoqi713
|
53c6862a6f
|
[MSLITE][Develop] support conv_depthwise arm32 int8 weight perchannel
|
5 years ago |
mindspore-ci-bot
|
f909f7d02c
|
!6426 [MSLITE][Develop] deconv arm32 fp32 bug
Merge pull request !6426 from ling/bug
|
5 years ago |
ling
|
d257fec69c
|
[MSLITE][Develop] fix arm32 deconv bug
|
5 years ago |
liuzhongkai
|
59bdc993c8
|
fix arm32 ConvDwInt8Center.S
|
5 years ago |
ling
|
a31d14aec5
|
[MSLITE][Develop] deconv fp16 bug
|
5 years ago |
ling
|
a19e6251bc
|
[MSLITE][Develop] modify optimize.so to sdot and fp16 so
|
5 years ago |
lixian
|
dcaf76a800
|
enable int8 kernel on arm32
|
5 years ago |
zhanyuan
|
cae7a22b51
|
Support per-channel quantization of int8 matmul for arm32 platform
|
5 years ago |
mindspore-ci-bot
|
966c52b7f6
|
!6194 [MSLITE][Develop]Conv1x1 public preTrasn neon code -> .S
Merge pull request !6194 from ling/conv1x1
|
5 years ago |
ling
|
96d01f17ec
|
[MSLITE][Develop]Conv1x1 preTrasn neon code -> .S
|
5 years ago |
mindspore-ci-bot
|
6873b53043
|
!6182 [MS][LITE][Develop] add arm32 fp32 DwBoder, Row, Center op
Merge pull request !6182 from liuzhongkai/arm32_new1
|
5 years ago |