tronzhang
c32bf5ac28
promote complex tensor to graph input and correct getitem index for graph kernel fusion.
5 years ago
mindspore-ci-bot
21c5607fca
!6971 cudnn inplace optimizer
Merge pull request !6971 from chenweifeng/tensoradd_inplace
5 years ago
wilfChen
b420b6cda7
cudnn inplace optimizer
5 years ago
dayschan
7599686a72
GraphKernel supports multi-output kernels
5 years ago
lingyunli63
dd48f10c3d
add assign ops in composite_topi
5 years ago
mindspore-ci-bot
57ecb40022
!6825 GPU add combine cast fusion
Merge pull request !6825 from VectorSL/combine-cast
5 years ago
VectorSL
f36c2721af
gpu add combine cast fusion
5 years ago
mindspore-ci-bot
e4ad166e0c
!6818 GPU add combine mom fusion
Merge pull request !6818 from VectorSL/combine-mom
5 years ago
mindspore-ci-bot
25f8c6dc8d
!6681 Fix ub fusion's cycle fusion
Merge pull request !6681 from huanghui/fix-ub-fusion-loop
5 years ago
VectorSL
8dca80036a
gpu add combine mom fusion
5 years ago
liubuyu
8af3250477
support dynamic_rnn and dynamic_rnn_grad op
5 years ago
huanghui
90dfecfb00
fix bnupdate_eltwise_eltwise's cycle fusion
5 years ago
liubuyu
96622fc804
support extract_image_patches op
5 years ago
VectorSL
48db7f8c4f
gpu change bncast
5 years ago
mindspore-ci-bot
fe219b5680
!6181 make ref node's update edge using same address
Merge pull request !6181 from lianliguang/fix-bug-of-deal-ref
5 years ago
mindspore-ci-bot
564b99e549
!6541 fix bug of insert transdata in pynative mode
Merge pull request !6541 from lianliguang/fix-bug-of-insert-transdata-of-pynative
5 years ago
mindspore-ci-bot
149285b6f2
!6532 GPU fix BnCast
Merge pull request !6532 from VectorSL/bncast
5 years ago
mindspore-ci-bot
59f582faa3
!6539 Only match the pattern of Conv2DBackpropInput+ReluGradV2
Merge pull request !6539 from huanghui/fix-ub-fusion-conv2dbpinput-eltwise
5 years ago
huanghui
af0a11ff69
Update the pattern of conv2dbackprop_eltwise_fusion pass. Only match Conv2dBackpropInput+ReluGradV2.
5 years ago
William Lian
25fcd0488e
fix transdata insertion in pynative mode's backward gradient graph when the node is the output of the graph
5 years ago
yujianfeng
1b45e5fe97
Fix codedex warning
5 years ago
VectorSL
50dc89332c
fix bn cast
5 years ago
limingqi107
740254161e
gpu code review
5 years ago
mindspore-ci-bot
deb1d2ba96
!6403 fix codex clean bug
Merge pull request !6403 from laiyongqiang/codex2
5 years ago
mindspore-ci-bot
0c2e7f5092
!6321 codex warning
Merge pull request !6321 from chenweifeng/codex
5 years ago
laiyongqiang
e13beaa7ed
clean codex warning
5 years ago
jjfeing
37ec5c0a1f
fix code warning
5 years ago
wilfChen
aacf7c2e34
codex warning
5 years ago
mindspore-ci-bot
014ea619a8
!6240 Fix review_bot and codedex problems
Merge pull request !6240 from DeshiChen/0910_review_bot
5 years ago
mindspore-ci-bot
ff42cd87b2
!6247 Fix cpu ScatterNdUpdate doesn't update output
Merge pull request !6247 from huanghui/clear-warning
5 years ago
mindspore-ci-bot
c543db0585
!6180 clean codex warning in memreuse
Merge pull request !6180 from laiyongqiang/codex
5 years ago
huanghui
d6944a70ca
fix cpu kernel:ScatterNdUpdate doesn't set output
5 years ago
dayschan
3c2da3197f
Fix review_bot and codedex problems
5 years ago
mindspore-ci-bot
b83d921735
!3317 add check for stridedslice when choosing aicpu or aicore
Merge pull request !3317 from zhangbuxue/add_check_for_stridedslice_when_choose_aicpu_or_aicore
5 years ago
mindspore-ci-bot
f7691335eb
!6167 fused select and greater op to improve bert performance on GPU
Merge pull request !6167 from ZengZitao/fuse_greater_select_ms
5 years ago
laiyongqiang
4063a69846
clean codex warning
5 years ago
WilliamLian
d9fab277e3
make ref edge use the same address
5 years ago
zengzitao
a38d6139fa
fused select and greater op to improve bert performance on GPU
5 years ago
r1chardf1d0
88de0cffa9
open graph kernel expander opt for gpu
5 years ago
mindspore-ci-bot
a0e3fd6bf3
!6021 merge the right branch first in dx+addn+drelu fusion
Merge pull request !6021 from guoqi/master
5 years ago
guoqi
dbd8036751
fuse the right branch first in dx+addn+drelu fusion
5 years ago
buxue
08059f5c61
add check for stridedslice when choosing aicpu or aicore
5 years ago
John Tzanakakis
b0a7ebdeb0
enable debugger by default and set correct log message severity
5 years ago
mindspore-ci-bot
7b3873559f
!5883 support for frac_zn_lstm
Merge pull request !5883 from liubuyu/master
5 years ago
mindspore-ci-bot
7152fe04be
!5783 GraphKernel supports GPU
Merge pull request !5783 from DeshiChen/graph_kernel_1.0
5 years ago
dayschan
37a48f6aac
GraphKernel supports GPU
1. Update akg submodule
2. Refactor akg_kernel_build, akg_ascend_kernel_build, akg_gpu_kernel_build
3. Add akg_kernel_json_decoder to support converting kernel_json to AnfNode.
4. Add GraphKernel Cost Model. (mindspore/_extends/graph_kernel)
5. Add some GraphKernel passes to GpuSession, move these passes to backend/optimizer/graph_kernel.
6. Add global id for ir files.
7. Fix bug in ConstInputToAttr.
5 years ago
liubuyu
23a298ca81
support new format frac_zn_lstm
5 years ago
panyifeng
49432d11be
fix cpplint
5 years ago
limingqi107
5058e844cd
gpu inceptionv3 optimize
5 years ago
lizhenyu
c3d6918649
add kernel select after optimize pass
5 years ago