i-robot
fa5ea7b3a6
!26370 DynamicRNNGrad support `hidden_size not multiple of 16` scenario
Merge pull request !26370 from yuchaojie/ir_fusion4
4 years ago
i-robot
519f14a909
!26006 slice recompute activation
Merge pull request !26006 from yao_yf/add_transformer_slice_activation_config
4 years ago
yuchaojie
b760eba23a
DynamicRNNGrad support `hidden_size not multiple of 16` scenario
4 years ago
yao_yf
188d39da83
slice_activation_in_recompute
slice recompute activation
4 years ago
i-robot
9d6248194e
!26310 MindSpore support load custom aicpu kernels.
Merge pull request !26310 from linqingke/aicpu
4 years ago
i-robot
5233c73805
!25592 Reshape support shape is variable
Merge pull request !25592 from wangnan39/reshape_support_tensor
4 years ago
王南
1163cfe967
reshape support shape is tensor
4 years ago
linqingke
bef2923acf
MindSpore support load custom aicpu ops.
4 years ago
i-robot
aac1291062
!26297 compiler support dump flag
Merge pull request !26297 from huanghui/cell-dump
4 years ago
huanghui
35cb09a536
compiler support dump flag
4 years ago
i-robot
ede648876e
!26180 fix node type error
Merge pull request !26180 from jjfeing/master
4 years ago
jjfeing
d4e2a21d26
fix node type error
4 years ago
looop5
58e27d87bc
add Custom, custom_op_info_register, CustomRegOp to __init__
4 years ago
i-robot
816de6f0ee
!26055 convert attr to value node
Merge pull request !26055 from yanzhenxiang2020/aicpu_random_seed_to_input
4 years ago
i-robot
8bf7e28fa6
!25410 add dump flag for fusion nodes
Merge pull request !25410 from yuchaojie/ir_fusion3
4 years ago
jjfeing
34b73e305d
convert attr to value node
4 years ago
yuchaojie
0c90aecae4
add dump flag for fusion nodes
4 years ago
huangxinjing
f354ab22a3
add pipeline shard interface
Add support for no pipeline accugradient
Add delay tag for fusion op
Optimize the visit order
add mirror for mini step control
Move the group to attributes
Add gradient_shard control for the mini step
Fix code style
Fix ut description
Add interface
4 years ago
ttudu
e953c15cd2
NeighborExchangeV2 & Grad
4 years ago
Yang Jiao
9bcf28d15d
adapt custom op to pyfunc kernel
4 years ago
i-robot
324a767fe4
!25415 insert atomic ops process split
Merge pull request !25415 from liubuyu/master
4 years ago
LaiYongqiang
7f251e3f08
add attr kAttrSkipNopOpAddr for nop node hidden in execution order
4 years ago
lby
fe18bd8695
insert atomic clean ops
4 years ago
i-robot
ff924d6e4b
!25395 unified runtime support skip nop node
Merge pull request !25395 from limingqi107/new_actor_runtime
4 years ago
limingqi107
2fe803f81d
unified runtime support skip nop node
4 years ago
looop5
0b34bf33bc
add custom op pynative testcases
4 years ago
i-robot
aa63062595
!25082 Set akg kernel attrs in backend pass
Merge pull request !25082 from DeshiChen/1015_set_kernel_attr
4 years ago
dayschan
6600c0c474
Set akg kernel attrs in backend pass
it's unreasonable to change the node when generating kernel json.
instead, it should be set in a pass.
most of the operators in original akg_kernel_attrs_process are no longer used,
so we deleted them, leaving only the "Cast" and "MatMul/BatchMatMul".
4 years ago
i-robot
8cf8e4ddd3
!21778 support tensor slice when slice result should be calculated in runtime mode
Merge pull request !21778 from lianliguang/support-dynamic-slice
4 years ago
i-robot
ee70517a5b
!25070 fix single mode bug
Merge pull request !25070 from baihuawei/single_mode_bug
4 years ago
baihuawei
a16864069b
fix single mode bug
4 years ago
fangzehua
53009276d1
add cpu dynamic memory
4 years ago
lianliguang
72c7706d22
support dynamic slice in runtime mode
4 years ago
limingqi107
0a9cbd2220
improve performance of CPU LSTMGrad
4 years ago
i-robot
d9e6edfc9f
!23669 Fix inline pass problem in switch.
Merge pull request !23669 from LiangZhibo/inline
4 years ago
l00591931
092fcc40c9
Fix inline pass problem on switch
4 years ago
i-robot
dc87bf7f4d
!23051 add TransposedUpdateFusion
Merge pull request !23051 from yuchaojie/ir_fusion3
4 years ago
i-robot
fb58cfa273
!23414 update pynative profiling codes and delete some useless code
Merge pull request !23414 from lvchangquan/master
4 years ago
yuchaojie
6cd16d61c4
add TransposedUpdateFusion
4 years ago
lvchangquan
b908f40d94
delete some useless codes and update pynative profiling codes
4 years ago
kswang
a0dc465041
add mem scheduler
4 years ago
i-robot
d7388b40ab
!23295 add dump and profiling warning log when task is not sink
Merge pull request !23295 from baihuawei/fixlog
4 years ago
i-robot
aac70cb939
!22873 parallel_optimizer_allgather_recompute_fix
Merge pull request !22873 from yao_yf/parallel_optimizer_allgather_recompute_fix
4 years ago
baihuawei
e1e11b9a47
fix some bugs
4 years ago
yao_yf
d749690a61
adjust parallel optimizer recompute allgather depend
4 years ago
lby
27ae7cce43
code clean
4 years ago
王南
de540c68bb
asr support dynamic shape
4 years ago
guoqi
8fccec4c20
insert overflow check operators according to the 'gradients' scope
4 years ago
i-robot
16437722fd
!22465 Adam op performance optimization
Merge pull request !22465 from zhaosida/zsd_pangu
4 years ago
zhaosida
be34ccd29f
Adam op performance optimization
4 years ago