i-robot
ede648876e
!26180 fix node type error
Merge pull request !26180 from jjfeing/master
4 years ago
jjfeing
d4e2a21d26
fix node type error
4 years ago
looop5
58e27d87bc
add Custom, custom_op_info_register, CustomRegOp to __init__
4 years ago
i-robot
816de6f0ee
!26055 convert attr to value node
Merge pull request !26055 from yanzhenxiang2020/aicpu_random_seed_to_input
4 years ago
i-robot
8bf7e28fa6
!25410 add dump flag for fusion nodes
Merge pull request !25410 from yuchaojie/ir_fusion3
4 years ago
jjfeing
34b73e305d
convert attr to value node
4 years ago
yuchaojie
0c90aecae4
add dump flag for fusion nodes
4 years ago
huangxinjing
f354ab22a3
add pipeline shard interface
Add support for no pipeline accugradient
Add delay tag for fusion op
Optimizer the visite order
add mirror for mini step control
Move the group to attributes
Add gradient_shard control for the mini step
Fix code stype
Fix ut description
Add interface
4 years ago
ttudu
e953c15cd2
NeighborExchangeV2 & Grad
4 years ago
Yang Jiao
9bcf28d15d
adapt custom op to pyfunc kernel
4 years ago
i-robot
324a767fe4
!25415 insert atomic ops process split
Merge pull request !25415 from liubuyu/master
4 years ago
LaiYongqiang
7f251e3f08
add attr kAttrSkipNopOpAddr for nop node hidden in execution order
4 years ago
lby
fe18bd8695
insert atomic clean ops
4 years ago
i-robot
ff924d6e4b
!25395 unified runtime support skip nop node
Merge pull request !25395 from limingqi107/new_actor_runtime
4 years ago
limingqi107
2fe803f81d
unified runtime support skip nop node
4 years ago
looop5
0b34bf33bc
add custom op pynative testcases
4 years ago
i-robot
aa63062595
!25082 Set akg kernel attrs in backend pass
Merge pull request !25082 from DeshiChen/1015_set_kernel_attr
4 years ago
dayschan
6600c0c474
Set akg kernel attrs in backend pass
it's unreasonable to change the node when generating kernel json.
instead, it should be set in a pass.
most of the operators in original akg_kernel_attrs_process are not longer used,
so we deleted them, leaving only the "Cast" and "MatMul/BatchMatMul".
4 years ago
i-robot
8cf8e4ddd3
!21778 support tensor slice when slice result should be caculated in runtime mode
Merge pull request !21778 from lianliguang/support-dynamic-slice
4 years ago
i-robot
ee70517a5b
!25070 fix single mode bug
Merge pull request !25070 from baihuawei/single_mode_bug
4 years ago
baihuawei
a16864069b
fix single mode bug
4 years ago
fangzehua
53009276d1
add cpu dynamic memory
4 years ago
lianliguang
72c7706d22
support dynamic slice in runtime mode
4 years ago
limingqi107
0a9cbd2220
imporve performance of CPU LSTMGrad
4 years ago
i-robot
d9e6edfc9f
!23669 Fix inline pass problem in switch.
Merge pull request !23669 from LiangZhibo/inline
4 years ago
l00591931
092fcc40c9
Fix inline pass problem on switch
4 years ago
i-robot
dc87bf7f4d
!23051 add TransposedUpdateFusion
Merge pull request !23051 from yuchaojie/ir_fusion3
4 years ago
i-robot
fb58cfa273
!23414 update pynative profiling codes and delete some useless code
Merge pull request !23414 from lvchangquan/master
4 years ago
yuchaojie
6cd16d61c4
add TransposedUpdateFusion
4 years ago
lvchangquan
b908f40d94
delete some useless codes and update pynative profiling codes
4 years ago
kswang
a0dc465041
add mem scheduler
4 years ago
i-robot
d7388b40ab
!23295 add dump and profiling warning log when task is not sink
Merge pull request !23295 from baihuawei/fixlog
4 years ago
i-robot
aac70cb939
!22873 parallel_optimizer_allgather_recompute_fix
Merge pull request !22873 from yao_yf/parallel_optimizer_allgather_recompute_fix
4 years ago
baihuawei
e1e11b9a47
fix some bugs
4 years ago
yao_yf
d749690a61
adjust parallel optimizer recompute allgather depend
4 years ago
lby
27ae7cce43
code clean
4 years ago
王南
de540c68bb
asr support dynamic shape
4 years ago
guoqi
8fccec4c20
insert overflow check operaters according to the 'gradients' scope
4 years ago
i-robot
16437722fd
!22465 Adam op performance optimization
Merge pull request !22465 from zhaosida/zsd_pangu
4 years ago
zhaosida
be34ccd29f
Adam op performance optimization
4 years ago
i-robot
32281f84e7
!19000 update LayerNormGrad split pass to V2
Merge pull request !19000 from yuchaojie/ir_fusion2
4 years ago
baihuawei
a9694a9230
ascend add nontask sink mode
4 years ago
yuchaojie
1d1490df0b
xupdate LayerNormGrad split pass to V2
4 years ago
yuchaojie
7b3943bc91
DynamicRNN supports scenarios that hidden_size is not multiples of 16
4 years ago
ms_yan
36a8886ca2
Revert "[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset"
This reverts commit b077aa1cab .
Revert "[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset"
This reverts commit 4e6f7dc97d .
delete pass_registry_test.cc
comment hiai_nlu_model_multi.pb related line
4 years ago
djc
b077aa1cab
[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset
4 years ago
djc
4e6f7dc97d
[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset
4 years ago
i-robot
0b7c4a757a
!21966 [BugFix][ops][Ascend]Adapt DynamicGruV2Grad fission v2 for RuntimeError.
Merge pull request !21966 from liuxiao93/adapter-dynamicgrv2grad-pass-v2
4 years ago
i-robot
ef535853cb
!21796 refactor TransShape
Merge pull request !21796 from yuchaojie/op_select
4 years ago
liuxiao93
bb736c108e
adapt DynamicGruV2Grad fission v2.
4 years ago