i-robot
6bdd38399a
!25811 fault_recover_by_mirror_group
Merge pull request !25811 from yao_yf/fault_recover_by_mirror_group
4 years ago
i-robot
ede648876e
!26180 fix node type error
Merge pull request !26180 from jjfeing/master
4 years ago
yao_yf
501b978d16
find data parallel common group in auto parallel
4 years ago
i-robot
5211733add
!25614 [GraphKernel] Enable parallel fusion in Ascend and enhance parallel feature.
Merge pull request !25614 from TronZhang/parallel_support_in_ascend
4 years ago
tronzhang
e2a0c0d613
support parallel for ascend
4 years ago
i-robot
6c587dc2d3
!25391 Support to profiling parallel strategy
Merge pull request !25391 from ougongchang/profiling_stategy
4 years ago
i-robot
b21a98fca3
!26070 [GraphKernel] move file_utils from ccsrc to core.
Merge pull request !26070 from chenlei_autodiff/move_file_utils
4 years ago
jjfeing
d4e2a21d26
fix node type error
4 years ago
looop5
58e27d87bc
add Custom, custom_op_info_register, CustomRegOp to __init__
4 years ago
ougongchang
9229f1c1ff
profiler support to collect parallel strategy info
If SetNodeOutputType functions forcibly splits into multiple functions, the readability decreases, so it blocks lizard scans
4 years ago
i-robot
816de6f0ee
!26055 convert attr to value node
Merge pull request !26055 from yanzhenxiang2020/aicpu_random_seed_to_input
4 years ago
chenlei_autodiff
13777375bd
[GraphKernel] move file_utils from ccsrc to core.
4 years ago
i-robot
14efcd5a1c
!26030 [GraphKernel] Add Compiling Macros in graph_kernel_flags.
Merge pull request !26030 from chenlei_autodiff/decouple_code
4 years ago
i-robot
8bf7e28fa6
!25410 add dump flag for fusion nodes
Merge pull request !25410 from yuchaojie/ir_fusion3
4 years ago
i-robot
9b00c2d941
!26047 [ME][Fallback] Modify fallback log level
Merge pull request !26047 from Margaret_wangrui/fallback_log
4 years ago
jjfeing
34b73e305d
convert attr to value node
4 years ago
Margaret_wangrui
ea95e2c7d2
modify fallback log level
4 years ago
yuchaojie
0c90aecae4
add dump flag for fusion nodes
4 years ago
chenlei_autodiff
6ac7471d5c
[GraphKernel] Add Compiling Macros in graph_kernel_flags.
4 years ago
huangxinjing
f354ab22a3
add pipeline shard interface
Add support for no pipeline accugradient
Add delay tag for fusion op
Optimizer the visite order
add mirror for mini step control
Move the group to attributes
Add gradient_shard control for the mini step
Fix code stype
Fix ut description
Add interface
4 years ago
i-robot
ded1c77bbf
!25765 neighborExchangeV2 & grad
Merge pull request !25765 from TuDouNi/neighborExchangeV2
4 years ago
ttudu
e953c15cd2
NeighborExchangeV2 & Grad
4 years ago
Margaret_wangrui
ae7ff1d17d
Add Primitive type into ValueNameToConverterVector
4 years ago
He Wei
fd8bbb4d96
Optimize isa() and type id related code.
1. Let 'isa<T>()' calls 'IsSameTypeId()' when T is final,
and we add 'final' mark to some final classes derived from Base;
2. Generate type id at compile time using constexpr hash function,
remove TypeIdManager and related code;
3. Add a tool script 'scripts/check_tid.sh' to check tid uniqueness;
4. Improve type search in 'proto_exporter.cc' and 'convert_utils_py.cc';
5. Add some missed MS_DECLARE_PARENT;
6. Ensure calls of 'parent_t::IsFromTypeId()' are all inlined.
4 years ago
Yang Jiao
9bcf28d15d
adapt custom op to pyfunc kernel
4 years ago
i-robot
324a767fe4
!25415 insert atomic ops process split
Merge pull request !25415 from liubuyu/master
4 years ago
LaiYongqiang
7f251e3f08
add attr kAttrSkipNopOpAddr for nop node hidden in execution order
4 years ago
lby
fe18bd8695
insert atomic clean ops
4 years ago
i-robot
ff924d6e4b
!25395 unified runtime support skip nop node
Merge pull request !25395 from limingqi107/new_actor_runtime
4 years ago
limingqi107
2fe803f81d
unified runtime support skip nop node
4 years ago
looop5
0b34bf33bc
add custom op pynative testcases
4 years ago
i-robot
aa63062595
!25082 Set akg kernel attrs in backend pass
Merge pull request !25082 from DeshiChen/1015_set_kernel_attr
4 years ago
dayschan
6600c0c474
Set akg kernel attrs in backend pass
it's unreasonable to change the node when generating kernel json.
instead, it should be set in a pass.
most of the operators in original akg_kernel_attrs_process are not longer used,
so we deleted them, leaving only the "Cast" and "MatMul/BatchMatMul".
4 years ago
i-robot
8cf8e4ddd3
!21778 support tensor slice when slice result should be caculated in runtime mode
Merge pull request !21778 from lianliguang/support-dynamic-slice
4 years ago
i-robot
ee70517a5b
!25070 fix single mode bug
Merge pull request !25070 from baihuawei/single_mode_bug
4 years ago
baihuawei
a16864069b
fix single mode bug
4 years ago
i-robot
4268cf5de0
!25026 support dynamic shape for cpu kernel of mae net
Merge pull request !25026 from fangzehua/dynamic_mae_1014
4 years ago
fangzehua
53009276d1
add cpu dynamic memory
4 years ago
lianliguang
72c7706d22
support dynamic slice in runtime mode
4 years ago
Yang Jiao
2d83f0e9ef
fix static-check
4 years ago
dayschan
7502345c8f
Add GraphKernelFlags into namespace mindspore::graphkernel
4 years ago
i-robot
06b0beced7
!22731 GraphKernel supports CPU
Merge pull request !22731 from DeshiChen/0901_graphkernel_cpu
4 years ago
dayschan
32ecd8ee79
GraphKernel supports CPU
only Linux system is supported now.
change the default value of `ENABLE_AKG` to off, and controlled by option `-K`.
the `ENABLE_AKG` is auto enabled when `ENABLE_GPU` or `ENABLE_D` is on.
since now, we can use `ENABLE_AKG` to control the compilation of graphkernel
and akg codes.
fix usage description for option "-K", it should be "[-K on|off]".
LLVM is required by akg for cpu kernels, so AKG for cpu is default disabled now.
4 years ago
limingqi107
0a9cbd2220
imporve performance of CPU LSTMGrad
4 years ago
i-robot
f98cc8dce2
!23599 Handle real to complex in bprob
Merge pull request !23599 from zhouyaqiang0/complex_ops
4 years ago
zhouyaqiang
3d01cb0c3d
handle real to complex bprob
4 years ago
i-robot
d9e6edfc9f
!23669 Fix inline pass problem in switch.
Merge pull request !23669 from LiangZhibo/inline
4 years ago
l00591931
092fcc40c9
Fix inline pass problem on switch
4 years ago
i-robot
9309a11999
!23352 [Fallback] Handle interpreted node as input, set function Parameters and update local params if need.
Merge pull request !23352 from 张清华/opt_fallback
4 years ago
i-robot
b825fd281d
!23697 fix warnings of pclint plus, codedex and codecheck at 0917
Merge pull request !23697 from huanghui/clean-warnings-0917
4 years ago