Xiaoda Zhang
364858cbc9
In sharding propagation, to keep strategy consistent of parameter being used by multiple operators, we check the edge with one node of TmpIdentityInfo
4 years ago
Xiaoda Zhang
04db51a528
In a previous PR ( https://gitee.com/mindspore/mindspore/pulls/26807/ ), we replaced 'auto_parallel_search_mode' by 'search_mode' directly.
However, to be forward compatitable, it is suitable to keep 'auto_parallel_search_mode' available. This PR recovers the 'auto_parallel_search_mode' interface and adds a warning when using this old interface.
This PR also deals with other codestyle things.
4 years ago
i-robot
9f8ec2c5ab
!26807 [Auto parallel] [Sharding propagation] Interface change of sharding propagation
Merge pull request !26807 from Xiaoda/113-auto-parallel-search-mode-changes-to-search-mode
4 years ago
i-robot
6ecbc97fd6
!26804 virtual_dataset_avoid_auto_parallel
Merge pull request !26804 from yao_yf/virtual_dataset_avoid_auto_parallel
4 years ago
i-robot
b282414de7
!26619 arallel_ut_refactoring
Merge pull request !26619 from 王志伟/parallel_ut_refactoring1
4 years ago
Xiaoda Zhang
ad5ac77ae8
1) 'auto_parallel_search_mode' changes to 'search_mode';
2) 'sharding_propagation' moves to 'search_mode';
4 years ago
yao_yf
f29ce1fb60
virtual dataset avoid auto parallel
4 years ago
i-robot
519f14a909
!26006 slice recompute activation
Merge pull request !26006 from yao_yf/add_transformer_slice_activation_config
4 years ago
wzw
86c5ad20c8
parallel_ut_refactoring1
4 years ago
i-robot
1b8c2ff0e9
!26414 fault_recover_by_mirror_group_fix_opt_shard
Merge pull request !26414 from yao_yf/fault_recover_by_mirror_group_fix_opt_shard
4 years ago
yao_yf
188d39da83
slice_activation_in_recompute
slice recompute activation
4 years ago
yao_yf
01dc4bbdf9
fix fault recover in optimizer shard
4 years ago
Xiaoda Zhang
df67e74eaf
making sharding_propagation smooth, add a reshape justification:
1) when propagate sharding strategy from one op to another, try to find the strategy with zero communication cost;
2) if there is no such strategy, find the strategy with minimum communication cost, and raise a warning;
4 years ago
i-robot
9f52343a6a
!26350 add check for resizenearestneighbor parallel op
Merge pull request !26350 from yangzhenzhang/add-check-for-resize-op
4 years ago
yangzhenzhang
ba99e4c505
add check for resize op
4 years ago
ttudu
33ac1de062
fix bug
4 years ago
i-robot
7a73bae5c3
!26036 add output strategy for matmul operator
Merge pull request !26036 from yangzhenzhang/add-output-strategy-for-op-init
4 years ago
Xiaoda Zhang
a772767265
support reshape in sharding propagation:
1) using 'swc index of strategy_cost_' as reshape's selected strategy;
2) when encountering reshape in BFS, select the 'swc index' with zero communication cost;
3) when encountering a reshape that is already visited, check whether there exists communication between reshape and current operator. It is OK if communication happens between two configured operators;
4) currently, two consecutive reshapes are not supported;
5) adjusting BFS structure in graph_costmodel.cc;
6) adjusting some code in step_auto_parallel.cc to avoid cyclomatic complexity.
4 years ago
yangzhenzhang
8431ba616c
add output strategy for op init
4 years ago
huangxinjing
f354ab22a3
add pipeline shard interface
Add support for no pipeline accugradient
Add delay tag for fusion op
Optimizer the visite order
add mirror for mini step control
Move the group to attributes
Add gradient_shard control for the mini step
Fix code stype
Fix ut description
Add interface
4 years ago
i-robot
ded1c77bbf
!25765 neighborExchangeV2 & grad
Merge pull request !25765 from TuDouNi/neighborExchangeV2
4 years ago
ttudu
e953c15cd2
NeighborExchangeV2 & Grad
4 years ago
yangzhenzhang
6ad6304b77
add output strategy
4 years ago
yangzhenzhang
c42081619e
add parallel op for resizenearestneighbor
4 years ago
wanyiming
4fbc59a98a
utfixs
4 years ago
i-robot
f83070728d
!24790 support user define strategy gen method under auto parallel context
Merge pull request !24790 from zhuyuxiao/master
4 years ago
zhuyuxiao
1907246931
change api
4 years ago
i-robot
3fd94000c5
!24568 Apply batch parallel in auto_parallel mode when strategies are not specified
Merge pull request !24568 from zhuyuxiao/master
4 years ago
zhuyuxiao
cf76c76745
apply batch parallel in auto_parallel mode when strategies are not specified
4 years ago
yao_yf
b303d6001c
parallel ut refactor
4 years ago
i-robot
7cde7731b0
!23537 Update pangu reshape and softmax.
Merge pull request !23537 from linqingke/pangu
4 years ago
i-robot
e7cb505e68
!23569 Produce parallel operators for ResizeBilinear and ResizeNearestNeighbor
Merge pull request !23569 from Bert0108/resizebilinear_parallel_ops
4 years ago
i-robot
d37fccc56f
!23544 remove deprecated gather op
Merge pull request !23544 from zhuyuxiao/master
4 years ago
Bert0108
2d3d0b673e
parallel operators for ResizeBilinear and ResizeNearestNeighbor
4 years ago
linqingke
acde7febef
update pangu reshape and softmax performance.
Add layer norm judge
Fix layer norm name error
Fix input tyoe check
Fix ut test
Add 3d supports
4 years ago
zhoufeng
1f934bd782
check neighbor attr type
Signed-off-by: zhoufeng <zhoufeng54@huawei.com>
4 years ago
zhuyuxiao
79d99323a2
rename gather
4 years ago
huangxinjing
0b89d5c9c4
fix batch size error
4 years ago
i-robot
bbdacd41f4
!20585 add security isolate for save_graphs
Merge pull request !20585 from huanghui/add-security-isolate-for-DumpIR
4 years ago
i-robot
db19a40280
!23365 add print op security
Merge pull request !23365 from fangzehua/print_sec
4 years ago
huanghui
ba66c0d491
add security isolate for save_graphs
4 years ago
fangzehua
4ccc635a07
add print security
4 years ago
yangzhenzhang
1b8eb283e4
modify batch parallel info
4 years ago
huangxinjing
e02f553010
Fix spell error and add mode check
4 years ago
Xiaoda Zhang
5613c0b974
add a moe implementation:
1) extend the Liner cell for including BatchMatMul implementation, in
which the first dimension indicates the expert number;
2) implement a Switch (top1) router;
3) implement a MoE cell, which extends the FeedForward cell.
4 years ago
i-robot
77424eaad5
!23004 Add args Check for Transformer
Merge pull request !23004 from huangxinjing/args_check
4 years ago
huangxinjing
6cea07f749
Add args check
4 years ago
yao_yf
3ef26288a2
parallel_sparse_attention_ops_fix_repeated_cal
4 years ago
i-robot
fa12d62d4d
!21776 set device_id master 0813
Merge pull request !21776 from mindspore_ding/set_device_id_master_0813
4 years ago
dingpeifei
b4bc6000dc
set device id master 0813
4 years ago