ttudu
33ac1de062
fix bug
4 years ago
i-robot
7a73bae5c3
!26036 add output strategy for matmul operator
Merge pull request !26036 from yangzhenzhang/add-output-strategy-for-op-init
4 years ago
Xiaoda Zhang
a772767265
support reshape in sharding propagation:
1) using 'swc index of strategy_cost_' as reshape's selected strategy;
2) when encountering reshape in BFS, select the 'swc index' with zero communication cost;
3) when encountering a reshape that is already visited, check whether there exists communication between reshape and current operator. It is OK if communication happens between two configured operators;
4) currently, two consecutive reshapes are not supported;
5) adjusting BFS structure in graph_costmodel.cc;
6) adjusting some code in step_auto_parallel.cc to avoid cyclomatic complexity.
4 years ago
yangzhenzhang
8431ba616c
add output strategy for op init
4 years ago
huangxinjing
f354ab22a3
add pipeline shard interface
Add support for no pipeline accugradient
Add delay tag for fusion op
Optimizer the visite order
add mirror for mini step control
Move the group to attributes
Add gradient_shard control for the mini step
Fix code stype
Fix ut description
Add interface
4 years ago
i-robot
ded1c77bbf
!25765 neighborExchangeV2 & grad
Merge pull request !25765 from TuDouNi/neighborExchangeV2
4 years ago
ttudu
e953c15cd2
NeighborExchangeV2 & Grad
4 years ago
yangzhenzhang
6ad6304b77
add output strategy
4 years ago
yangzhenzhang
c42081619e
add parallel op for resizenearestneighbor
4 years ago
wanyiming
4fbc59a98a
utfixs
4 years ago
i-robot
f83070728d
!24790 support user define strategy gen method under auto parallel context
Merge pull request !24790 from zhuyuxiao/master
4 years ago
zhuyuxiao
1907246931
change api
4 years ago
i-robot
3fd94000c5
!24568 Apply batch parallel in auto_parallel mode when strategies are not specified
Merge pull request !24568 from zhuyuxiao/master
4 years ago
zhuyuxiao
cf76c76745
apply batch parallel in auto_parallel mode when strategies are not specified
4 years ago
yao_yf
b303d6001c
parallel ut refactor
4 years ago
i-robot
7cde7731b0
!23537 Update pangu reshape and softmax.
Merge pull request !23537 from linqingke/pangu
4 years ago
i-robot
e7cb505e68
!23569 Produce parallel operators for ResizeBilinear and ResizeNearestNeighbor
Merge pull request !23569 from Bert0108/resizebilinear_parallel_ops
4 years ago
i-robot
d37fccc56f
!23544 remove deprecated gather op
Merge pull request !23544 from zhuyuxiao/master
4 years ago
Bert0108
2d3d0b673e
parallel operators for ResizeBilinear and ResizeNearestNeighbor
4 years ago
linqingke
acde7febef
update pangu reshape and softmax performance.
Add layer norm judge
Fix layer norm name error
Fix input tyoe check
Fix ut test
Add 3d supports
4 years ago
zhoufeng
1f934bd782
check neighbor attr type
Signed-off-by: zhoufeng <zhoufeng54@huawei.com>
4 years ago
zhuyuxiao
79d99323a2
rename gather
4 years ago
huangxinjing
0b89d5c9c4
fix batch size error
4 years ago
i-robot
bbdacd41f4
!20585 add security isolate for save_graphs
Merge pull request !20585 from huanghui/add-security-isolate-for-DumpIR
4 years ago
i-robot
db19a40280
!23365 add print op security
Merge pull request !23365 from fangzehua/print_sec
4 years ago
huanghui
ba66c0d491
add security isolate for save_graphs
4 years ago
fangzehua
4ccc635a07
add print security
4 years ago
yangzhenzhang
1b8eb283e4
modify batch parallel info
4 years ago
huangxinjing
e02f553010
Fix spell error and add mode check
4 years ago
Xiaoda Zhang
5613c0b974
add a moe implementation:
1) extend the Liner cell for including BatchMatMul implementation, in
which the first dimension indicates the expert number;
2) implement a Switch (top1) router;
3) implement a MoE cell, which extends the FeedForward cell.
4 years ago
i-robot
77424eaad5
!23004 Add args Check for Transformer
Merge pull request !23004 from huangxinjing/args_check
4 years ago
huangxinjing
6cea07f749
Add args check
4 years ago
yao_yf
3ef26288a2
parallel_sparse_attention_ops_fix_repeated_cal
4 years ago
i-robot
fa12d62d4d
!21776 set device_id master 0813
Merge pull request !21776 from mindspore_ding/set_device_id_master_0813
4 years ago
dingpeifei
b4bc6000dc
set device id master 0813
4 years ago
i-robot
d87d0e07c2
!22255 recompute_interface_modify
Merge pull request !22255 from yao_yf/recompute_interface_modify
4 years ago
yao_yf
39055af6e4
recompute interface modify
4 years ago
i-robot
cc8d614b25
!22650 fixed sparse attention modify
Merge pull request !22650 from yao_yf/fixed_sparse_attention_modify
4 years ago
i-robot
389f3a6b6c
!21835 make alltoall and neighborexchange to be interface && revert pr 21395
Merge pull request !21835 from zhoufeng/revert-same-input-to-comm-op
4 years ago
yao_yf
82889ec56b
fixed sparse attention
4 years ago
yao_yf
68dd138462
add parallel sparse attention ops: dsd_matmul
4 years ago
zhoufeng
ecae690a19
Revert "fix same node is used by two comm op"
This reverts commit b09d411dc4 .
add AlltoAll and NeighborExchange as interface
4 years ago
yao_yf
b8a9cbe2a3
add cus_matmul_dds parallel ops
4 years ago
Zhang Qinghua
a137fa1d0b
Optimize the Executors routines.
- Fix the key generating.
- Distinguish the executors.
4 years ago
zhihenghu
ce12c02343
Add Sparse Attention
adjut the file structure and name
Deleted extra information
Do some formatting work
Add test case and fix some document
fix imports
4 years ago
i-robot
e6e1f37ae4
!22346 [Core] Fix the bug of scope setting when cloning nodes
Merge pull request !22346 from Xiaoda/86-fix-the-fullname-scope-bug
4 years ago
i-robot
8d00a8d803
!22360 Fix Transformer Mirror Error
Merge pull request !22360 from huangxinjing/fix_transformer_mirror_error
4 years ago
Xiaoda Zhang
b2703879c6
fix the scope setting error when cloning nodes
4 years ago
i-robot
edcbb68d71
!22386 fix neighborexchange empty input case
Merge pull request !22386 from zhoufeng/fix-neighbor-empty-input-bak
4 years ago
zhoufeng
e5a1582e4b
fix neighborexchange empty input case
Signed-off-by: zhoufeng <zhoufeng54@huawei.com>
4 years ago