huangxinjing
f354ab22a3
add pipeline shard interface
Add support for no pipeline accugradient
Add delay tag for fusion op
Optimizer the visite order
add mirror for mini step control
Move the group to attributes
Add gradient_shard control for the mini step
Fix code stype
Fix ut description
Add interface
4 years ago
ttudu
e953c15cd2
NeighborExchangeV2 & Grad
4 years ago
lichenever
9584e68767
fix_pipeline_backward_bug
4 years ago
zhoufeng
ecae690a19
Revert "fix same node is used by two comm op"
This reverts commit b09d411dc4 .
add AlltoAll and NeighborExchange as interface
4 years ago
Ziyan
be1f5a43d7
opt shard fit micro batch
4 years ago
lichenever
db5d508356
pipeline_split_adapt_master
5 years ago
Ziyan
ec9793861f
fix grad accu
5 years ago
yangzhenzhang
a70d616841
mini step grad accumulation
5 years ago
yangzhenzhang
9da3f9bec9
mini step grad accumulation
5 years ago
Ziyan
98566ddc07
enable gradients mean in opt shard
5 years ago
mindspore-ci-bot
b67aaf6773
!9832 expose_allgather_fusion_to_users
From: @gong_zi_yan
Reviewed-by:
Signed-off-by:
5 years ago
Ziyan
bbf8ec82b9
expose allgather fusion interface to users
5 years ago
lichenever
9595502278
optimizer_pipline_split
5 years ago
huangxinjing
565ce81b29
Fix allsawp
5 years ago
mindspore-ci-bot
6b9e402790
!9396 enable allgather fusion
From: @gong_zi_yan
Reviewed-by: @stsuteng,@yangzhenzhang,@kisnwang
Signed-off-by: @stsuteng,@kisnwang
5 years ago
Ziyan
e29f5c96cb
enable_allgather_fusion
5 years ago
lichenever
78e131cf15
pipeline_split adapt parallel
5 years ago
ZPaC
db3a2d60cb
GPU supports p2p nccl interfaces
5 years ago
lichenever
ee2478c05d
change send_recv to inner
5 years ago
mindspore-ci-bot
8980bc3de7
!8675 support allreduce prod
From: @yao_yf
Reviewed-by: @stsuteng,@kisnwang
Signed-off-by: @stsuteng
5 years ago
huangxinjing
23284f0b35
Add AllSwap Op
5 years ago
yao_yf
1529d544b9
add allreduce prod
5 years ago
lichenever
2e1c43483e
add auto parallel pipeline
5 years ago
huangxinjing
12e9107162
Fix VirtualDiv Int32 error
5 years ago
Ziyan
ddc0113058
enable parallel optimizer in auto parallel
5 years ago
Yi Huaijie
6066b16838
implement parallel Pack
5 years ago
yangzhenzhang
f4bb43bbaf
add concat op
5 years ago
panyifeng
34e50e5d6e
fix cell bprop
5 years ago
lirongzhen1
51796aa624
fix sparse feature bug for auto parallel
5 years ago
wangnan39@huawei.com
86889c59cb
optimizer adapt IndexedSlices
5 years ago
Yi Huaijie
2eb739de6e
change HostAllGather and HostReduceScatter to internal interface
5 years ago
lirongzhen1
516b56cb64
sparse feature bp
5 years ago
Yi Huaijie
2f8e7ff693
add operator HostAllGather and HostReduceScatter
6 years ago
lirongzhen1
0b4648881b
add reducescatter bprop
6 years ago
zhunaipan
930a1fb0a8
initial version
Signed-off-by: leonwanghui <leon.wanghui@huawei.com>
6 years ago