i-robot
ad9757ccf0
!30661 [Auto parallel] [MoE] Fix an error of configuring MoE parallel
Merge pull request !30661 from Xiaoda/124-moe-changes
4 years ago
i-robot
789c1d6bd3
!30614 [AutoParallel] Fix Overflow As the cast is inserted before Mirror
Merge pull request !30614 from huangxinjing/fix_cast_error
4 years ago
Xiaoda Zhang
81e5abe580
fix an error of configuring parallel
4 years ago
i-robot
a92c54b206
!30496 [Fallback] Suppport scipy module.
Merge pull request !30496 from huangbingjian/support_scipy
4 years ago
i-robot
a8686ae3d9
!18827 [assistant][ops]New operator implementation, include LFWDataset
Merge pull request !18827 from Wangsong95/lfw_dataset
4 years ago
zx
2132f62d98
[feat][assistant][I3J6VQ] add new data operator LFW
4 years ago
huangxinjing
896daee845
[AutoParallel]Fix insert error for the mirror
4 years ago
yangzhenzhang
43e6e16da3
check platform for resizebilinear
4 years ago
i-robot
5deccfe64b
!30260 [MD][AutoTune] Re-enable AT for non-sink models
Merge pull request !30260 from danishfarid/re-enable_AT
4 years ago
i-robot
0341d96dd6
!30469 add shard function to support part of the graph executed in auto_parallel under pynative mode
Merge pull request !30469 from wangjun/0223_pp
4 years ago
huangbingjian
a69d13bc44
[Fallback] Suppport scipy module.
4 years ago
i-robot
cfe0f76d2b
!30491 ut for allgather fusion
Merge pull request !30491 from jiahongQian/master
4 years ago
danishfarid
6c4697fc8b
reenable AT for non-sink
tests re-enable
code check fix 1
remove self
lint fixing
lint fix 2
remove test as per req
4 years ago
wangjun
24d448239c
add pynative_parallel
4 years ago
i-robot
981eae461a
!30118 自动优化器并行特性
Merge pull request !30118 from zhuyuxiao/I4S85V
4 years ago
jiahongQian
25f57505bf
ut for allgather fusion
4 years ago
i-robot
bbcfbce9e0
!29997 [Auto parallel] [MoE] Support data_parallel + expert_parallel
Merge pull request !29997 from Xiaoda/124-moe-changes
4 years ago
zhuyuxiao
d0e0e305d3
good
4 years ago
i-robot
f2130e7434
!30483 [AutoParallel]Pipeline Automatic detection Opt
Merge pull request !30483 from lichen/pipeline_opt_detection
4 years ago
yao_yf
e21f878e14
adasum ut fix
4 years ago
Xiaoda Zhang
b714451937
implementing expert_parallel+data_parallel in MoE:
1) extending _Linear's input as 4-dimension tensor: [outer_batch, expert_dim, -1, hidden], and _Liner's BatchMatMul becomes BatchMatMul(4_dim_tensor, 3_dim_tensor);
2) configuring the _Linear's BatchMatMul sharding strategy as [[dp, ep, 1, 1], [ep, 1, mp]];
3) introducing a new parameter 'expert_parallel' in TransformerOpParallelConfig, creating a new class MoEParallelConfig to include 'data_parallel', 'model_parallel' and 'expert_parallel';
4) changing parallel config for FeedForward, TransformerEncoderLayer, TransformerDecoderLayer.
4 years ago
wangshengnan12@huawei.com
acbefd80ea
pipeline_opt_detection
4 years ago
i-robot
2a00ffd3b1
!30259 [MD]Update set_autotune_enable API to add save filepath
Merge pull request !30259 from cathwong/ckw_at_save_api
4 years ago
Cathy Wong
46e223e569
[MD] Update set_autotune_enable API to add save filepath
4 years ago
i-robot
81260a2319
!30466 takedown test_auto_parallel_adasum.py to ensure stability, again
Merge pull request !30466 from yanghaoran/master
4 years ago
i-robot
14393503b7
!30431 allreduce allgather fusion
Merge pull request !30431 from jiahongQian/master
4 years ago
yanghaoran
71d6b7d506
takedown test_auto_parallel_adasum.py to ensure stability, again
4 years ago
i-robot
2e8eac8341
!30367 auto_parallel_adasum_support_data_parallel
Merge pull request !30367 from yao_yf/auto_parallel_adasum_support_data_parallel
4 years ago
jiahongQian
8a2151d8bb
allgather reducescatter fusion
4 years ago
i-robot
eeb731ae3e
!18738 [assistant][ops]New operator implementation, include LSUNDataset
Merge pull request !18738 from Wangsong95/lsun_dataset
4 years ago
i-robot
5bee7156b9
!30369 add_virtualdataset_ut
Merge pull request !30369 from lilei/add_virtualdataset_ut
4 years ago
i-robot
0f24b679ec
!29819 Add GlobalNorm Search
Merge pull request !29819 from huangxinjing/add_global_norm_search
4 years ago
i-robot
abb89d3f06
!29705 [MD][Offload] Support Multi-Column Datasets and Map Column Names to Index
Merge pull request !29705 from markuskunej/offload_multi_col_ds
4 years ago
i-robot
6edc6fccee
!30189 Optimize error message while outmost network input is wrong
Merge pull request !30189 from zhangzhaoju/master_outmost
4 years ago
yao_yf
19236b1a70
auto parallel adasum support data parallel and hybrid parallel
4 years ago
huangxinjing
092ba035e3
Add global norm parallel support
4 years ago
liyong
3e1c668247
fix mnist dataset
4 years ago
yanghaoran
bfe139b662
takedonw test_auto_parallel_adasum.py to ensure gate stability
4 years ago
lilei
bc62e24d94
add_virtualdataset_ut
4 years ago
i-robot
ce5e93593d
!30301 modify error log
Merge pull request !30301 from lianliguang/change-error-log
4 years ago
i-robot
94c8c6355c
!30294 auto_parallel_adasum_checks_and_ut.
Merge pull request !30294 from yao_yf/auto_parallel_adasum_checks_and_ut
4 years ago
i-robot
926fb6f5ae
!25559 [feat] [assistant] [I48O7X, I48O6R] Add Asin, AsinGrad
Merge pull request !25559 from 桂宁馨/Asin
4 years ago
zhangzhaoju
e000889265
Fix issue I4KVTZ
The description of "outmost network" in error message instead by
method/function name
4 years ago
i-robot
95f5075371
!30190 [AUTO_PARALLEL]Fix CI error
Merge pull request !30190 from huangxinjing/fix_ci_error
4 years ago
lianliguang
efabace125
change error log
4 years ago
i-robot
908beb6b2f
!26420 [feat] [assistant] [I48O90, I48O4P] Add AsinhGrad,Asinh
Merge pull request !26420 from 桂宁馨/Asinh
4 years ago
i-robot
f6215c614c
!30064 modify ckpt type error
Merge pull request !30064 from changzherui/mod_ckpt_type_err
4 years ago
huangxinjing
5e325ac336
[AUTO_PARALLEL]Fix insert nodes error
4 years ago
i-robot
a5242d33e1
!30181 support TensorShape ops
Merge pull request !30181 from lianliguang/change-dynshape-to-tensorshape
4 years ago
i-robot
7386612515
!29820 moe_topk routing
Merge pull request !29820 from wangshengnan123/moe_topk_routing
4 years ago