chenjianping
c9f4889d1b
check mpi_adapter instance
5 years ago
mindspore-ci-bot
4c6bff75af
!1393 Gpu Support AdamWeightDecay optimizer fusion
Merge pull request !1393 from chenweifeng/adam_weight_decay
5 years ago
mindspore-ci-bot
0a368494db
!2499 HostAllGather and HostReduceScatter change to internal interface
Merge pull request !2499 from yihuaijie/master
5 years ago
mindspore-ci-bot
cc2655e599
!2428 building _ms_mpi with mpi_interface
Merge pull request !2428 from chenjianping/host_reduce
5 years ago
chenjianping
343889cdb7
building _ms_mpi with mpi_interface
5 years ago
lichenever
1c34c8c970
fix_hccl_to_support_big_tensor
5 years ago
mindspore-ci-bot
69c15470a5
!2492 Add an output to apply_proximal_adagrad op register
Merge pull request !2492 from YuJianfeng/master
5 years ago
mindspore-ci-bot
ce9c68d8da
!2505 [Code Reivew] fix code review content
Merge pull request !2505 from jjfeing/master
5 years ago
wilfChen
034d2ea2aa
Gpu Adam Fusion
5 years ago
mindspore-ci-bot
cc5a2408e6
!2491 add cpu kernel profiling log
Merge pull request !2491 from kisnwang/add-cpu-kernel-profiling
5 years ago
mindspore-ci-bot
8870956954
!2441 add fake quant test case for gpu
Merge pull request !2441 from chenzhongming/master
5 years ago
jjfeing
32442d6246
code review stage 2
5 years ago
Yi Huaijie
2eb739de6e
change HostAllGather and HostReduceScatter to internal interface
5 years ago
kswang
dc29cfcbf7
add cpu profile time
5 years ago
jiangjinsheng
a1e148cb4d
vm for LRN and LRNGrad
5 years ago
mindspore-ci-bot
5b14292f69
!2140 Implementation of mindspore debugger
Merge pull request !2140 from ShidaHe/debugger_dev
5 years ago
chenzomi
8873f9dc7e
add fake quant test case for gpu
5 years ago
mindspore-ci-bot
64f8bc5278
!2457 GPU layernorm momentum support fp16
Merge pull request !2457 from VectorSL/gpu-type-expand
5 years ago
mindspore-ci-bot
a2cd05339f
!2180 Gpu Gelu kernel support fp16
Merge pull request !2180 from chenweifeng/gelu-fp16
5 years ago
yujianfeng
304427cd93
Add an output to apply_proximal_adagrad op register
5 years ago
mindspore-ci-bot
fe620f2195
!2434 fix atomic addr clean
Merge pull request !2434 from jjfeing/master
5 years ago
mindspore-ci-bot
ac78ac9700
!2297 add vm support for operators include MatrixDiag, MatrixDiagPart etc
Merge pull request !2297 from jiangjinsheng/vm_matrixdiag
5 years ago
mindspore-ci-bot
106f798092
!2451 fix perchannel num_channels not set bug and adjust quant.py params order
Merge pull request !2451 from 王东旭/master
5 years ago
Shida He
4c056855e0
Implementation for mindspore debugger
5 years ago
mindspore-ci-bot
7c358ca464
!2460 optimize cpu reduce gradient
Merge pull request !2460 from kisnwang/optimize-cpu-reduce-gradient
5 years ago
VectorSL
da71a9148e
gpu momentum layernorm layernormgrad support fp16
5 years ago
kswang
b867d6d60a
parallel reduce sparse gradient
5 years ago
jiangjinsheng
017ff492af
vm for MatrixDiag,MatrixDiagPart.MatrixSetDiag
5 years ago
wangdongxu
02584fe2c7
fix perchannel num_channels not set bug and adjust quant.py params order
5 years ago
WilliamLian
88d3dc6606
fix code review
5 years ago
mindspore-ci-bot
12e7ddae0a
!2386 Add multiple process for computation of optimizer in cpu
Merge pull request !2386 from YuJianfeng/master
5 years ago
mindspore-ci-bot
f1a69de0b6
!2405 change some comment name in the whole project
Merge pull request !2405 from chenzhongming/master
5 years ago
jjfeing
d535f3a289
fix atomic clean
5 years ago
yujianfeng
6c1bc1c6a9
Add multiple process for computation of sparse optimizers
5 years ago
chenzomi
a834a6308e
change some comment name in the whole project
5 years ago
jiangjinsheng
e71599b5ca
vm for lin_space
5 years ago
mindspore-ci-bot
c9b8a8da0a
!2369 add cpu reduce op and cpu softmax_cross_entropy_with_logits op
Merge pull request !2369 from baihuawei/reduce
5 years ago
mindspore-ci-bot
8b5166e569
!2393 fix bug of hccl kernel info and change cast's kernel info
Merge pull request !2393 from lianliguang/fix-bug-of-merge-cast-to-op-and-gene-hccl-kernel-info
5 years ago
gong chen
a6dfa281ea
Init GraphKernel.
- It provides a unified style to express graph and kernel for user.
- It provides a unified IR to represent graph and kernel for developer.
- It breaks the boundary between graph and kernel.
- It provides more opportunities to do compile optimization.
5 years ago
mindspore-ci-bot
a3e7b30457
!2384 Add split fission pass
Merge pull request !2384 from YuJianfeng/split
5 years ago
mindspore-ci-bot
9969a9b07d
!2394 sync cpu output if needed
Merge pull request !2394 from kisnwang/add-cpu-outputsync
5 years ago
mindspore-ci-bot
53654f94f2
!2056 Enable new control sink
Merge pull request !2056 from zhoufeng/enable-new-control-sink
5 years ago
mindspore-ci-bot
8a913d7586
!2157 mindspore server inference
Merge pull request !2157 from hangq/master
5 years ago
kswang
97216f7404
sync cpu output
5 years ago
WilliamLian
5f9d2759ee
fix bug of hccl kernel info and change cast's kernel info
5 years ago
mindspore-ci-bot
d57decc8a3
!2338 Gpu Minimum & Maximum kernels support int32
Merge pull request !2338 from chenweifeng/nezha
5 years ago
zhoufeng
bbbfaa2441
enable new control sink
Signed-off-by: zhoufeng <zhoufeng54@huawei.com>
5 years ago
yujianfeng
7ad877a948
Add Split fission pass
5 years ago
hangangqiang
dfbb232468
mindspore server inference
5 years ago
mindspore-ci-bot
bdb7d0fd01
!2355 fix cpu sub op input not match
Merge pull request !2355 from dengwentao/fix_sub_op
5 years ago