ZPaC
a4d2a3e6a1
Add collective operation impl.
4 years ago
ZPaC
2b7429c5d2
1.Purge not used API.
2.Adapt for collective_init.h
4 years ago
ZPaC
055a493903
Add implement of collective manager API
4 years ago
ZPaC
e243c1e021
Add collective API for python layer
4 years ago
ZPaC
87057fdc27
Dynamic load ompi and nccl
4 years ago
i-robot
11bec4d85e
!25995 Add nvidia collective lib implementation.
Merge pull request !25995 from ZPaC/dir-of-distributed
4 years ago
ZPaC
9e18bad126
Add nvidia collective lib implementation.
4 years ago
limingqi107
46d4a900c0
unified runtime support the auto monad in the subgraphs connecting scene
4 years ago
ZPaC
0b732431a1
Add MPI implementation.
4 years ago
ZPaC
e5789757d6
Add collective adapter
4 years ago
limingqi107
2a5d9f964b
unified runtime support the sub graph sink
4 years ago
ZPaC
4c1ef4cef6
Fix ps cache broadcast error.
4 years ago
i-robot
b271aa7a25
!24969 device address add the key of device
Merge pull request !24969 from limingqi107/new_actor_runtime
4 years ago
maning202007
8ee33007b1
add const to the data_ptr definition
Fix pclint for debugger
fix bugs
fix duplicated code issue
Fix the depth of method over 4 issue
fix comments
fix CI errors
remove redundancy arguments
4 years ago
limingqi107
be100476d6
device address add the key of device
4 years ago
i-robot
4268cf5de0
!25026 support dynamic shape for cpu kernel of mae net
Merge pull request !25026 from fangzehua/dynamic_mae_1014
4 years ago
fangzehua
53009276d1
add cpu dynamic memory
4 years ago
cononlly
1b119ced2b
Add a pass for allgather_recompute
4 years ago
dayschan
7502345c8f
Add GraphKernelFlags into namespace mindspore::graphkernel
4 years ago
i-robot
18960015b0
!24229 add dynamic shape attr pass
Merge pull request !24229 from hwjiaorui/dynamic-shape-pass
4 years ago
i-robot
0eccf93423
!23757 Add Support for AlltoAll in GPU
Merge pull request !23757 from Cononlly/master
4 years ago
hwjiaorui
eaa3dc57b7
add dynamic shape attr pass
4 years ago
i-robot
811dd1778b
!23959 fix link error if compiled with -s on when import mindspore
Merge pull request !23959 from yanghaitao/yht_sync_shangyong
4 years ago
cononlly
a8088525d5
Add AlltoAll GPU operator and IR pass and tests
4 years ago
lizhenyu
0b6d7e4adb
fix code review warning
4 years ago
yanghaitao1
7f216aac41
fix link error if import mindspore
4 years ago
dayschan
73ecb0c4ab
Add namespace mindspore::graphkernel
* change the graphkernel's passes code(backend/optimizer/graph_kernel/*) to the
new namespace `mindspore::graphkernel`, to decouple from `mindspore::opt`.
* change the original `mindspore::opt::graphkernel` to `mindspore::graphkernel::inner` (graph_kernel/model)
* change the original `mindspore::opt::expanders` to `mindspore::graphkernel::expanders` (graph_kernel/expanders)
TODO: modify graph_kernel_flags, kernel_compiler/akg/
4 years ago
limingqi107
16940f2d95
can not appear exception in the atexit
4 years ago
i-robot
fb58cfa273
!23414 update pynative profiling codes and delete some useless code
Merge pull request !23414 from lvchangquan/master
4 years ago
lvchangquan
b908f40d94
delete some useless codes and update pynative profiling codes
4 years ago
limingqi107
859aa42b41
unified runtime add data prepare actor and optimize code
4 years ago
sabrinasun
220245f592
add security isolation to online and offline debugger
4 years ago
liangyongxiong
a03a8ccd8e
fix compile errors with RDR include files
4 years ago
i-robot
01ade5857d
!22123 update pynative profiling codes in fp and bp
Merge pull request !22123 from lvchangquan/profiling_formal
4 years ago
i-robot
438169e0b9
!22365 RDR adapts for CPU dynamic memory allocation
Merge pull request !22365 from liangyongxiong/fix
4 years ago
lvchangquan
bab311f0c7
update pynative profiling codes in fp and bp
4 years ago
liangyongxiong
9f6b015032
RDR adapts for CPU dynamic memory allocation
4 years ago
zuochuanyong
8fa68ebd98
fix Conv3D precision under fp16
4 years ago
ms_yan
36a8886ca2
Revert "[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset"
This reverts commit b077aa1cab .
Revert "[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset"
This reverts commit 4e6f7dc97d .
delete pass_registry_test.cc
comment hiai_nlu_model_multi.pb related line
4 years ago
djc
b077aa1cab
[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset
4 years ago
djc
4e6f7dc97d
[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset
4 years ago
lvchangquan
e8d9803258
add profiling codes based on ascend and gpu in pynative mode
4 years ago
ckey_Dou
d293c5eb26
using kernel pool to share the compiling results when running on multi
cards
4 years ago
i-robot
59322811fd
!20648 clean pclint warnings
Merge pull request !20648 from zyli2020/mindrt_debug
4 years ago
lizhenyu
9404bfb9d9
fix pclint warnings
4 years ago
Margaret_wangrui
f9a064e464
Add ref user to UpdateState to ensure the order
4 years ago
lizhenyu
209dc5f584
fix codedex warnings
4 years ago
limingqi107
ae46d0beb9
fix bug of CPU not support more then 2G copy
4 years ago
lizhenyu
856e5d5a77
Remove nop node optimizer for cpu
4 years ago
limingqi107
e761655a42
actor runtime support CPU dump
4 years ago