limingqi107
2a5d9f964b
unified runtime support the sub graph sink
4 years ago
ZPaC
4c1ef4cef6
Fix ps cache broadcast error.
4 years ago
i-robot
b271aa7a25
!24969 device address add the key of device
Merge pull request !24969 from limingqi107/new_actor_runtime
4 years ago
maning202007
8ee33007b1
add const to the data_ptr definition
Fix pclint for debugger
fix bugs
fix duplicated code issue
Fix the depth of method over 4 issue
fix comments
fix CI errors
remove redundancy arguments
4 years ago
limingqi107
be100476d6
device address add the key of device
4 years ago
i-robot
4268cf5de0
!25026 support dynamic shape for cpu kernel of mae net
Merge pull request !25026 from fangzehua/dynamic_mae_1014
4 years ago
fangzehua
53009276d1
add cpu dynamic memory
4 years ago
i-robot
093aa3b7a0
!24867 Fix AllGather Recompute Cycled Graph for GPU
Merge pull request !24867 from Cononlly/master
4 years ago
cononlly
1b119ced2b
Add a pass for allgather_recompute
4 years ago
limingqi107
19330b6f82
unified runtime support graph sink output update
4 years ago
i-robot
c5585a9451
!24467 Add GraphKernelFlags into namespace mindspore::graphkernel
Merge pull request !24467 from DeshiChen/0918_namespace
4 years ago
dayschan
7502345c8f
Add GraphKernelFlags into namespace mindspore::graphkernel
4 years ago
LaiYongqiang
79599546ee
add ascend memory adapter for ascend memory management
4 years ago
i-robot
06b0beced7
!22731 GraphKernel supports CPU
Merge pull request !22731 from DeshiChen/0901_graphkernel_cpu
4 years ago
dayschan
32ecd8ee79
GraphKernel supports CPU
only Linux system is supported now.
change the default value of `ENABLE_AKG` to off, and controlled by option `-K`.
the `ENABLE_AKG` is auto enabled when `ENABLE_GPU` or `ENABLE_D` is on.
since now, we can use `ENABLE_AKG` to control the compilation of graphkernel
and akg codes.
fix usage description for option "-K", it should be "[-K on|off]".
LLVM is required by akg for cpu kernels, so AKG for cpu is default disabled now.
4 years ago
limingqi107
0a9cbd2220
imporve performance of CPU LSTMGrad
4 years ago
i-robot
5bd7e4def9
!24156 Move InsertCastCpu before BuildKernel
Merge pull request !24156 from DeshiChen/0926_insert_cast_cpu
4 years ago
i-robot
18960015b0
!24229 add dynamic shape attr pass
Merge pull request !24229 from hwjiaorui/dynamic-shape-pass
4 years ago
i-robot
0eccf93423
!23757 Add Support for AlltoAll in GPU
Merge pull request !23757 from Cononlly/master
4 years ago
hwjiaorui
eaa3dc57b7
add dynamic shape attr pass
4 years ago
dayschan
5ed932e5fd
Move InsertCastCpu before BuildKernel
It's necessary to immediately insert cast for the nodes that dtype was changed in SelectKernel,
instead of after BuildKernel. otherwise, some fusion passes cannot work as expected,
like Graph-Kernel optimization.
By now, the shape of LSTM's reserved output will be changed in InitKernel (called by BuildKernel),
and this output is only used by its gradient operator, so we can ignore the reserved output in this pass.
4 years ago
i-robot
811dd1778b
!23959 fix link error if compiled with -s on when import mindspore
Merge pull request !23959 from yanghaitao/yht_sync_shangyong
4 years ago
cononlly
a8088525d5
Add AlltoAll GPU operator and IR pass and tests
4 years ago
lizhenyu
0b6d7e4adb
fix code review warning
4 years ago
i-robot
b95a6d459d
!23906 add the interface of ascend supporting unified time
Merge pull request !23906 from limingqi107/new_actor_runtime
4 years ago
yanghaitao1
7f216aac41
fix link error if import mindspore
4 years ago
limingqi107
3e19107875
add the interface of ascend supporting unified time
4 years ago
dayschan
73ecb0c4ab
Add namespace mindspore::graphkernel
* change the graphkernel's passes code(backend/optimizer/graph_kernel/*) to the
new namespace `mindspore::graphkernel`, to decouple from `mindspore::opt`.
* change the original `mindspore::opt::graphkernel` to `mindspore::graphkernel::inner` (graph_kernel/model)
* change the original `mindspore::opt::expanders` to `mindspore::graphkernel::expanders` (graph_kernel/expanders)
TODO: modify graph_kernel_flags, kernel_compiler/akg/
4 years ago
limingqi107
16940f2d95
can not appear exception in the atexit
4 years ago
i-robot
fb58cfa273
!23414 update pynative profiling codes and delete some useless code
Merge pull request !23414 from lvchangquan/master
4 years ago
lvchangquan
b908f40d94
delete some useless codes and update pynative profiling codes
4 years ago
sabrinasun
1b8b453373
add remain isolation for dump
4 years ago
limingqi107
859aa42b41
unified runtime add data prepare actor and optimize code
4 years ago
sabrinasun
220245f592
add security isolation to online and offline debugger
4 years ago
liangyongxiong
a03a8ccd8e
fix compile errors with RDR include files
4 years ago
i-robot
01ade5857d
!22123 update pynative profiling codes in fp and bp
Merge pull request !22123 from lvchangquan/profiling_formal
4 years ago
i-robot
438169e0b9
!22365 RDR adapts for CPU dynamic memory allocation
Merge pull request !22365 from liangyongxiong/fix
4 years ago
lvchangquan
bab311f0c7
update pynative profiling codes in fp and bp
4 years ago
liangyongxiong
9f6b015032
RDR adapts for CPU dynamic memory allocation
4 years ago
zuochuanyong
8fa68ebd98
fix Conv3D precision under fp16
4 years ago
ms_yan
36a8886ca2
Revert "[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset"
This reverts commit b077aa1cab .
Revert "[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset"
This reverts commit 4e6f7dc97d .
delete pass_registry_test.cc
comment hiai_nlu_model_multi.pb related line
4 years ago
djc
b077aa1cab
[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset
4 years ago
djc
4e6f7dc97d
[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset
4 years ago
limingqi107
02430820ed
unified runtime optimize code
4 years ago
lvchangquan
e8d9803258
add profiling codes based on ascend and gpu in pynative mode
4 years ago
limingqi107
5766234426
code review of gpu backend
4 years ago
ckey_Dou
d293c5eb26
using kernel pool to share the compiling results when running on multi
cards
4 years ago
limingqi107
09ceb1cf5b
unified runtime pclint fixed
4 years ago
limingqi107
fe36a158af
unified runtime pclint fixed
4 years ago
i-robot
59322811fd
!20648 clean pclint warnings
Merge pull request !20648 from zyli2020/mindrt_debug
4 years ago