i-robot
01ade5857d
!22123 update pynative profiling codes in fp and bp
Merge pull request !22123 from lvchangquan/profiling_formal
4 years ago
i-robot
438169e0b9
!22365 RDR adapts for CPU dynamic memory allocation
Merge pull request !22365 from liangyongxiong/fix
4 years ago
lvchangquan
bab311f0c7
update pynative profiling codes in fp and bp
4 years ago
liangyongxiong
9f6b015032
RDR adapts for CPU dynamic memory allocation
4 years ago
zuochuanyong
8fa68ebd98
fix Conv3D precision under fp16
4 years ago
ms_yan
36a8886ca2
Revert "[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset"
This reverts commit b077aa1cab .
Revert "[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset"
This reverts commit 4e6f7dc97d .
delete pass_registry_test.cc
comment hiai_nlu_model_multi.pb related line
4 years ago
djc
b077aa1cab
[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset
4 years ago
djc
4e6f7dc97d
[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset
4 years ago
lvchangquan
e8d9803258
add profiling codes based on ascend and gpu in pynative mode
4 years ago
ckey_Dou
d293c5eb26
using kernel pool to share the compiling results when running on multi
cards
4 years ago
i-robot
59322811fd
!20648 clean pclint warnings
Merge pull request !20648 from zyli2020/mindrt_debug
4 years ago
lizhenyu
9404bfb9d9
fix pclint warnings
4 years ago
Margaret_wangrui
f9a064e464
Add ref user to UpdateState to ensure the order
4 years ago
lizhenyu
209dc5f584
fix codedex warnings
4 years ago
limingqi107
ae46d0beb9
fix bug of CPU not support more then 2G copy
4 years ago
lizhenyu
856e5d5a77
Remove nop node optimizer for cpu
4 years ago
limingqi107
e761655a42
actor runtime support CPU dump
4 years ago
lizhenyu
a487f8c897
Optimize PyNative Performance
4 years ago
lizhenyu
dec4c98f9c
[bugfix] PyNative of new runtime support non single op
4 years ago
lizhenyu
a3d04728ee
Remove useless SyncHostToDevice for PyNative mode
4 years ago
i-robot
f2c45a6172
!18044 Fit Gpu LoopCount for profiler module
Merge pull request !18044 from gzhcv/FitLoopCount
4 years ago
Parastoo Ashtari
6ed17d52b1
Add debugger to new unified GPU runtime
4 years ago
gzhcv
001985ca25
Fit Gpu LoopCount for profiler module
4 years ago
lizhenyu
3e295ce0ba
update device context key
4 years ago
i-robot
8ccc71ff72
!18047 actor runtime memory optimize of communication op
Merge pull request !18047 from limingqi107/actor_runtime2
4 years ago
lizhenyu
f3e5d67512
fix core dump when destroy device context in PyNative mode
4 years ago
i-robot
7da28cef6f
!17873 [MS][LOG] Make directorys when the path set by GLOG_log_dir is not exist
Merge pull request !17873 from louie5/master
4 years ago
limingqi107
9d2bf9cf6b
actor runtime memory optimize of communication op
4 years ago
limingqi107
7f634d12f0
actor runtime support GraphKrenel
4 years ago
louei5
5f51d03962
Make directory when the path set by GLOG_log_dir is not exist.
4 years ago
louei5
44ff50f6aa
make rdr support recording gpu memory info in Actor framwork
4 years ago
limingqi107
21d168e980
optimize the performance of actor runtimie dynamic shape
4 years ago
mindspore-ci-bot
3fd22fde0b
!17642 fix the coredump of actor runtime singleton instance deconstruction
From: @limingqi107
Reviewed-by: @cristoval,@wilfchen
Signed-off-by: @wilfchen
4 years ago
mindspore-ci-bot
a3b04a78af
!17604 bugfix for summary and pynative bp
From: @zyli2020
Reviewed-by: @cristoval,@limingqi107
Signed-off-by: @limingqi107
4 years ago
limingqi107
5b63f8f2bc
fix the coredump of actor runtime singleton instance deconstruction
4 years ago
mindspore-ci-bot
9aae606590
!17434 dynamic shape for unify runtime
From: @zyli2020
Reviewed-by: @limingqi107,@cristoval
Signed-off-by: @cristoval
4 years ago
lizhenyu
3d11311ec1
bugfix for summary and pynative bp
4 years ago
lizhenyu
82dab96449
unified runtime supports dynamic shape
4 years ago
limingqi107
d405964aab
actor runtimie supports allreduce multi-stream
4 years ago
lizhenyu
2b50100d79
Unify runtime support profiling
4 years ago
lizhenyu
b3fbdf9d65
unify runtime for PyNative distributed mode
4 years ago
lizhenyu
ef03c0e826
support transform to DAG dynamicly
5 years ago
limingqi107
57a14f55e8
actor runtime code review modify
4 years ago
limingqi107
fba1dd8f2f
add the continue memory alloc of communication kernel for actor runtime
4 years ago
dayschan
771e3f61f3
Clean GraphKernel's codes from frontend
1. set class GraphKernel as deprecated, and treat it as Cell
2. set class InplaceAssign as deprecated, suggested using Assign instead.
3. set op_selector as deprecated, removed the _selected_ops and _selected_grad_ops, replaced with real operations
4. removed the two passes of GraphKernel from frontend
5. removed the GraphKernel's codes from other modules
5 years ago
limingqi107
2f5b786daa
add dump info and fix process bug of actor runtime
5 years ago
lizhenyu
25541f803e
[unify runtime]remove HideNopNode after SetKernelInfo
5 years ago
lizhenyu
a87b141cf9
init device address for anf node
5 years ago
lizhenyu
c57a9edb9d
add graph compiler
5 years ago
limingqi107
98ce7c2039
add the impl of runtime actors
5 years ago