yuchaojie
d2cb3aa1c2
unify AvgPoolGrad
5 years ago
mindspore-ci-bot
50542793c8
!12077 optimize gpu backend logger
From: @wilfchen
Reviewed-by: @cristoval,@limingqi107
Signed-off-by: @limingqi107
5 years ago
chendongsheng
db0a6f1e19
replace ps-lite
5 years ago
wilfChen
58196f1faf
modify gpu backend logger
5 years ago
mindspore-ci-bot
5524280075
!12550 [MS][RDR] recording func_graph in pipeline and task debug info
From: @louie5
Reviewed-by:
Signed-off-by:
5 years ago
louei5
9a48405a41
recording func_graph in pipeline and task debug information
5 years ago
Zhang Qinghua
9b26c210f4
Eliminate all useless nodes related to UpdateStates.
5 years ago
mindspore-ci-bot
5224241ca7
!12577 fix bug for dynamic_shape_depends
From: @zhupuxu
Reviewed-by: @jjfeing,@zhoufeng54
Signed-off-by: @zhoufeng54
5 years ago
zhupuxu
b15d182cd2
fix bug for dynamic_shape_depends
Signed-off-by: zhupuxu <zhupuxu@huawei.com>
5 years ago
louei5
3d540a515a
add task_debug_info recorder
5 years ago
mindspore-ci-bot
c74b4d5d73
!12412 nlp perf(Pynative): change memory sync mode from synchronous to asynchronous in SyncHostToDevice
From: @zuochuanyong
Reviewed-by:
Signed-off-by:
5 years ago
luopengting
7fe901491a
support recording stream execute order
1. add ExecNode and StreamExecOrderRecorder
2. add json format and json dump for stream exec order
3. add stream exec order api
4. add in CMakeLists
5 years ago
zuochuanyong
3fa26683ac
nlp perf(Pynative): change memory sync mode from synchronous to asynchronous in SyncHostToDevice
5 years ago
luopengting
b4d56f6358
add functions to check for directory and filename
1. add validation for directory and filename
2. refactor the file operations about path
3. add in CMakeLists
4. add logger in nn_ops
5 years ago
He Wei
7d9a783993
[auto-monad] Support side-effects by auto-monad
The basic idea is to exploit data dependencies to control the execution order
of side-effect operations while keeping the semantics of ANF unchanged.
The ControlDepend primitive is removed and two primitives are added:
1. UpdateState:
```
a = Assign(para, value)
```
becomes:
```
a = Assign(para, value, u)
u = UpdateState(u, a)
```
2. Load:
```
x = Add(para, value)
```
becomes:
```
p = Load(para, u)
x = Add(p, value)
u = UpdateState(u, p)
```
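The two rewrites above can be illustrated with a small, self-contained sketch. This is not MindSpore code: the tuple-based op representation, the `thread_state` helper, the initial state name `u0`, and the fresh-name scheme (`u1`, `p1`, ...) are all assumptions made for illustration. It only shows how a state token might be threaded through side-effect ops so that plain data dependency fixes their order.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical op representation: (output name, primitive name, input names).
Op = Tuple[str, str, Tuple[str, ...]]


def thread_state(ops: List[Op], params: Set[str]) -> List[Op]:
    """Rewrite ops so that side effects are ordered through a state token."""
    out: List[Op] = []
    state = "u0"  # assumed name of the initial state token
    counters: Dict[str, int] = {"u": 0, "p": 0}

    def fresh(prefix: str) -> str:
        # Generate a new unique name such as u1, u2 or p1, p2.
        counters[prefix] += 1
        return f"{prefix}{counters[prefix]}"

    for name, prim, inputs in ops:
        if prim == "Assign":
            # a = Assign(para, value)  ->  a = Assign(para, value, u)
            #                              u = UpdateState(u, a)
            out.append((name, prim, inputs + (state,)))
            new_state = fresh("u")
            out.append((new_state, "UpdateState", (state, name)))
            state = new_state
            continue

        # Any other op: read each parameter through Load(para, u).
        loads: List[str] = []
        new_inputs: List[str] = []
        for arg in inputs:
            if arg in params:
                loaded = fresh("p")
                out.append((loaded, "Load", (arg, state)))
                loads.append(loaded)
                new_inputs.append(loaded)
            else:
                new_inputs.append(arg)
        out.append((name, prim, tuple(new_inputs)))

        # Record each load in the state so later writes depend on it.
        for loaded in loads:
            new_state = fresh("u")
            out.append((new_state, "UpdateState", (state, loaded)))
            state = new_state
    return out


if __name__ == "__main__":
    program = [
        ("a", "Assign", ("para", "value")),
        ("x", "Add", ("para", "value")),
    ]
    for out_name, prim, args in thread_state(program, {"para"}):
        print(f"{out_name} = {prim}({', '.join(args)})")
```

Unlike the snippets in the commit message, which reuse the single name `u`, the sketch gives every new state a fresh name so that the dependency chain is explicit.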
5 years ago
lizhenyu
6649153c49
add input data type check for ps cache mode
5 years ago
mindspore-ci-bot
5cd97cef87
!12198 Revert Pull Request !11074
From: @xsmq
Reviewed-by: @c_34,@lilongfei15
Signed-off-by: @lilongfei15
5 years ago
mindspore-ci-bot
2cfd913e2b
!11830 Add cpu operator profiling and heterogeneous timeline
From: @gzhcv
Reviewed-by:
Signed-off-by:
5 years ago
gongxiaoqing
7f538b51e7
Revert 'Pull Request !11074 : replace tdt with acltdt interface'
5 years ago
mindspore-ci-bot
c2582dcab9
!11074 replace tdt with acltdt interface
From: @ms_yan
Reviewed-by: @lilongfei15
Signed-off-by:
5 years ago
mindspore-ci-bot
9382fe64d8
!11954 modify hccl op number per-stream
From: @zhoufeng54
Reviewed-by: @lilongfei15,@xsmq
Signed-off-by: @xsmq
5 years ago
jjfeing
502be04491
upgrade 0204
5 years ago
gzhcv
2617f6c620
Add cpu profiler
5 years ago
mindspore-ci-bot
cf40d56aa7
!12057 fix overflow print bug
From: @jjfeing
Reviewed-by:
Signed-off-by:
5 years ago
mindspore-ci-bot
025fec757c
!12087 optimize the memory alloc error info
From: @limingqi107
Reviewed-by: @kisnwang,@cristoval
Signed-off-by: @cristoval
5 years ago
mindspore-ci-bot
0d24b45a71
!12017 add ps cache mode check
From: @zyli2020
Reviewed-by: @cristoval,@limingqi107
Signed-off-by: @limingqi107
5 years ago
limingqi107
366f3e668d
optimize the memory alloc error info
5 years ago
lizhenyu
c89e00e13b
add ps cache sparse mode check
5 years ago
jjfeing
84dcb184b1
fix call log print
5 years ago
mindspore-ci-bot
db3fe6a461
!11928 move dynamic_shape_depends to backend
From: @zhupuxu
Reviewed-by:
Signed-off-by:
5 years ago
zhupuxu
667e227f0d
move dynamic shape to backend
Signed-off-by: zhupuxu <zhupuxu@huawei.com>
5 years ago
jjfeing
bff8bf0253
remove unused log
5 years ago
ms_yan
293f81128d
init add acltdt handle create and destroy
add hostpush part modify
optimize previous code
provide aclhandle access method
modify CMakeList format
add device_id parameter into TransferNode
5 years ago
zhoufeng
409c50ae94
modify hccl op number per-stream
Signed-off-by: zhoufeng <zhoufeng54@huawei.com>
5 years ago
liubuyu
8ff36d3079
reset cube compute mode
5 years ago
mindspore-ci-bot
a9dfb07cf1
!11660 get cube size by data type
From: @liubuyu
Reviewed-by:
Signed-off-by:
5 years ago
mindspore-ci-bot
ff8e69fbb1
!11779 modify profiling end log id from 255 to 65535
From: @yanghaitao1
Reviewed-by: @ouwenchang,@yelihua
Signed-off-by: @yelihua
5 years ago
mindspore-ci-bot
de54033dfc
!11896 fix bug about split node optimizer.
From: @liu_xiao_93
Reviewed-by: @wuxuejian,@liangchenghui
Signed-off-by: @liangchenghui
5 years ago
mindspore-ci-bot
ce89cc5e8b
!11761 Change GatherV2 to Gather (merge from r1.1 to master)
From: @liangzhibo
Reviewed-by:
Signed-off-by:
5 years ago
mindspore-ci-bot
f0dd5563ea
!11857 Add RDR module compile switch
From: @louie5
Reviewed-by: @ouwenchang
Signed-off-by:
5 years ago
liuxiao93
314c0cbdb2
split optimizer
5 years ago
yanghaitao1
f67a68e4fa
modify profiling log id from 255 to 65535
5 years ago
liuxiao93
68e9be725e
split optimizer
5 years ago
laiyongqiang
b465f21d90
NonTask Split Process
5 years ago
mindspore-ci-bot
9fa0499fa0
Change GatherV2 to Gather r1.1 to master
5 years ago
louei5
762feb591c
Add RDR module compiling switch and check saving filename
5 years ago
mindspore-ci-bot
4364abc7ee
!11798 Support RunOpsInGraph on CPU&GPU in pynative mode
From: @HulkTang
Reviewed-by:
Signed-off-by:
5 years ago
lizhenyu
f17534af08
ps cache support sparse
5 years ago
liubuyu
4a3cbda322
get cube size by dtype
5 years ago
tanghuikang
6f2cd92aba
Support RunOpsInGraph on CPU&GPU in pynative mode
5 years ago