yuchaojie
da4f6c51dc
code check clean
4 years ago
djc
b077aa1cab
[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset
4 years ago
djc
4e6f7dc97d
[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset
4 years ago
limingqi107
5766234426
code review of gpu backend
4 years ago
limingqi107
8c690e3d18
unified runtime codedex fixed
4 years ago
LaiYongqiang
48f4cc33cb
fix code review
4 years ago
limingqi107
70d156562a
add dump pb and nccl protected
4 years ago
linqingke
d0e2fabf1f
fix cudnn malloc memory failed.
4 years ago
LaiYongqiang
d4d6fb940d
memory reuse code clean
4 years ago
linqingke
40b3d923ab
add memory unit size setting.
update set unit.
4 years ago
mindspore-ci-bot
46c2e84e09
!17599 fix static check warning in mem reuse
From: @laiyongqiang
Reviewed-by: @yuchaojie,@zhoufeng54
Signed-off-by: @zhoufeng54
4 years ago
LaiYongqiang
9aa51c64fa
fix static check warning in SOMAS
4 years ago
LaiYongqiang
6514a99423
fix static check warning
4 years ago
wilfChen
d4ceec9d18
pclint
4 years ago
lizhenyu
2df0e89a6a
clean pc-lint warnings
4 years ago
laiyongqiang
de0f6edf5a
support hbm profile
5 years ago
mindspore-ci-bot
0909d69b4e
!14067 fix graph compile time over 3h
From: @zyli2020
Reviewed-by: @cristoval,@limingqi107
Signed-off-by: @limingqi107
5 years ago
lizhenyu
d2598d73be
fix graph compiling cost time over 3h
5 years ago
mindspore-ci-bot
946e403df8
!13701 dynamic memory pool support multi-thread
From: @limingqi107
Reviewed-by: @cristoval,@kisnwang
Signed-off-by: @kisnwang
5 years ago
dayschan
11ee3b1624
add context graph_kernel_flags
used the flag "opt_level" to control GraphKernel,
0 means disabled while non-zero value means enabled.
the default value is controlled by context "enable_graph_kernel",
but if it's also set in "graph_kernel_flags", then the flag will prevail.
supported the whitelist and blacklist operators for GraphKernelExpander.
"enable_expand_ops", "enable_expand_ops_only", "disable_expand_ops".
5 years ago
limingqi107
698bc15881
dynamic memory pool support multi-thread
5 years ago
dingpeifei
87e41aaeee
IR operators of GPU and CPU are unified as batchnorm
5 years ago
He Wei
7d9a783993
[auto-monad] Support side-effects by auto-monad
The basic idea is: exploits data dependency to control the execution order
of side-effect operations, and keep the semantics of ANF unchanged.
The ControlDepend primitive is removed and there are two primitives added:
1. UpdateState:
```
a = Assign(para, value)
```
became:
```
a = Assign(para, value, u)
u = UpdateState(u, a)
```
2. Load:
```
x = Add(para, value)
```
became:
```
p = Load(para, u)
x = Add(p, value)
u = UpdateState(u, p)
```
5 years ago
mindspore-ci-bot
1e13cbb8da
!11241 optimize the dynamic memory alloc
From: @limingqi107
Reviewed-by: @cristoval,@kisnwang
Signed-off-by: @kisnwang
5 years ago
limingqi107
fd9f91b6c9
optimize the dynamic memory alloc
5 years ago
liangzelang
a97ac180ba
opt get_func by uniforming format
5 years ago
lizhenyu
b0caa42a94
support compile on centos
5 years ago
l00591931
1d1cab986d
Codedex change for tensor_loader
5 years ago
baihuawei
7d09dff880
add hccl send recv
5 years ago
lizhenyu
6f6a0dfd7a
[bug_fix]GPU distributed training occur core dump when memory is not enough
5 years ago
mindspore-ci-bot
1f9d034e53
!8276 Try to cache object to accelerate "AllocKernelDynamicRes" and "FreeKernelDynamicRes"
From: @tronzhang
Reviewed-by: @ckey_dou,@gaoxiong1
Signed-off-by: @gaoxiong1
5 years ago
tronzhang
1cf2482ba5
try to get address pointer from cache
5 years ago
wilfChen
e4e9362bd0
gpu suppor dynamic shape
5 years ago
lizhenyu
ae295120a7
fix codedex warning
5 years ago
Harshvardhan Gupta
744355a005
remove dbg args from runtime and remove needless argument from IsWatchpoint
5 years ago
laiyongqiang
ebc18d1edb
log improvement
5 years ago
wilfChen
b420b6cda7
cudnn inplace optimizer
5 years ago
limingqi107
740254161e
gpu code review
5 years ago
laiyongqiang
e13beaa7ed
clean codex warning
5 years ago
mindspore-ci-bot
c543db0585
!6180 clean codex warning in memreuse
Merge pull request !6180 from laiyongqiang/codex
5 years ago
laiyongqiang
4063a69846
clean codex warning
5 years ago
John Tzanakakis
b0a7ebdeb0
enable debugger by default and set correct log message severity
5 years ago
laiyongqiang
0c1e391556
add atomic clean op for every communication op's input
5 years ago
liangzelang
7e6189a076
fix global_step error in multi-cases
5 years ago
zhoufeng
663278112f
optimize code compile performance
Signed-off-by: zhoufeng <zhoufeng54@huawei.com>
5 years ago
liubuyu
d81862a916
decoupling core and context
5 years ago
mindspore-ci-bot
f87983833d
!3636 Fix opt unnecessary catch
Merge pull request !3636 from Kang/opt
5 years ago
wuyongkang
f1e639a86d
Fix opt unnecessary catch
5 years ago
limingqi107
af39ca8252
modify the wrong word
5 years ago
laiyongqiang
d99786e938
fix refnode input type assign
5 years ago