lizhenyu
6f6a0dfd7a
[bug_fix] GPU distributed training core dumps when memory is not enough
5 years ago
mindspore-ci-bot
1f9d034e53
!8276 Try to cache object to accelerate "AllocKernelDynamicRes" and "FreeKernelDynamicRes"
From: @tronzhang
Reviewed-by: @ckey_dou,@gaoxiong1
Signed-off-by: @gaoxiong1
5 years ago
tronzhang
1cf2482ba5
try to get address pointer from cache
5 years ago
wilfChen
e4e9362bd0
gpu support dynamic shape
5 years ago
lizhenyu
ae295120a7
fix codedex warning
5 years ago
Harshvardhan Gupta
744355a005
remove dbg args from runtime and remove needless argument from IsWatchpoint
5 years ago
laiyongqiang
ebc18d1edb
log improvement
5 years ago
wilfChen
b420b6cda7
cudnn inplace optimizer
5 years ago
limingqi107
740254161e
gpu code review
5 years ago
laiyongqiang
e13beaa7ed
clean codex warning
5 years ago
mindspore-ci-bot
c543db0585
!6180 clean codex warning in memreuse
Merge pull request !6180 from laiyongqiang/codex
5 years ago
laiyongqiang
4063a69846
clean codex warning
5 years ago
John Tzanakakis
b0a7ebdeb0
enable debugger by default and set correct log message severity
5 years ago
laiyongqiang
0c1e391556
add atomic clean op for every communication op's input
5 years ago
liangzelang
7e6189a076
fix global_step error in multi-cases
5 years ago
zhoufeng
663278112f
optimize code compile performance
Signed-off-by: zhoufeng <zhoufeng54@huawei.com>
5 years ago
liubuyu
d81862a916
decoupling core and context
5 years ago
mindspore-ci-bot
f87983833d
!3636 Fix opt unnecessary catch
Merge pull request !3636 from Kang/opt
5 years ago
wuyongkang
f1e639a86d
Fix opt unnecessary catch
5 years ago
limingqi107
af39ca8252
modify the wrong word
5 years ago
laiyongqiang
d99786e938
fix refnode input type assign
5 years ago
lizhenyu
c67e562373
refine GPU memory swap performance
5 years ago
mindspore-ci-bot
cfafdcbcf0
!3246 refine gpu memory swap performance
Merge pull request !3246 from zyli2020/refine_gpu_mem_swap
5 years ago
lizhenyu
3ace75509b
refine gpu memory swap performance
5 years ago
mindspore-ci-bot
72a2b7d496
!3117 not reuse ref node input's memory
Merge pull request !3117 from laiyongqiang/refnode_input
5 years ago
laiyongqiang
acba03b191
not reuse ref node input's memory
5 years ago
liubuyu
76dc80e7b7
Unified code style
5 years ago
mindspore-ci-bot
8a8de7e062
!3171 gpu fix the graph of 'nop node + depend + node'
Merge pull request !3171 from limingqi107/master
5 years ago
laiyongqiang
b570dec7ab
add right align border for communication op's single output
5 years ago
limingqi107
a596dd6e43
gpu fix the graph of 'nop node + depend + node'
5 years ago
mindspore-ci-bot
25ee322ba3
!2966 reuse communication op output's memory
Merge pull request !2966 from laiyongqiang/hcom_memreuse
5 years ago
laiyongqiang
68c78ab6bb
reuse communication op output's memory
5 years ago
lichen_101010
6bb2182134
Add partial memory reuse support to debugger
move pre-execution of debugger from rungraph to build/compile graph
support partial mem reuse for a scope of nodes
set default mem reuse to be true for debugger
remove some redundant lines
remove redundant code and fix a bug in supporting partial no mem reuse for a scope of nodes
resolve CI errors
Solve CI errors
solve cpplint errors
solve CI build error
manually fix the CI compile UT error
Optimize code for mem reuse support
Debug optimization of debugger memory reuse
debug code for debugger memory reuse part2
address clang-format errors
Switch memory reuse on and off based on environment variable
Fix typo
Fix typo
Load watchpoint value only
fix bugs
Addressed comments from lupengcheng
fix typo
Fix typo
fix CI errors
refactor some code
fix typo
addressed comments from Canadian teammates
remove locking from TensorLoader
fix CI errors
add lock to tensor_loader
fix rebase-to-master conflict
fix rebase conflicts
fix rebase conflicts part 2
fix rebase conflicts part 3
5 years ago
liubuyu
43c79eb853
mindspore path adjust
5 years ago