mindspore-ci-bot
b13c7a3d48
!3268 refine GPU memory swap performance
Merge pull request !3268 from zyli2020/refine_gpu_mem_swap
5 years ago
mindspore-ci-bot
6b9a61ea82
!3238 Clean codex
Merge pull request !3238 from caifubi/clean-codex
5 years ago
ZPaC
0bc74f28c5
Enable get rank id and size by group
5 years ago
lizhenyu
c67e562373
refine GPU memory swap performance
5 years ago
mindspore-ci-bot
1deaa40938
!3279 fix bug with use trans_data to reduce print time
Merge pull request !3279 from lvchangquan/master
5 years ago
mindspore-ci-bot
28c8a5cc26
!3267 Fix performance issue for wide&deep
Merge pull request !3267 from mamba_ni/master
5 years ago
lvchangquan
a91f076e67
fix bug with use trans_data to reduce print time in graph mode
5 years ago
zongha
226dbde481
fix bug for wide&deep
fix bug
5 years ago
caifubi
e870c3a717
clean codex
5 years ago
mindspore-ci-bot
0aaa2d479c
!3202 cpu conv2d support diff kernelsize
Merge pull request !3202 from kisnwang/cpu-conv2d-support-diff-kernel-size
5 years ago
mindspore-ci-bot
cfafdcbcf0
!3246 refine gpu memory swap performance
Merge pull request !3246 from zyli2020/refine_gpu_mem_swap
5 years ago
kswang
634035827e
cpu conv2d support diff winsize
5 years ago
lizhenyu
3ace75509b
refine gpu memory swap performance
5 years ago
mindspore-ci-bot
72a2b7d496
!3117 not reuse ref node input's memory
Merge pull request !3117 from laiyongqiang/refnode_input
5 years ago
mindspore-ci-bot
bae2f964e5
!3213 Unified code style
Merge pull request !3213 from liubuyu/dev
5 years ago
mindspore-ci-bot
72b23b0fa4
!3174 insert transdata in print process
Merge pull request !3174 from lvchangquan/transdata
5 years ago
laiyongqiang
acba03b191
not reuse ref node input's memory
5 years ago
liubuyu
76dc80e7b7
Unified code style
5 years ago
lvchangquan
7b48a122dd
insert trans_data to reduce time in print process
5 years ago
mindspore-ci-bot
ad5c649e86
!3165 support library cusolver for gpu backend
Merge pull request !3165 from mamba_ni/master
5 years ago
mindspore-ci-bot
55cd091f5e
!3189 Reback memory pool to memory offset
Merge pull request !3189 from JoyLvliang/reback-memory-pool-to-memory-offset
5 years ago
mindspore-ci-bot
8a8de7e062
!3171 gpu fix the graph of 'nop node + depend + node'
Merge pull request !3171 from limingqi107/master
5 years ago
lvliang
d9e5b97d3b
reback-memory-pool-to-memory-offset
5 years ago
caifubi
4ad4d583ad
Fix Security Bug
5 years ago
limingqi107
a596dd6e43
gpu fix the graph of 'nop node + depend + node'
5 years ago
zongha
82412429cf
support cusolverDn
fix clang format
5 years ago
mindspore-ci-bot
251683096a
!3045 Gpu support TopK kernel
Merge pull request !3045 from chenweifeng/sort
5 years ago
mindspore-ci-bot
7f8a753621
!3112 Remove build option -S
Merge pull request !3112 from caifubi/data-dump-build
5 years ago
mindspore-ci-bot
2c3f02b653
!3081 Enable stream sync when context is graph but run pynative
Merge pull request !3081 from JoyLvliang/enable-stream-sync-when-context-is-graph-but-run-in-pynative
5 years ago
caifubi
cb8b5dbd76
Remove build option -S
5 years ago
mindspore-ci-bot
25ee322ba3
!2966 reuse communication op output's memory
Merge pull request !2966 from laiyongqiang/hcom_memreuse
5 years ago
laiyongqiang
68c78ab6bb
reuse communication op output's memory
5 years ago
mindspore-ci-bot
45d18f5e86
!3108 Fix bug of graph id changing in data dump release
Merge pull request !3108 from caifubi/data-dump-new
5 years ago
caifubi
236d65b7d3
Fix graph id changed in data dump release
5 years ago
lvliang
b48592ce79
enable-stream-sync-when-context-is-graph-but-run-in-pynative
5 years ago
mindspore-ci-bot
8300802b95
!3089 Fix data dump device address cannot access
Merge pull request !3089 from caifubi/data-dump-new
5 years ago
mindspore-ci-bot
691f588927
!3059 GPU supports to create groups for auto-parallel
Merge pull request !3059 from ZPaC/gpu-supports-to-create-groups
5 years ago
mindspore-ci-bot
130cc29603
!2931 Ascend control flow not split graphs
Merge pull request !2931 from zhoufeng/liantiao1
5 years ago
wilfChen
c10e07734c
gpu support TopK kernel
5 years ago
zhoufeng
439d6d618f
Control flow not split graph
Signed-off-by: zhoufeng <zhoufeng54@huawei.com>
5 years ago
caifubi
3f03480efe
fix dump device address cannot access
5 years ago
ZPaC
ab23776f5f
GPU supports to create groups for auto parallel.
5 years ago
lichen_101010
6bb2182134
Add partial memory reuse support to debugger
move pre-execution of debugger from rungraph to build/compile graph
support partial mem reuse for a scope of nodes
set default mem reuse to be true for debugger
remove some redundant lines
remove redundant code and fix a bug for supporting partial no mem reuse a scope of nodes
resolve CI errors
Solve CI errors
solve cpplint errors
solve CI build error
manually fix the CI compile UT error
Optimize code for mem reuse support
Debug optimization of debugger memory reuse
debug code for debugger memory reuse part2
address clang-format errors
Switch memory reuse on and off based on environment variable
Fix typo
Fix typo
Load watchpoint value only
fix bugs
Addressed comments from lupengcheng
fix typo
Fix typo
fix CI errors
refactor some code
fix typo
addressed comments from Canadian teammates
remove locking from TensorLoader
fix CI errors
add lock to tensor_loader
fix rebase-to-master conflict
fix rebase conflicts
fix rebase conflicts part 2
fix rebase conflicts part 3
5 years ago
He Wei
ee1510da41
Eliminate circular dependency between 'ir' and 'device/kernel'
5 years ago
liubuyu
43c79eb853
mindspore path adjust
5 years ago