lizhenyu
|
0beaf288b3
|
fix code review warnings
|
4 years ago |
hwjiaorui
|
e97df3a58f
|
clean code
|
4 years ago |
ckey_Dou
|
d293c5eb26
|
using kernel pool to share the compiling results when running on multi cards
|
4 years ago |
wangjunbao
|
f9d99e97d2
|
fix ci warning for not handling function return of RDR
|
4 years ago |
limingqi107
|
06a6e8d186
|
fix the test case of CPU dump
|
4 years ago |
limingqi107
|
008dbf2da0
|
fix device bug of the input empty
|
4 years ago |
limingqi107
|
70d156562a
|
add dump pb and nccl protected
|
4 years ago |
linqingke
|
84b26853a6
|
fix mem_unit_size overflow bug.
|
4 years ago |
yuchaojie
|
bf13a8a01e
|
support group conv2d in pynative
|
4 years ago |
limingqi107
|
3cb5075e38
|
fix bug of memory swap
|
4 years ago |
i-robot
|
a87fd539d4
|
!18726 Fix cudnn malloc memory failed.
Merge pull request !18726 from linqingke/gpu_memory
|
4 years ago |
linqingke
|
d0e2fabf1f
|
fix cudnn malloc memory failed.
|
4 years ago |
7347157+joylvliang@user.noreply.gitee.com
|
6efc47853f
|
fix_bug_of_resnet50_512_batch_size_memory_not_enough
|
4 years ago |
yuchaojie
|
689158f79b
|
FracZ format conversion when conv2d group > 1
|
4 years ago |
i-robot
|
538fb5dd8a
|
!18535 add RDR export point for SyncStream failed
Merge pull request !18535 from liangyongxiong/master
|
4 years ago |
i-robot
|
368aff4714
|
!18588 fix bug of CPU actor runtime
Merge pull request !18588 from limingqi107/actor_runtime2
|
4 years ago |
liangyongxiong
|
7fd56d6244
|
add RDR export point for SyncStream failed
|
4 years ago |
i-robot
|
26ed1427e9
|
!18521 Fix collective log
Merge pull request !18521 from ZPaC/multi-host-use-hostid
|
4 years ago |
i-robot
|
35b15bda0b
|
!18465 clean code for memory reuse
Merge pull request !18465 from laiyongqiang/mem_clean
|
4 years ago |
limingqi107
|
b25d00731c
|
fix bug of CPU actor runtime
|
4 years ago |
ZPaC
|
3013e08d1a
|
Fix collective log
|
4 years ago |
LaiYongqiang
|
d4d6fb940d
|
memory reuse code clean
|
4 years ago |
limingqi107
|
c664b7f37b
|
add device id info when memory alloc failed
|
4 years ago |
i-robot
|
8301489439
|
!18329 Optimize collective log
Merge pull request !18329 from ZPaC/optimize-collective-log
|
4 years ago |
i-robot
|
5a34e74551
|
!17505 Update gpu memory reuse.
Merge pull request !17505 from linqingke/gpu_memory
|
4 years ago |
lizhenyu
|
887d96063b
|
unify runtime support pynative hook
|
4 years ago |
ZPaC
|
9211e05f7f
|
optimize collective log
|
4 years ago |
linqingke
|
40b3d923ab
|
add memory unit size setting.
update set unit.
|
4 years ago |
i-robot
|
9852cced86
|
!17839 Fix device memory can not release in PyNative mode
Merge pull request !17839 from zyli2020/fix_issue_defect
|
4 years ago |
i-robot
|
71bb69695f
|
!12151 Add UNet Model for GPU
Merge pull request !12151 from fanrb/unet
|
4 years ago |
fan1997
|
be3d4e6fd3
|
1.Optimize bias add grad kernel
2.Optimize slice grad kernel
3.Add Unet GPU Model
|
5 years ago |
lizhenyu
|
f3e5d67512
|
fix core dump when destroy device context in PyNative mode
|
4 years ago |
i-robot
|
4932854776
|
!17987 fix repeated release device resource of actor runtime
Merge pull request !17987 from limingqi107/actor_runtime
|
4 years ago |
limingqi107
|
e9b0eab177
|
fix repeated release device resource of actor runtime
|
4 years ago |
mindspore-ci-bot
|
8fa9e3e611
|
!17712 fix pclint & codex in profiler
From: @yanghaitao1
Reviewed-by: @ouwenchang,@yelihua
Signed-off-by: @yelihua
|
4 years ago |
i-robot
|
8fe3da0ddc
|
!17819 Add all gather fusion and concat pass for gpu
Merge pull request !17819 from ZPaC/master-add-gpu-all-gather-fusion
|
4 years ago |
yanghaitao1
|
127e4d4068
|
fix profiler pclint&codex
|
4 years ago |
zengzitao
|
43cf630e38
|
fix code_docs for gpu_kernel_runtime.h
|
4 years ago |
ZPaC
|
35b639868d
|
Add all gather fusion and concat pass for gpu
|
4 years ago |
zengzitao
|
31a372da88
|
fix oom bug when open graphkernel flag in network
|
4 years ago |
mindspore-ci-bot
|
83a9fc2939
|
!17466 GPU fix reduce precision
From: @VectorSL
Reviewed-by: @limingqi107,@wilfchen
Signed-off-by: @wilfchen
|
4 years ago |
VectorSL
|
cbe01fc836
|
fix gpu reduce precision
|
4 years ago |
limingqi107
|
d405964aab
|
actor runtime supports allreduce multi-stream
|
4 years ago |
wilfChen
|
0ad757f74c
|
trt operator
|
4 years ago |
mindspore-ci-bot
|
2173d08ba1
|
!16978 fix codecheck and pclint
From: @limingqi107
Reviewed-by: @cristoval,@wilfchen
Signed-off-by: @wilfchen
|
4 years ago |
limingqi107
|
c22185d586
|
fix codecheck and pclint
|
4 years ago |
TinaMengtingZhang
|
da6e068ed7
|
fix ci codecheck alarm in master
|
4 years ago |
mindspore-ci-bot
|
9f77a71d30
|
!16803 [GraphKernel]Simplify GetPrevNodeAddr Codes
From: @jiaoy1224
Reviewed-by: @gaoxiong1,@ckey_dou
Signed-off-by: @ckey_dou
|
4 years ago |
lizhenyu
|
2b50100d79
|
Unify runtime support profiling
|
4 years ago |
Yang Jiao
|
6693484ef3
|
simplify getPrevAddr code
|
4 years ago |