caifubi
d3b978147f
Ascend Dynamic Shape
5 years ago
mindspore-ci-bot
f90529a4a8
!6996 In debugger, Add output filter for BatchNorm to avoid loading dirty output values
Merge pull request !6996 from lichen_101010/batch_norm
5 years ago
mindspore-ci-bot
14d1c4390b
!7086 Only dump the specified kernels in dump config file, GPU mode
Merge pull request !7086 from AdelShafiei/kernels_dump
5 years ago
Adel Shafiei
aa33ae12fb
Only dump the specified kernel in GPU mode
5 years ago
mindspore-ci-bot
21c5607fca
!6971 cudnn inplace optimizer
Merge pull request !6971 from chenweifeng/tensoradd_inplace
5 years ago
Harshvardhan Gupta
7c5e0541ba
load inputs before suspending execution in dbg
5 years ago
wilfChen
b420b6cda7
cudnn inplace optimizer
5 years ago
lichen_101010
ea8c8361d6
add output filter for BatchNorm operator
Add some comments
addressed John's comments
CI check
CI check part2
5 years ago
mindspore-ci-bot
129261220e
!6499 refactor debugger code in main mindspore functions
Merge pull request !6499 from john_tzanakakis/master_ms1_grpc
5 years ago
liubuyu
fc7ea3db78
code warning clean
5 years ago
John Tzanakakis
0e0d7eda19
code refactor
5 years ago
VectorSL
9e6bd72e04
fix codex
5 years ago
mindspore-ci-bot
0cbb4f6330
!6317 Add relu6 gpu
Merge pull request !6317 from TuDouNi/relu6
5 years ago
dayschan
3c2da3197f
Fix review_bot and codedex problems
5 years ago
root
d11239f70f
Add relu6 gpu
5 years ago
mindspore-ci-bot
234b7a6033
!6131 fix gpu pynative eval
Merge pull request !6131 from panfengfeng/fix_pynative_gpu_eval
5 years ago
caifubi
372c2e7951
Combine Async Dump and E2E Dump
5 years ago
panfengfeng
db558cd01b
fix pynative gpu eval
5 years ago
lichen_101010
dffa61b228
send info when training is done
5 years ago
mindspore-ci-bot
ae7e8a7492
!5964 GeneratorDataset continue to send remain data when release gpu resource for GPU
Merge pull request !5964 from anzhengqi/I1TK0O-close-gpu2
5 years ago
mindspore-ci-bot
7152fe04be
!5783 GraphKernel supports GPU
Merge pull request !5783 from DeshiChen/graph_kernel_1.0
5 years ago
anzhengqi
d2ba20bdbc
continue to send remain data when release gpu resource
5 years ago
dayschan
37a48f6aac
GraphKernel supports GPU
1. Update akg submodule
2. Refactor akg_kernel_build, akg_ascend_kernel_build, akg_gpu_kernel_build
3. Add akg_kernel_json_decoder to support converting kernel_json to AnfNode.
4. Add GraphKernel Cost Model. (mindspore/_extends/graph_kernel)
5. Add some GraphKernel passes to GpuSession, move these passes to backend/optimizer/graph_kernel.
6. Add global id for ir files.
7. Fix bug in ConstInputToAttr.
5 years ago
VectorSL
197b00331e
tensorcore can only used in Volta or newer devices
5 years ago
Zhang Qinghua
c0070d3d49
Use the unified Execute function to run Graph or Single Op Graph.
5 years ago
mindspore-ci-bot
417cce87ee
!5761 search nvcc in entire PATH
Merge pull request !5761 from zyli2020/bug_fix
5 years ago
lizhenyu
aeb833f212
search nvcc in entire PATH
5 years ago
limingqi107
5058e844cd
gpu inceptionv3 optimize
5 years ago
lizhenyu
c3d6918649
add kernel select after optimize pass
5 years ago
mindspore-ci-bot
981bfbfa74
!5190 Add API to query GPU queue size and capacity
Merge pull request !5190 from anthonyaje/gpu_queue_size
5 years ago
limingqi107
7ec2f6a550
clear graph output address in graph destructor
5 years ago
limingqi107
341200ab97
gpu kernel_info_setter code review
5 years ago
fary86
fcbb3e0edc
Refactor ms_context implementation
5 years ago
lizhenyu
6fdd52080d
add mode black list checker
5 years ago
mindspore-ci-bot
82ae946fe6
!5362 gpu GoogleNet performance optimize
Merge pull request !5362 from VectorSL/slice
5 years ago
VectorSL
f95fe92ad3
slice support nhwc
5 years ago
limingqi107
ff6b64a598
gpu GoogleNet performance optimize
5 years ago
limingqi107
7029a861d7
add kernel release resource
5 years ago
mindspore-ci-bot
5b1cf18cb9
!5055 prepare to support int64
Merge pull request !5055 from lirongzhen1/int64
5 years ago
lizhenyu
57b27c9fb2
code refine for BN docs
5 years ago
lirongzhen1
531ad4df70
prepare to support int64
5 years ago
anthonyaje
09a99cf80b
Add Size() and Capacity() in gpu queue.
5 years ago
mindspore-ci-bot
8c7444ab47
!5140 add cuda path checker
Merge pull request !5140 from zyli2020/add_cuda_path_check
5 years ago
lizhenyu
551879240c
add cuda path checker
5 years ago
lizhenyu
5d6f7204d3
[bugfix]LSTM SyncDeviceToHost failed
5 years ago
lizhenyu
1becddf3a4
[bugfix]SyncDeviceToHost failed when device address size is zero
5 years ago
mindspore-ci-bot
a245ee665e
!4934 fix nccl kernel memory align bug
Merge pull request !4934 from zyli2020/bug_fix
5 years ago
lizhenyu
fcaf86f5d9
fix nccl kernel memory align bug
5 years ago
qianlong
113619f1ca
Revert "Add Size() and Capacity() in gpu queue."
This reverts commit e2b346d5af .
5 years ago
mindspore-ci-bot
b69b1ca8a8
!4830 [gpu] fix continuous allreduces bug
Merge pull request !4830 from yuchaojie/gpu_allreduce
5 years ago