mindspore-ci-bot
55a277d1de
!7572 GPU reduce precision fix
Merge pull request !7572 from VectorSL/fix-reduce-pre
5 years ago
mindspore-ci-bot
93c03ca4df
!7565 Decouple GraphKernel ops from akg op register
Merge pull request !7565 from DeshiChen/1012_gen_json_from_anf
5 years ago
mindspore-ci-bot
08dad79529
!7533 fix bug of gpu‘s kernel setter
Merge pull request !7533 from lianliguang/master
5 years ago
VectorSL
bbcdd81d1b
fix reduce precision: deal tuplegetitem and param
5 years ago
dayschan
f429fe7888
Decouple GraphKernel ops from akg op register
1. Extract op info from AnfNode when generating composite operators.
2. Ignore the op register info when selecting kernel.
3. Update akg submodule.
4. Remove akg gpu op registers that added for GraphKernel.
5. Add an interface `IsNodeInGraphKernel` into `AnfRuntimeAlgorithm`, next to the `IsGraphKernel`.
6. Sort included fils for the clang-format rule is changed.
TODO: Remove the json generator in GraphKernelExpander, it's ok to call this interface.
5 years ago
mindspore-ci-bot
89cd882077
!7520 Critical path performance optimization
Merge pull request !7520 from chenweifeng/resnet50-profiling
5 years ago
wilfChen
b8e1c03cdc
Critical path performance optimization
5 years ago
Lianliguang
61f3c134c0
fix bug of gpu refresh parameter & valuenode's format info when kernel selecting && do not refresh ZN_LSTM format for valuenode
5 years ago
Harshvardhan Gupta
744355a005
remove dbg args from runtime and remove needless argument from IsWatchpoint
5 years ago
mindspore-ci-bot
687e9126aa
!7267 dump kernel inputs/outputs based on the input_output field of the config file.
Merge pull request !7267 from AdelShafiei/kernels_dump
5 years ago
mindspore-ci-bot
4424873e7e
!7256 GPU add reduce precision
Merge pull request !7256 from VectorSL/reduceprecision
5 years ago
Adel Shafiei
e4ebdc599e
Added input/output kernel dump support based on the config file
5 years ago
lichen_101010
678962d718
implementation for dumping const values for each step
fix no output address error
address clang-format error
address some comments
clang_format check
5 years ago
VectorSL
ccab6f88d5
gpu add reduce precision:int64->int32
5 years ago
caifubi
d3b978147f
Ascend Dynamic Shape
5 years ago
mindspore-ci-bot
f90529a4a8
!6996 In debugger, Add output filter for BatchNorm to avoid loading dirty output values
Merge pull request !6996 from lichen_101010/batch_norm
5 years ago
mindspore-ci-bot
14d1c4390b
!7086 Only dump the specified kernels in dump config file, GPU mode
Merge pull request !7086 from AdelShafiei/kernels_dump
5 years ago
Adel Shafiei
aa33ae12fb
Only dump the specified kernel in GPU mode
5 years ago
mindspore-ci-bot
21c5607fca
!6971 cudnn inplace optimizer
Merge pull request !6971 from chenweifeng/tensoradd_inplace
5 years ago
Harshvardhan Gupta
7c5e0541ba
load inputs before suspending execution in dbg
5 years ago
wilfChen
b420b6cda7
cudnn inplace optimizer
5 years ago
lichen_101010
ea8c8361d6
add output filter for BatchNorm operator
Add some comments
addressed John's comments
CI check
CI check part2
5 years ago
mindspore-ci-bot
129261220e
!6499 refactor debugger code in main mindspore functions
Merge pull request !6499 from john_tzanakakis/master_ms1_grpc
5 years ago
liubuyu
fc7ea3db78
code warning clean
5 years ago
John Tzanakakis
0e0d7eda19
code refactor
5 years ago
VectorSL
9e6bd72e04
fix codex
5 years ago
mindspore-ci-bot
0cbb4f6330
!6317 Add relu6 gpu
Merge pull request !6317 from TuDouNi/relu6
5 years ago
dayschan
3c2da3197f
Fix review_bot and codedex problems
5 years ago
root
d11239f70f
Add relu6 gpu
5 years ago
mindspore-ci-bot
234b7a6033
!6131 fix gpu pynative eval
Merge pull request !6131 from panfengfeng/fix_pynative_gpu_eval
5 years ago
caifubi
372c2e7951
Combine Async Dump and E2E Dump
5 years ago
panfengfeng
db558cd01b
fix pynative gpu eval
5 years ago
lichen_101010
dffa61b228
send info when training is done
5 years ago
mindspore-ci-bot
ae7e8a7492
!5964 GeneratorDataset continue to send remain data when release gpu resource for GPU
Merge pull request !5964 from anzhengqi/I1TK0O-close-gpu2
5 years ago
mindspore-ci-bot
7152fe04be
!5783 GraphKernel supports GPU
Merge pull request !5783 from DeshiChen/graph_kernel_1.0
5 years ago
anzhengqi
d2ba20bdbc
continue to send remain data when release gpu resource
5 years ago
dayschan
37a48f6aac
GraphKernel supports GPU
1. Update akg submodule
2. Refactor akg_kernel_build, akg_ascend_kernel_build, akg_gpu_kernel_build
3. Add akg_kernel_json_decoder to support converting kernel_json to AnfNode.
4. Add GraphKernel Cost Model. (mindspore/_extends/graph_kernel)
5. Add some GraphKernel passes to GpuSession, move these passes to backend/optimizer/graph_kernel.
6. Add global id for ir files.
7. Fix bug in ConstInputToAttr.
5 years ago
VectorSL
197b00331e
tensorcore can only used in Volta or newer devices
5 years ago
Zhang Qinghua
c0070d3d49
Use the unified Execute function to run Graph or Single Op Graph.
5 years ago
mindspore-ci-bot
417cce87ee
!5761 search nvcc in entire PATH
Merge pull request !5761 from zyli2020/bug_fix
5 years ago
lizhenyu
aeb833f212
search nvcc in entire PATH
5 years ago
limingqi107
5058e844cd
gpu inceptionv3 optimize
5 years ago
lizhenyu
c3d6918649
add kernel select after optimize pass
5 years ago
mindspore-ci-bot
981bfbfa74
!5190 Add API to query GPU queue size and capacity
Merge pull request !5190 from anthonyaje/gpu_queue_size
5 years ago
limingqi107
7ec2f6a550
clear graph output address in graph destructor
5 years ago
limingqi107
341200ab97
gpu kernel_info_setter code review
5 years ago
fary86
fcbb3e0edc
Refactor ms_context implementation
5 years ago
lizhenyu
6fdd52080d
add mode black list checker
5 years ago
mindspore-ci-bot
82ae946fe6
!5362 gpu GoogleNet performance optimize
Merge pull request !5362 from VectorSL/slice
5 years ago
VectorSL
f95fe92ad3
slice support nhwc
5 years ago