Adel Shafiei
a322184057
debugger backend compability support
5 years ago
gzhcv
dead0bbf80
fix the bug of index out of range in step trace feature
5 years ago
lizhenyu
6f6a0dfd7a
[bug_fix]GPU distributed training occur core dump when memory is not enough
5 years ago
ZPaC
db3a2d60cb
GPU supports p2p nccl interfaces
5 years ago
gzhcv
3d6c1fc5f6
fix the bug of step_trace
5 years ago
mindspore-ci-bot
418c8b5e3f
!8550 add gpu step trace
From: @gzhcv
Reviewed-by:
Signed-off-by:
5 years ago
gzhcv
6f6b56bfe1
add gpu step_trace
5 years ago
mindspore-ci-bot
a3066105d5
!7081 Mindspore debugger supporting multigraph
From: @lichen_101010
Reviewed-by:
Signed-off-by:
5 years ago
lichen_101010
1b6265fa43
Debugger multi-graph support implementation
Other Contributor: Adel Shafiei, John Tzanakakis
5 years ago
John Tzanakakis
d45ef4cacb
debugger won't read unused slots for BatchNorm
5 years ago
mindspore-ci-bot
fb3a7b4d5a
!8502 GPU optimize check sm warning log
From: @VectorSL
Reviewed-by: @cristoval
Signed-off-by: @cristoval
5 years ago
VectorSL
b60be1a023
optimize warning log
5 years ago
wilfChen
e3a7b7ab92
gpu support dynamic shape
5 years ago
mindspore-ci-bot
1f9d034e53
!8276 Try to cache object to accelerate "AllocKernelDynamicRes" and "FreeKernelDynamicRes"
From: @tronzhang
Reviewed-by: @ckey_dou,@gaoxiong1
Signed-off-by: @gaoxiong1
5 years ago
tronzhang
1cf2482ba5
try to get address pointer from cache
5 years ago
wilfChen
e4e9362bd0
gpu suppor dynamic shape
5 years ago
Harshvardhan Gupta
727d424553
support watchpoints on weights and bias, add support advanced watchpoints
5 years ago
John Tzanakakis
531c2952b3
fixes input dump files missing on GPU
5 years ago
mindspore-ci-bot
c6246d7a7e
!7908 add reduce precision in pynative mode
Merge pull request !7908 from chujinjin/add_reduce_precision_in_pynative_mode
5 years ago
mindspore-ci-bot
2abe941e0b
!7938 fix gpu queue partial copy issue
Merge pull request !7938 from chenweifeng/gpu-queue-copy
5 years ago
chujinjin
9197d9f2ee
add reduce precision in pynative mode
5 years ago
wilfChen
7e1cc4a11a
fix gpu queue partial copy issue
5 years ago
ZPaC
5059d8c3f9
Set gpu device id for multiple threads
5 years ago
mindspore-ci-bot
93d394ff9e
!7706 Fixed the misssing input/output tensor dump if there are tensors with the same name
Merge pull request !7706 from AdelShafiei/missingdumpinput
5 years ago
VectorSL
509b25ef1e
gpu nhwc
5 years ago
Adel Shafiei
b8fc889368
fixed the problem with missing inputs dumps when using GPU
5 years ago
mindspore-ci-bot
55a277d1de
!7572 GPU reduce precision fix
Merge pull request !7572 from VectorSL/fix-reduce-pre
5 years ago
mindspore-ci-bot
93c03ca4df
!7565 Decouple GraphKernel ops from akg op register
Merge pull request !7565 from DeshiChen/1012_gen_json_from_anf
5 years ago
mindspore-ci-bot
08dad79529
!7533 fix bug of gpu‘s kernel setter
Merge pull request !7533 from lianliguang/master
5 years ago
VectorSL
bbcdd81d1b
fix reduce precision: deal tuplegetitem and param
5 years ago
dayschan
f429fe7888
Decouple GraphKernel ops from akg op register
1. Extract op info from AnfNode when generating composite operators.
2. Ignore the op register info when selecting kernel.
3. Update akg submodule.
4. Remove akg gpu op registers that added for GraphKernel.
5. Add an interface `IsNodeInGraphKernel` into `AnfRuntimeAlgorithm`, next to the `IsGraphKernel`.
6. Sort included fils for the clang-format rule is changed.
TODO: Remove the json generator in GraphKernelExpander, it's ok to call this interface.
5 years ago
mindspore-ci-bot
89cd882077
!7520 Critical path performance optimization
Merge pull request !7520 from chenweifeng/resnet50-profiling
5 years ago
wilfChen
b8e1c03cdc
Critical path performance optimization
5 years ago
Lianliguang
61f3c134c0
fix bug of gpu refresh parameter & valuenode's format info when kernel selecting && do not refresh ZN_LSTM format for valuenode
5 years ago
Harshvardhan Gupta
744355a005
remove dbg args from runtime and remove needless argument from IsWatchpoint
5 years ago
mindspore-ci-bot
687e9126aa
!7267 dump kernel inputs/outputs based on the input_output field of the config file.
Merge pull request !7267 from AdelShafiei/kernels_dump
5 years ago
mindspore-ci-bot
4424873e7e
!7256 GPU add reduce precision
Merge pull request !7256 from VectorSL/reduceprecision
5 years ago
Adel Shafiei
e4ebdc599e
Added input/output kernel dump support based on the config file
5 years ago
lichen_101010
678962d718
implementation for dumping const values for each step
fix no output address error
address clang-format error
address some comments
clang_format check
5 years ago
VectorSL
ccab6f88d5
gpu add reduce precision:int64->int32
5 years ago
caifubi
d3b978147f
Ascend Dynamic Shape
5 years ago
mindspore-ci-bot
f90529a4a8
!6996 In debugger, Add output filter for BatchNorm to avoid loading dirty output values
Merge pull request !6996 from lichen_101010/batch_norm
5 years ago
mindspore-ci-bot
14d1c4390b
!7086 Only dump the specified kernels in dump config file, GPU mode
Merge pull request !7086 from AdelShafiei/kernels_dump
5 years ago
Adel Shafiei
aa33ae12fb
Only dump the specified kernel in GPU mode
5 years ago
mindspore-ci-bot
21c5607fca
!6971 cudnn inplace optimizer
Merge pull request !6971 from chenweifeng/tensoradd_inplace
5 years ago
Harshvardhan Gupta
7c5e0541ba
load inputs before suspending execution in dbg
5 years ago
wilfChen
b420b6cda7
cudnn inplace optimizer
5 years ago
lichen_101010
ea8c8361d6
add output filter for BatchNorm operator
Add some comments
addressed John's comments
CI check
CI check part2
5 years ago
mindspore-ci-bot
129261220e
!6499 refactor debugger code in main mindspore functions
Merge pull request !6499 from john_tzanakakis/master_ms1_grpc
5 years ago
liubuyu
fc7ea3db78
code warning clean
5 years ago