mindspore-ci-bot
ff3438d9c2
!7158 [GraphKernel] Promote complex tensor as graph's input and recorrect getitem index for graph kernels fusion.
Merge pull request !7158 from TronZhang/promotion_const_for_gk
5 years ago
caifubi
e3f47285d7
Fix dump wrong device_id on gpu
5 years ago
tronzhang
c32bf5ac28
promote complex tensor as graph's input and recorrect getitem index for graph kernels fusion.
5 years ago
kswang
11989b5e30
enable async run
5 years ago
mindspore-ci-bot
21c5607fca
!6971 cudnn inplace optimizer
Merge pull request !6971 from chenweifeng/tensoradd_inplace
5 years ago
Harshvardhan Gupta
7c5e0541ba
load inputs before suspending execution in dbg
5 years ago
wilfChen
b420b6cda7
cudnn inplace optimizer
5 years ago
mindspore-ci-bot
e7296ffd69
!6695 GPU change keep_batchnorm_fp32 to false
Merge pull request !6695 from VectorSL/keep
5 years ago
VectorSL
48db7f8c4f
gpu change bncast
5 years ago
mindspore-ci-bot
129261220e
!6499 refactor debugger code in main mindspore functions
Merge pull request !6499 from john_tzanakakis/master_ms1_grpc
5 years ago
mindspore-ci-bot
63d6e139ab
!6561 [Data Dump] gpu dump iteration bug
Merge pull request !6561 from caifubi/dump
5 years ago
caifubi
5d8e493834
Fix GPU dump bug
5 years ago
yelihua
f2f35d2176
fix the bug for sending suspend
5 years ago
caifubi
97be0fbc54
fix gpu dump iteration bug
5 years ago
John Tzanakakis
0e0d7eda19
code refactor
5 years ago
mindspore-ci-bot
0c2e7f5092
!6321 codex warning
Merge pull request !6321 from chenweifeng/codex
5 years ago
mindspore-ci-bot
c57a472748
!5918 optimizer pynative memory
Merge pull request !5918 from flywind/optimizer_pynative_memory
5 years ago
mindspore-ci-bot
af5ebcf1a9
!6232 fix gpu heterogeneous bug
Merge pull request !6232 from baihuawei/embedding
5 years ago
kpy
4338dd266e
optimizer pynative graph memory
5 years ago
wilfChen
aacf7c2e34
codex warning
5 years ago
baihuawei
09a3f2ff5e
fix GPU hete
5 years ago
mindspore-ci-bot
8f6c5e3cc2
!6160 open graph kernel expander opt for gpu
Merge pull request !6160 from r1chardf1d0/expander
5 years ago
r1chardf1d0
88de0cffa9
open graph kernel expander opt for gpu
5 years ago
caifubi
372c2e7951
Combine Async Dump and E2E Dump
5 years ago
mindspore-ci-bot
939737c017
!5970 enable debugger by default and set correct log message severity
Merge pull request !5970 from john_tzanakakis/master_ms1_grpc
5 years ago
John Tzanakakis
b0a7ebdeb0
enable debugger by default and set correct log message severity
5 years ago
zhousiyi
ab74dfc839
refact backend/session from python
5 years ago
dayschan
37a48f6aac
GraphKernel supports GPU
1. Update akg submodule
2. Refactor akg_kernel_build, akg_ascend_kernel_build, akg_gpu_kernel_build
3. Add akg_kernel_json_decoder to support converting kernel_json to AnfNode.
4. Add GraphKernel Cost Model. (mindspore/_extends/graph_kernel)
5. Add some GraphKernel passes to GpuSession, move these passes to backend/optimizer/graph_kernel.
6. Add global id for ir files.
7. Fix bug in ConstInputToAttr.
5 years ago
Zhang Qinghua
c0070d3d49
Use the unified Execute function to run Graph or Single Op Graph.
5 years ago
lizhenyu
c3d6918649
add kernel select after optimize pass
5 years ago
mindspore-ci-bot
1944b8e53b
!5612 Resnet50 pattern Fusion
Merge pull request !5612 from chenweifeng/BatchNormAddReluGrad
5 years ago
limingqi107
7823555e7a
gpu add the pass of remove redundant transpose
5 years ago
wilfChen
5316061fa3
gpu resnet50 fusion
5 years ago
kswang
5614b2ba6c
add tensor sync status
5 years ago
limingqi107
341200ab97
gpu kernel_info_setter code review
5 years ago
fary86
fcbb3e0edc
Refactor ms_context implementation
5 years ago
mindspore-ci-bot
18253952f5
!5481 add mode black list checker
Merge pull request !5481 from zyli2020/master
5 years ago
mindspore-ci-bot
d81b30e6a0
!5312 make backend/optimizer free of pybind
Merge pull request !5312 from xychow/remove-backend-py-dependency-2
5 years ago
lizhenyu
6fdd52080d
add mode black list checker
5 years ago
kswang
756bb6d53f
async run graph
5 years ago
zhousiyi
c25e37e7bf
make backend/optimizer pybind free
5 years ago
chujinjin
c594e3d4a9
fix load input data error when input is a tuple
5 years ago
VectorSL
f36f72b99b
GPU add log in loadinputdata when tensor input != graph input
5 years ago
lizhenyu
1becddf3a4
[bugfix]SyncDeviceToHost failed when device address size is zero
5 years ago
lizhenyu
839ec02542
Add FusedBatchEx support
5 years ago
limingqi107
5b76e8f3d7
gpu add format transform pass
5 years ago
wenchunjiang
a747a1e29e
fix codedex and reviewbot
5 years ago
liubuyu
d81862a916
decoupling core and context
5 years ago
mindspore-ci-bot
40dc0eff6d
!3900 Delete master hard code in pull node
Merge pull request !3900 from ZPaC/master-delete-hard-code-in-pull-node
5 years ago
mindspore-ci-bot
bfbef01063
!3788 optimize update output in PyNative mode on GPU
Merge pull request !3788 from chujinjin/optimize_updateoutput_in_GPU
5 years ago