mindspore-ci-bot
cdb4fb815a
!6072 fix bug of kernel select
Merge pull request !6072 from lianliguang/fix-bug-of-kernel-select
5 years ago
kswang
80f6c084d8
fix build -p on gettime error
5 years ago
lichen_101010
dffa61b228
send info when training is done
5 years ago
WilliamLian
e9d9101ba2
fix bug of kernel select index matched error
5 years ago
John Tzanakakis
b0a7ebdeb0
enable debugger by default and set correct log message severity
5 years ago
mindspore-ci-bot
6568904078
!5973 score + 1 when match output format with pri_format
Merge pull request !5973 from lianliguang/fix-maskrcnn
5 years ago
WilliamLian
8d8981a2fe
do not malloc address when parameter is not used by others
5 years ago
mindspore-ci-bot
ae7e8a7492
!5964 GeneratorDataset continue to send remain data when release gpu resource for GPU
Merge pull request !5964 from anzhengqi/I1TK0O-close-gpu2
5 years ago
WilliamLian
3b1dded345
add output score when matched output format with infer
5 years ago
mindspore-ci-bot
7152fe04be
!5783 GraphKernel supports GPU
Merge pull request !5783 from DeshiChen/graph_kernel_1.0
5 years ago
anzhengqi
d2ba20bdbc
continue to send remain data when release gpu resource
5 years ago
dayschan
37a48f6aac
GraphKernel supports GPU
1. Update akg submodule
2. Refactor akg_kernel_build, akg_ascend_kernel_build, akg_gpu_kernel_build
3. Add akg_kernel_json_decoder to support converting kernel_json to AnfNode.
4. Add GraphKernel Cost Model. (mindspore/_extends/graph_kernel)
5. Add some GraphKernel passes to GpuSession, move these passes to backend/optimizer/graph_kernel.
6. Add global id for ir files.
7. Fix bug in ConstInputToAttr.
5 years ago
mindspore-ci-bot
f480e48271
!5913 add count of graphs using the parameter
Merge pull request !5913 from limingqi107/master
5 years ago
mindspore-ci-bot
0c316e522d
!5866 clean idle mem at proper time
Merge pull request !5866 from liangzelang/fix_global_step_error
5 years ago
limingqi107
d712ac0da0
add count of graphs using the parameter
5 years ago
mindspore-ci-bot
31844bff2e
!5902 GPU add support device
Merge pull request !5902 from VectorSL/format
5 years ago
liangzelang
4c7291078c
clean idle mem in the beginning of ascend session
5 years ago
VectorSL
197b00331e
tensorcore can only be used in Volta or newer devices
5 years ago
kswang
b5e579567c
fix unnecessary tensor sync
5 years ago
wuxuejian
bd527a331d
update aicpu proto and update module: graphengine
Support Dynamic Shape Aicpu Run Package
adapt tensorengin modify, fix ub fusion
5 years ago
Zhang Qinghua
c0070d3d49
Use the unified Execute function to run Graph or Single Op Graph.
5 years ago
mindspore-ci-bot
417cce87ee
!5761 search nvcc in entire PATH
Merge pull request !5761 from zyli2020/bug_fix
5 years ago
mindspore-ci-bot
5ef6c08260
!5756 Move ascend dependent functions to ascend kernel runtime.
Merge pull request !5756 from 张清华/master
5 years ago
lizhenyu
aeb833f212
search nvcc in entire PATH
5 years ago
Zhang Qinghua
5ac60ff202
Move ascend dependent functions to ascend kernel runtime.
5 years ago
limingqi107
5058e844cd
gpu inceptionv3 optimize
5 years ago
lizhenyu
c3d6918649
add kernel select after optimize pass
5 years ago
mindspore-ci-bot
6e463f29d3
!5524 fix stream bug
Merge pull request !5524 from gukecai/stream-bug
5 years ago
mindspore-ci-bot
cd5a1bf1f4
!5625 fix issue of if Test cases running together failed
Merge pull request !5625 from wenchunjiang/fix_memory_leak
5 years ago
kswang
1492442f99
set context when init
5 years ago
mindspore-ci-bot
981bfbfa74
!5190 Add API to query GPU queue size and capacity
Merge pull request !5190 from anthonyaje/gpu_queue_size
5 years ago
wenchunjiang
4faa3304ef
1. replace weak_ptr to shared_ptr in child_graph_order_ and parent_graph_
2. erase kAttrChildGraph after finish using
5 years ago
mindspore-ci-bot
f42b3bbfbc
!5598 add tensor sync status
Merge pull request !5598 from kisnwang/async-run-graph
5 years ago
mindspore-ci-bot
24f00cc6dc
!5617 clear graph output address in graph destructor
Merge pull request !5617 from limingqi107/master
5 years ago
mindspore-ci-bot
402bb4c053
!5461 add atomic clean for every communication op's inputs
Merge pull request !5461 from laiyongqiang/atomic_clean_every_master
5 years ago
limingqi107
7ec2f6a550
clear graph output address in graph destructor
5 years ago
kswang
5614b2ba6c
add tensor sync status
5 years ago
laiyongqiang
0c1e391556
add atomic clean op for every communication op's input
5 years ago
limingqi107
341200ab97
gpu kernel_info_setter code review
5 years ago
fary86
fcbb3e0edc
Refactor ms_context implementation
5 years ago
gukecai
bf0acf4ce0
fix stream bug
5 years ago
lizhenyu
6fdd52080d
add mode black list checker
5 years ago
kswang
756bb6d53f
async run graph
5 years ago
mindspore-ci-bot
7336ed94fd
!5395 fix assign memory error in pynative mode
Merge pull request !5395 from chujinjin/fix_assign_memory_error_in_pynative
5 years ago
mindspore-ci-bot
82ae946fe6
!5362 gpu GoogleNet performance optimize
Merge pull request !5362 from VectorSL/slice
5 years ago
mindspore-ci-bot
11b3fa4bc6
!5349 gpu GoogleNet performance optimize
Merge pull request !5349 from limingqi107/master
5 years ago
VectorSL
f95fe92ad3
slice support nhwc
5 years ago
limingqi107
ff6b64a598
gpu GoogleNet performance optimize
5 years ago
mindspore-ci-bot
a8317acee8
!5111 Support int64 for cpu sparse optimizers
Merge pull request !5111 from YuJianfeng/int64
5 years ago
mindspore-ci-bot
2995c36267
!5315 add kernel release resource
Merge pull request !5315 from limingqi107/master
5 years ago