Peilin Wang
f020e19636
add int32 support to greater gpu kernel
fix ci
5 years ago
mindspore-ci-bot
6fdb43d22d
!5895 gpu maximum minimum absgrad kernel fix
Merge pull request !5895 from chenweifeng/maximum-fix
5 years ago
mindspore-ci-bot
98725bc865
!5790 [MS][GPU][CUDA] Dedicated new user facing Pad API kernel
Merge pull request !5790 from danishnxt/GPU_three
5 years ago
mindspore-ci-bot
b717a686cf
!5690 ROIAlign kernel memory leak
Merge pull request !5690 from JonathanY/rcnn
5 years ago
wilfChen
3b54e55223
gpu maximum & minimum kernel with fp16 input
5 years ago
ZPaC
87bf2a7dcd
Add PS context.
5 years ago
wuxuejian
bd527a331d
update aicpu proto and update module: graphengine
Support Dynamic Shape Aicpu Run Package
adapt tensorengin modify, fix ub fusion
5 years ago
mindspore-ci-bot
a3d0ddb4db
!5779 tensoradd profiling
Merge pull request !5779 from chenweifeng/broadcast-refactor
5 years ago
wilfChen
6ebe132cd3
broadcast refactor
5 years ago
mindspore-ci-bot
b9345d1d34
!5775 fix categorical in GraphMode
Merge pull request !5775 from baihuawei/0902
5 years ago
baihuawei
92f1855a79
fix categorical in GraphMode
5 years ago
danish
273fc0071c
New User facing Pad Kernel + ST + Allows for channel padding
style fix
lint fixes
added check in NN layer for > 4 paddings, plus lint fix
fix python lint
lint fix
lint fix
updating to pytest asserts to improve testing
removed unnecessary vars from test file fail checks
5 years ago
limingqi107
5058e844cd
gpu inceptionv3 optimize
5 years ago
mindspore-ci-bot
bc4c5afc1a
!5667 add kernel select after optimize pass
Merge pull request !5667 from zyli2020/code_refactor
5 years ago
lizhenyu
c3d6918649
add kernel select after optimize pass
5 years ago
danish
a46fb8dd9f
bug fixes for new MirrorPadGrad NMS IOU comparison
5 years ago
mindspore-ci-bot
12ff0be5fa
!5716 Unify float to int cast and get initial accum for ps ftrl.
Merge pull request !5716 from ZPaC/master-unify-float-to-int-cast
5 years ago
ZPaC
997304e2c5
Unify float to int cast.
5 years ago
mindspore-ci-bot
7a636939eb
!5651 Modify interface for normal_normal function
Merge pull request !5651 from lilei/normal
5 years ago
mindspore-ci-bot
1944b8e53b
!5612 Resnet50 pattern Fusion
Merge pull request !5612 from chenweifeng/BatchNormAddReluGrad
5 years ago
Jonathan Yan
bbd19dbe43
roi align memory leak
5 years ago
jjfeing
2735dcd14b
when read json failed, sleep 500ms, retry again.
5 years ago
lilei
1a0047c9b7
Modify interface for normal_normal function
5 years ago
mindspore-ci-bot
749979e7c4
!5458 NMS GPU OP Performance improvement
Merge pull request !5458 from danishnxt/GPU_two
5 years ago
mindspore-ci-bot
981bfbfa74
!5190 Add API to query GPU queue size and capacity
Merge pull request !5190 from anthonyaje/gpu_queue_size
5 years ago
mindspore-ci-bot
413e3eb5c0
!5622 fix nccl broadcast
Merge pull request !5622 from baihuawei/0901
5 years ago
wilfChen
5316061fa3
gpu resnet50 fusion
5 years ago
baihuawei
572a7c4741
fix nccl broadcast
5 years ago
mindspore-ci-bot
660aa8e60d
!4958 Fix GPU-ArgMaxWithValue
Merge pull request !4958 from 34bunny/GPU-argmaxwithvalue-fix
5 years ago
ZPaC
442b38dc20
Delete extra file
5 years ago
fary86
fcbb3e0edc
Refactor ms_context implementation
5 years ago
mindspore-ci-bot
b9c2da520f
!5489 modify the format info of tensorAdd
Merge pull request !5489 from limingqi107/master
5 years ago
mindspore-ci-bot
b4caf21f63
!5454 fix Categorical log_prob
Merge pull request !5454 from baihuawei/categorical
5 years ago
limingqi107
109e2e9bcc
modify the format info of tensorAdd
5 years ago
baihuawei
779e27b91d
support categorical log_prob
5 years ago
mindspore-ci-bot
be138e4618
!5430 Clean codex and reviewbot warnings
Merge pull request !5430 from huanghui/clear-warning
5 years ago
danish
7d7fa760a0
reduce based nms final pass - speed improv
refactored faster nms
refactored faster nms + typo fix
added box flipping choice
set choice to true for testing - yz
switching back
new test file
5 years ago
huanghui
998ff0399b
clear codex and reviewbot warning
5 years ago
mindspore-ci-bot
2b86c92fec
!5413 Combine sparse embedding gradient
Merge pull request !5413 from chengang/combine_grad
5 years ago
mindspore-ci-bot
2060edb015
!5160 Call SHA256 through C++ instead of Python
Merge pull request !5160 from huangbingjian/master
5 years ago
mindspore-ci-bot
bb8b410998
!5394 Remove unused header file definitions
Merge pull request !5394 from huangbingjian/remove_h
5 years ago
cristoval
505108633c
combine sparse embedding gradient
5 years ago
HuangBingjian
0377deb3e4
Remove unused header file definitions
5 years ago
mindspore-ci-bot
82ae946fe6
!5362 gpu GoogleNet performance optimize
Merge pull request !5362 from VectorSL/slice
5 years ago
mindspore-ci-bot
11b3fa4bc6
!5349 gpu GoogleNet performance optimize
Merge pull request !5349 from limingqi107/master
5 years ago
HuangBingjian
43329202b7
add sha256
5 years ago
VectorSL
f95fe92ad3
slice support nhwc
5 years ago
limingqi107
ff6b64a598
gpu GoogleNet performance optimize
5 years ago
mindspore-ci-bot
a8317acee8
!5111 Support int64 for cpu sparse optimizers
Merge pull request !5111 from YuJianfeng/int64
5 years ago
mindspore-ci-bot
2995c36267
!5315 add kernel release resource
Merge pull request !5315 from limingqi107/master
5 years ago