mindspore-ci-bot
|
9297ba0a8d
|
!5048 fix gpu multinomial
Merge pull request !5048 from baihuawei/0821
|
5 years ago |
baihuawei
|
c085c5f071
|
add multinomial
|
5 years ago |
mindspore-ci-bot
|
75af54647f
|
!4954 Fix GPU non-sparse cross-entropy op returning all zeros
Merge pull request !4954 from tom_chen/cross_entropy
|
5 years ago |
mindspore-ci-bot
|
af45133a8f
|
!4996 Fix GPU-RandomChoiceWithMask speed bottleneck
Merge pull request !4996 from 34bunny/GPU-RandomChoiceWithMask-fixv2
|
5 years ago |
TFbunny
|
10c2918558
|
fix speed bottleneck for SrandInit and Shuffle in GPU-RandomChoiceWithMask
|
5 years ago |
mindspore-ci-bot
|
8e360888d0
|
!4590 fix gpu matmul fp32 accuracy
Merge pull request !4590 from qujianwei/master
|
5 years ago |
mindspore-ci-bot
|
e2203bed01
|
!3957 Gpu StridedSlice dims exceeds
Merge pull request !3957 from chenweifeng/strided_slice_dims_exceeds
|
5 years ago |
tom__chen
|
8fa4422dac
|
fix non-sparse cross entropy gpu kernel
fix white space
|
5 years ago |
mindspore-ci-bot
|
65c343c8be
|
!4935 GPU fix conv2dback bug
Merge pull request !4935 from VectorSL/conv-fix
|
5 years ago |
VectorSL
|
7d7b81348e
|
gpu fix conv bug
|
5 years ago |
lizhenyu
|
fcaf86f5d9
|
fix nccl kernel memory align bug
|
5 years ago |
wilfChen
|
837aecf9af
|
gpu stridedslice
|
5 years ago |
mindspore-ci-bot
|
6511491a7c
|
!4691 Fix MASS and FasterRcnn README.
Merge pull request !4691 from linqingke/new_ops
|
5 years ago |
mindspore-ci-bot
|
13d1738ff3
|
!4706 fix SmoothL1Loss gpu kernel
Merge pull request !4706 from Peilin/smoothL1Loss-fix
|
5 years ago |
mindspore-ci-bot
|
87c7cf46b6
|
!4475 Add 2 reserve output for BatchNormGrad
Merge pull request !4475 from JonathanY/batchnormgrad
|
5 years ago |
lizhenyu
|
839ec02542
|
Add FusedBatchEx support
|
5 years ago |
mindspore-ci-bot
|
a3dae6344b
|
!4758 GPU kernel support NHWC
Merge pull request !4758 from VectorSL/nhwc-support
|
5 years ago |
mindspore-ci-bot
|
b9bde4c826
|
!4559 codex warning
Merge pull request !4559 from chenweifeng/warning
|
5 years ago |
mindspore-ci-bot
|
4ec4205e87
|
!4221 gpu add format transform pass
Merge pull request !4221 from limingqi107/master
|
5 years ago |
mindspore-ci-bot
|
b0b590ef9e
|
!4760 fix gpu loss grad with reduction
Merge pull request !4760 from baihuawei/loss
|
5 years ago |
VectorSL
|
e939d61a2c
|
conv pooling pad support NHWC
|
5 years ago |
wilfChen
|
061bbf1f87
|
code warning
|
5 years ago |
limingqi107
|
5b76e8f3d7
|
gpu add format transform pass
|
5 years ago |
lizhenyu
|
19d50fea3e
|
add FusedBatchNormGradEx gpu kernel
|
5 years ago |
Peilin Wang
|
0d5220d33c
|
modified documentation and gpu kernel for smoothL1Loss
fix pylint
changed doc and code for SmoothL1Loss to be same a dchip. fixed grad kernel
fix ci
|
5 years ago |
Jonathan Yan
|
72d2597cb7
|
Add 2 reserve output for BatchNormGrad v1
|
5 years ago |
lizhenyu
|
7ddddc41a9
|
add FusedBatchNoramEx gpu kernel
|
5 years ago |
baihuawei
|
04f4be4818
|
fix gpu loss grad
|
5 years ago |
baihuawei
|
772e14d00d
|
add categorical
|
5 years ago |
mindspore-ci-bot
|
2b4febb430
|
!4436 Refactor uniform ops in GPU context
Merge pull request !4436 from peixu_ren/custom_gpu
|
5 years ago |
mindspore-ci-bot
|
505b6b5a9b
|
!4699 NMS_GPU_Sorting_Mem Fix
Merge pull request !4699 from danishnxt/GPU_two
|
5 years ago |
linqingke
|
642d48e9dd
|
fix floordiv and iou ops.
|
5 years ago |
peixu_ren
|
5dd4933328
|
Refactor uniform ops in GPU context
|
5 years ago |
Peilin Wang
|
6719169a7f
|
added type support for atomic add and scatternd
fix ci
fix ci
|
5 years ago |
danish
|
97f08e74ec
|
nms_sorting fix
lint py fix 2
nms_py_file test value fix
lint fix
|
5 years ago |
qujianwei
|
c21ffc0317
|
fix gpu matmul fp32 accuracy
|
5 years ago |
mindspore-ci-bot
|
3fb58fcbe4
|
!4585 add gpu nccl broadcast
Merge pull request !4585 from baihuawei/broadcast
|
5 years ago |
baihuawei
|
b9ebd9c280
|
add gpu nccl broadcast
|
5 years ago |
lizhenyu
|
d667d6ee92
|
bugfix:SigmoidCrossEntropyWithLogitsGrad need multiply dout
|
5 years ago |
mindspore-ci-bot
|
ca7ec00c71
|
!4479 Fix gpu registration macro to work with type unsigned char
Merge pull request !4479 from Peilin/gpu-reg-macro-fix
|
5 years ago |
Jonathan Yan
|
19d7488195
|
ROIAlignGrad kernel output size is wrong
|
5 years ago |
Peilin Wang
|
7baf5352ca
|
allow unsigned char types during registration, convert char to unsigned char
fixed some bugs
fixed cpplint
fix cpplint
|
5 years ago |
mindspore-ci-bot
|
c7b50bcdd2
|
!4251 adding type support for gpu kernels for EfficientNet
Merge pull request !4251 from Peilin/efficientnet
|
5 years ago |
mindspore-ci-bot
|
a23dd7147a
|
!4246 add type support for gpu kernelsl for faster-rcnn
Merge pull request !4246 from Peilin/faster-rcnn-type-support
|
5 years ago |
mindspore-ci-bot
|
01962afd23
|
!4024 Support half data type in ROIAlign/ROIAlignGrad Kernel
Merge pull request !4024 from JonathanY/roihalf
|
5 years ago |
mindspore-ci-bot
|
c041f4a295
|
!4368 add fix to GPU-RandomChoiceWithMask
Merge pull request !4368 from 34bunny/GPU-RandomChoiceWithMask-fix
|
5 years ago |
Peilin Wang
|
571094f473
|
added type support for transpose and maxgrad
fix pylint
addressed code review comment
|
5 years ago |
Peilin Wang
|
3cb3a5c7d8
|
type support for faster rcnn gpu kernels
addressed code review comments
fix cpplint and pylint
trying to fix python ut
fix smoke test
|
5 years ago |
TFbunny
|
17d01e838f
|
add fix to GPU-RandomChoiceWithMask (bitonicsort & testcase)
|
5 years ago |
mindspore-ci-bot
|
1856fb6af1
|
!3800 add gpu multinomial backend
Merge pull request !3800 from baihuawei/multinomial-c
|
5 years ago |