i-robot
b49174cd40
!23096 Generalize gpu PadOp to support more than 4 dimensions
Merge pull request !23096 from Peilin/pad-bugfix
4 years ago
Peilin Wang
d7b23ca4b8
fix pad
remove 4d error testcase
fix ci
add 4d error nn back
4 years ago
i-robot
63114a3dfd
!22385 Fix bug in GPU conv3dtranspose
Merge pull request !22385 from fanrb/fix_conv3dtrans
4 years ago
simson
8a0087bceb
fix precision error of resizebilineargrad
4 years ago
fan1997
84a540e743
Fix bug in conv3dtranspose gpu
4 years ago
i-robot
f536d88570
!22821 modify resizebilineargrad input type
Merge pull request !22821 from Simson/opinfer
4 years ago
simson
f00e22342b
modify resizebilineargrad input type
4 years ago
i-robot
5e6287bec1
!22585 Add grad implementation of AdaptiveAvgPool2D
Merge pull request !22585 from zuochuanyong/adaptive_avgpool2d_grad
4 years ago
zuochuanyong
068191f222
add AdaptiveAvgPool2DGrad op
4 years ago
simson
7a2fbdda85
modify resizebilinear infer type
4 years ago
djc
b077aa1cab
[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset
4 years ago
djc
4e6f7dc97d
[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset
4 years ago
Peilin Wang
6a1b1495d9
initial commit: add nullptr exception in GetDeviceAddress
all cudnn functions now use the new GetPossiblyNullDeviceAddress
fix batchnorm
fix ci
fix nll loss
fix cast and concat
fix cast: skip kernel if null input and output
fix ci
fix concat: allow null input
fix concat: allow for null inputs
4 years ago
i-robot
22e9299c17
!20885 add dtypes & fft kernels for SPONGE
Merge pull request !20885 from huangmengxi/sponge_ccsrc
4 years ago
huangmengxi
e32297dc6b
add dtypes for sponge
4 years ago
danishfarid
92d9bc7ccd
fix for async mem_init bilinearResize_grad
fix - typo
4 years ago
Peilin Wang
594571fd4c
initial commit: fix 11 dts tickets
fix ci
4 years ago
zuochuanyong
1d565f9f8a
support MaxPool3DGrad on GPU
4 years ago
i-robot
d16e9bc3f6
!20369 GPU fix maxpoolgrad
Merge pull request !20369 from VectorSL/fix-maxpool
4 years ago
i-robot
fb33ba2b47
!19941 [MS][GPU] resizeBilinearGrad - Op FP16 fix
Merge pull request !19941 from danishfarid/resizeBilinearFix
4 years ago
VectorSL
a3590bca46
fix maxpool grad
4 years ago
danishfarid
aa37923aa5
first commit
typo fix
sep paths for fp32 and fp16 without fp32 copy
template dec fix
added 0 init for output for fp32 path
4 years ago
buxue
2b2efb0a75
fix prelu weight grad accuracy error fp16 on GPU
4 years ago
buxue
5bf41bfbd2
improve PReLU forward and implement backward on GPU
4 years ago
i-robot
8e043090be
!18472 Implement UNet3d on GPU
Merge pull request !18472 from likesen/master
4 years ago
likesen
99a995b432
Implement UNet3d on GPU
4 years ago
zuochuanyong
e890c2a2ae
fix accuracy error when input H is not equal to W
4 years ago
i-robot
26c7d274c9
!18441 Fix conv3d cudnn algorithm error
Merge pull request !18441 from tom_chen/conv3d
4 years ago
i-robot
6c33e0b710
!18392 fix the exception when occur error and replace magic number with const value.
Merge pull request !18392 from wangshuide/wsd_master
4 years ago
i-robot
9085be08b9
!18163 Support ConvTranspose3D on GPU
Merge pull request !18163 from likesen/master
4 years ago
tom__chen
35f6a1af56
fix conv3d cudnn algorithm error
4 years ago
Li Kesen
7d94095730
Support Conv3dTranspose for GPU
4 years ago
wangshuide2020
30690b1f27
fix the exception when occur error and replace magic number with const value.
4 years ago
buxue
d50d46013b
code security check
4 years ago
markuskunej
2fece8a7c2
added nll_loss_grad for gpu
4 years ago
i-robot
63b91904ec
!18098 Change NLL_Loss total_weight output for gpu
Merge pull request !18098 from markuskunej/nll_loss_total_weight_fix
4 years ago
i-robot
71bb69695f
!12151 Add UNet Model for GPU
Merge pull request !12151 from fanrb/unet
4 years ago
markuskunej
b2b9dfd24a
changed nll_loss total_weight output to be sum of the target weights instead of sum of weights
4 years ago
fan1997
be3d4e6fd3
1.Optimize bias add grad kernel
2.Optimize slice grad kernel
3.Add Unet GPU Model
5 years ago
i-robot
d2c3f68232
!17726 GPU resize_bilinear grad op support
Merge pull request !17726 from tom_chen/resize_bilinear
4 years ago
linqingke
c68ba5ac6a
fix dropout and matrix_inverse op bug.
4 years ago
tom__chen
4ba9ebf9c4
add resize bilinear grad gpu op
4 years ago
mindspore-ci-bot
9e2c8eed80
!17654 Implement NLL_Loss operation for GPU
From: @markuskunej
Reviewed-by:
Signed-off-by:
4 years ago
markuskunej
35e6994fb2
implemented nll_loss operation for gpu
4 years ago
tom__chen
e14554a752
fix conv3d shape indices
4 years ago
mindspore-ci-bot
2db8656048
!16320 Local Response Norm GPU Op support
From: @tom__chen
Reviewed-by: @robingrosman
Signed-off-by: @robingrosman
4 years ago
tom__chen
f280cec88d
Local Response Norm GPU Op
5 years ago
zhangxinfeng3
35e95b7d19
05311 sponge_ops update_thor_gradient thor_ops.py im2col
4 years ago
zhangzhaoju
2d667f9ed7
ci format problem fix.
4 years ago
mindspore-ci-bot
d738f7064c
!15861 Remove use_batch_statistics from InstanceNorm2d API
From: @tom__chen
Reviewed-by: @zhao_ting_v,@liangchenghui,@robingrosman
Signed-off-by: @liangchenghui
4 years ago