caifubi
|
5b963aef2b
|
Change uintptr_t to void ptr
|
5 years ago |
mindspore-ci-bot
|
47275427da
|
!1210 Add exception check for BiasAdd kernel
Merge pull request !1210 from chenweifeng/cudnn_exception
|
5 years ago |
wilfChen
|
b330766a0f
|
cuda exception check
|
5 years ago |
wilfChen
|
83151509dc
|
UnsortedSegmentSum kernel support Nd
|
5 years ago |
wilfChen
|
1991a89f40
|
LayerNormGrad fix & codex
|
5 years ago |
VectorSL
|
9996e0d4d2
|
gpu update shape infer
|
5 years ago |
mindspore-ci-bot
|
8f40f36c6c
|
!924 gpu queue support multi-inputs
Merge pull request !924 from chenweifeng/dataset
|
5 years ago |
wilfChen
|
59c4cf256c
|
gpu support broadcast kernels
|
5 years ago |
wilfChen
|
ccf6dabe13
|
gpu queue support multi-inputs
|
5 years ago |
mindspore-ci-bot
|
680ce090a3
|
!1057 matmul support fp16
Merge pull request !1057 from chenweifeng/matmul
|
5 years ago |
mindspore-ci-bot
|
0edc6d254a
|
!370 Gpu Support UnsortedSegmentSum kernel
Merge pull request !370 from chenweifeng/unsorted_segment_sum
|
5 years ago |
mindspore-ci-bot
|
907b609b05
|
!994 gpu broadcast kernel support different dims
Merge pull request !994 from chenweifeng/broadcast_unequal_dims
|
5 years ago |
mindspore-ci-bot
|
b5096e1f6c
|
!1021 gpu support MinimumGrad & MaximumGrad kernel
Merge pull request !1021 from chenweifeng/broadcast_grad
|
5 years ago |
mindspore-ci-bot
|
da7054645a
|
!948 gpu support LogSoftmax & LogSoftmaxGrad kernel
Merge pull request !948 from chenweifeng/logsoftmax
|
5 years ago |
wilfChen
|
b56572bb89
|
matmul support fp16
|
5 years ago |
mindspore-ci-bot
|
96e2f9cbbe
|
!1032 quantization aware training bug fix.
Merge pull request !1032 from SanjayChan/bug_fix
|
5 years ago |
limingqi107
|
05e8d95e7f
|
optimize the gpu context switch
|
5 years ago |
chenzomi
|
97648de5e4
|
bug fix
|
5 years ago |
wilfChen
|
00e78bf6c4
|
gpu support MinimumGrad & MaximumGrad kernel
|
5 years ago |
wilfChen
|
31f3611f9a
|
gpu support UnsortedSegmentSum kernel
|
5 years ago |
wilfChen
|
0a1195ddf5
|
broadcast kernel support unqual dims & half
|
5 years ago |
ZPaC
|
d3936b9f2a
|
GPU kernels adapt with special dimensions.
|
5 years ago |
wilfChen
|
1eb60df5d4
|
gpu support logsoftmax & logsoftmaxgrad kernel
|
5 years ago |
mindspore-ci-bot
|
bda4ebd591
|
!322 Gpu Support RMSProp kernel
Merge pull request !322 from chenweifeng/rmsprop
|
5 years ago |
mindspore-ci-bot
|
f602970990
|
!323 Gpu Concat support 4 inputs
Merge pull request !323 from chenweifeng/concat
|
5 years ago |
mindspore-ci-bot
|
4e25fec769
|
!324 Gpu Slice kernel performance improve
Merge pull request !324 from chenweifeng/slice
|
5 years ago |
mindspore-ci-bot
|
378a7122a5
|
!372 Gpu support BatchMatMul kernel
Merge pull request !372 from chenweifeng/batchmatmul
|
5 years ago |
mindspore-ci-bot
|
97d21ba014
|
!502 Gpu Support Gelu & GeluGrad
Merge pull request !502 from chenweifeng/gelu
|
5 years ago |
mindspore-ci-bot
|
a97f30ba7d
|
!516 Gpu support Tanh & TanhGrad kernel
Merge pull request !516 from chenweifeng/tanh
|
5 years ago |
mindspore-ci-bot
|
38c56fd1a5
|
!945 gpu queue support Sqrt & Rsqrt kernel
Merge pull request !945 from chenweifeng/unary
|
5 years ago |
mindspore-ci-bot
|
d004ef2234
|
!962 GPU conv2dBprop update size init
Merge pull request !962 from VectorSL/conv-init-update
|
5 years ago |
mindspore-ci-bot
|
36970a3b10
|
!934 gpu support broadcast kernels
Merge pull request !934 from chenweifeng/broadcast
|
5 years ago |
wilfChen
|
a304304c30
|
gpu support Gelu & GeluGrad kernels
|
5 years ago |
wilfChen
|
311bf41e6d
|
gpu support tanh & tanhgrad kernel
|
5 years ago |
wilfChen
|
67a0cc3bf1
|
gpu queue support unary
|
5 years ago |
VectorSL
|
19972fd347
|
update conv2d_bprop init
|
5 years ago |
mindspore-ci-bot
|
e03359cc32
|
!896 GPU tensoradd add shape checking
Merge pull request !896 from VectorSL/update-tensoradd
|
5 years ago |
VectorSL
|
944c9ec933
|
gpu tensoradd add shape validation check
|
5 years ago |
wilfChen
|
16f0688230
|
gpu support broadcast kernels
|
5 years ago |
wilfChen
|
a266fd03b0
|
fix codex warning
|
5 years ago |
changzherui
|
b323199dc1
|
syn code 430
|
5 years ago |
mindspore-ci-bot
|
7eaed2463f
|
!786 GPU update testcase for amp
Merge pull request !786 from VectorSL/gpu-add-test-amp
|
5 years ago |
VectorSL
|
1d7fe758a0
|
gpu add test for amp
|
5 years ago |
mindspore-ci-bot
|
8c035a5171
|
!756 Gpu support LayerNorm kernel
Merge pull request !756 from chenweifeng/layer_norm
|
5 years ago |
VectorSL
|
25af911ed9
|
gpu update bn
|
5 years ago |
wilfChen
|
53b4529558
|
Gpu support LayerNorm kernel
|
5 years ago |
VectorSL
|
1a6f62bd25
|
gpu update type check
|
5 years ago |
mindspore-ci-bot
|
001912237e
|
!677 GPU fix codex for conv2d
Merge pull request !677 from VectorSL/fix-codex-for-gpu-conv2d
|
5 years ago |
VectorSL
|
fe9008f73c
|
fix codex for gpu conv2d
|
5 years ago |
VectorSL
|
ee7a64018c
|
gpu update conv kernel for auto-mixed-precision
|
5 years ago |