mindspore-ci-bot
e6ebb310eb
!8928 [MS][GPU][CUDA] - New GPU kernel -> LinSpace
From: @danishnxt
Reviewed-by:
Signed-off-by:
5 years ago
danishnxt
a17f76dd1d
Initial Commit - GPU LinSpace
comment fix
docString fix
added asserts in test file atop np checks
lint
lint-2
lint3
5 years ago
zengzitao
3ef0e9f053
substitute dropout by cudnnuniformreal and dropout
5 years ago
jonwe
9a6ced3cc7
DivNoNan gpu kernel supports int8/uint8
5 years ago
mindspore-ci-bot
71af3bf1ac
!8283 Minimum Op and Mul Op support dynamic shape
From: @jonwe
Reviewed-by:
Signed-off-by:
5 years ago
Jonathan Yan
5a8238a09c
mul v1
5 years ago
mindspore-ci-bot
f2652858ed
!8456 gpu kernel support int64
From: @wilfchen
Reviewed-by: @cristoval,@limingqi107
Signed-off-by: @cristoval
5 years ago
wilfChen
07d8622c7e
modify dynamic shape condition to fit ME
5 years ago
danishnxt
9dba65c424
TensorDot Conv: P -> C
5 years ago
wilfChen
e4e9362bd0
gpu support dynamic shape
5 years ago
Yi Huaijie
d7faa77b5e
support int64 shape
5 years ago
peixu_ren
9a81b50e43
Refactor Cholesky to CholeskyTrsm
5 years ago
peixu_ren
b8abcf858a
Add expm1 op at GPU back-end
5 years ago
mindspore-ci-bot
9c2704030f
!7872 Add log1p operator at GPU back-end and move erf and erfc to the unary_op list
Merge pull request !7872 from peixu_ren/custom_pp_ops
5 years ago
mindspore-ci-bot
5ef3d02ce3
!7696 Add calculation of triangle matrix determinant op at GPU back-end
Merge pull request !7696 from peixu_ren/custom_gpu
5 years ago
mindspore-ci-bot
4ebc55b29e
!7841 Add support for ACosGrad and AsinGrad on gpu
Merge pull request !7841 from zhouyuanshen/ACosGrad_and_AsinGrad
5 years ago
mindspore-ci-bot
f3e8798b40
!7879 fix bug that asin/acos do not support fp16 on gpu
Merge pull request !7879 from zhouyuanshen/asin_acos_fp16
5 years ago
zhouyuanshen
1804c042ec
fix bug that asin/acos do not support fp16 on gpu
5 years ago
zhouyuanshen
8481fd59d8
Add support for ACosGrad and AsinGrad
5 years ago
peixu_ren
dfe5a951eb
Add log1p operator at GPU back-end and move erf and erfc to the unary_op list
5 years ago
peixu_ren
99f2927c21
Add calculation of triangle matrix determinant op at GPU back-end
5 years ago
mindspore-ci-bot
5b28016b4d
!7735 fix gpu multinomial seed issue and GRAPH_MODE
Merge pull request !7735 from baihuawei/fixmultinomial
5 years ago
baihuawei
05dcec0583
fix multinomial
5 years ago
danishnxt
06a9b4aa37
Commit1_TensorDot
linting errors fix
comments cleanup
typo fix in doc
added more tests, fixed some formatting
changed expected dx values for linter
pylint const data fix
5 years ago
zhouyuanshen
f49bd92b88
support asin and acos with dtype float on gpu
5 years ago
mindspore-ci-bot
b619af917f
!7166 gpu no broadcast kernel dims exceed
Merge pull request !7166 from chenweifeng/broadcast-grad-dims-exceed
5 years ago
mindspore-ci-bot
483f1aca9d
!7106 GPU change kernel shape to size_t
Merge pull request !7106 from VectorSL/gpu-size_t
5 years ago
wilfChen
3eae63e4e9
gpu no broadcast kernel dim exceed
5 years ago
VectorSL
f6d5508942
addn use intermediate results
5 years ago
VectorSL
447a45dbe7
change gpu kernel shape to size_t
5 years ago
VectorSL
f36c2721af
gpu add combine cast fusion
5 years ago
VectorSL
ad0a69a60e
sqrt rsqrt add fp16
5 years ago
mindspore-ci-bot
d60033c8db
!6381 Add dtype float16 that erf and erfc should support
Merge pull request !6381 from peixu_ren/custom_gpu
5 years ago
mindspore-ci-bot
f6f7815fa2
!6440 add sin cos gpu-op
Merge pull request !6440 from wukesong/gpu-sin-cos
5 years ago
wukesong
f9a865fd42
add GPU operator
5 years ago
mindspore-ci-bot
076d8ae530
!6458 GPU codex fix
Merge pull request !6458 from VectorSL/codex-fix
5 years ago
mindspore-ci-bot
1dc71651ae
!6437 clear warning
Merge pull request !6437 from baihuawei/clear-warnings
5 years ago
VectorSL
9e6bd72e04
fix codex
5 years ago
baihuawei
e0c063704c
clear warnings
5 years ago
peixu_ren
8132e56417
Add dtype float16 that erf and erfc should support
5 years ago
mindspore-ci-bot
3f0ac45954
!6206 new add gpu ops sqrt_grad and rsqrt_grad.
Merge pull request !6206 from linqingke/new_ops
5 years ago
linqingke
dda3176fca
new add sqrt_grad and rsqrt_grad.
5 years ago
mindspore-ci-bot
c9fa006b92
!6308 [MS][GPU][CUDA] NMS_Pass Kernel performance improvement
Merge pull request !6308 from danishnxt/GPU_three
5 years ago
peixu_ren
fdd2d8209f
Support erf and erfc at GPU backend
5 years ago
Danish Farid
8c7cc7943d
NMS perf boost
fix:
revert the hash to orig
comment fix
remove area array
changed first workspace_size variable initialization
fixed variable name cases
5 years ago
Peilin Wang
f020e19636
add int32 support to greater gpu kernel
fix ci
5 years ago
mindspore-ci-bot
a3d0ddb4db
!5779 tensoradd profiling
Merge pull request !5779 from chenweifeng/broadcast-refactor
5 years ago
wilfChen
6ebe132cd3
broadcast refactor
5 years ago
mindspore-ci-bot
b9345d1d34
!5775 fix categorical in GraphMode
Merge pull request !5775 from baihuawei/0902
5 years ago
baihuawei
92f1855a79
fix categorical in GraphMode
5 years ago