63 Commits (bada826b18f3e4dc8aac0f82af3457794de9c7bb)

Author SHA1 Message Date
  mindspore-ci-bot b619af917f !7166 gpu no broad cast kernel dims exceed 5 years ago
  mindspore-ci-bot 483f1aca9d !7106 GPU change kernel shape to size_t 5 years ago
  wilfChen 3eae63e4e9 gpu no broadcast kernel dim exceed 5 years ago
  VectorSL f6d5508942 addn use intermediate results 5 years ago
  VectorSL 447a45dbe7 change gpu kernel shape to size_t 5 years ago
  VectorSL f36c2721af gpu add combine cast fusion 5 years ago
  VectorSL ad0a69a60e sqrt rsqrt add fp16 5 years ago
  mindspore-ci-bot d60033c8db !6381 Add dtype float16 that erf and erfc should support 5 years ago
  mindspore-ci-bot f6f7815fa2 !6440 add sin cos gpu-op 5 years ago
  wukesong f9a865fd42 add GPU operator 5 years ago
  mindspore-ci-bot 076d8ae530 !6458 GPU codex fix 5 years ago
  mindspore-ci-bot 1dc71651ae !6437 clear warning 5 years ago
  VectorSL 9e6bd72e04 fix codex 5 years ago
  baihuawei e0c063704c clear warnings 5 years ago
  peixu_ren 8132e56417 Add dtype float16 that erf and erfc should support 5 years ago
  mindspore-ci-bot 3f0ac45954 !6206 new add gpu ops sqrt_grad and rsqrt_grad. 5 years ago
  linqingke dda3176fca new add sqrt_grad and rsqrt_grad. 5 years ago
  mindspore-ci-bot c9fa006b92 !6308 [MS][GPU][CUDA] NMS_Pass Kernel performance improvement 5 years ago
  peixu_ren fdd2d8209f Support erf and erfc ant GPU backend 5 years ago
  Danish Farid 8c7cc7943d NMS perf boost 5 years ago
  Peilin Wang f020e19636 add int32 support to greater gpu kernel 5 years ago
  mindspore-ci-bot a3d0ddb4db !5779 tenoradd profiling 5 years ago
  wilfChen 6ebe132cd3 broadcast refactor 5 years ago
  mindspore-ci-bot b9345d1d34 !5775 fix categorical in GraphMode 5 years ago
  baihuawei 92f1855a79 fix categorical in GraphMode 5 years ago
  limingqi107 5058e844cd gpu inceptionv3 optimize 5 years ago
  mindspore-ci-bot 749979e7c4 !5458 NMS GPU OP Performance improvement 5 years ago
  limingqi107 109e2e9bcc modify the format info of tensorAdd 5 years ago
  danish 7d7fa760a0 reduce based nms final pass - speed improv 5 years ago
  limingqi107 ff6b64a598 gpu GoogleNet performance optimize 5 years ago
  mindspore-ci-bot 9297ba0a8d !5048 fix gpu multinomial 5 years ago
  baihuawei c085c5f071 add multinomial 5 years ago
  mindspore-ci-bot 8e360888d0 !4590 fix gpu matmul fp32 accuracy 5 years ago
  baihuawei 772e14d00d add categorical 5 years ago
  mindspore-ci-bot 2b4febb430 !4436 Refactor uniform ops in GPU context 5 years ago
  peixu_ren 5dd4933328 Refactor uniform ops in GPU context 5 years ago
  danish 97f08e74ec nms_sorting fix 5 years ago
  qujianwei c21ffc0317 fix gpu matmul fp32 accuracy 5 years ago
  Peilin Wang 571094f473 added type support for transpose and maxgrad 5 years ago
  mindspore-ci-bot 1856fb6af1 !3800 add gpu multinomial backend 5 years ago
  wilfChen 89ce0bdb78 maximumgrad 5 years ago
  baihuawei 40748a30c7 add multinomial backend 5 years ago
  ZPaC 1dcc34e785 Add GPU div kernel 5 years ago
  mindspore-ci-bot eb84ae4593 !4048 Fix broadcast, scatternd, reduce ops. 5 years ago
  mindspore-ci-bot 4554a80807 !4074 fix cumsum bug 5 years ago
  baihuawei 6053b85807 fix cumsum 5 years ago
  linqingke fb405ee6f4 broadcast, slice, scatter_nd ops optimizer. 5 years ago
  mindspore-ci-bot fea930f7aa !4088 make gpu op Less to support int32 5 years ago
  root 3b41023a6b add int32 cal for less gpu 5 years ago
  mamba_ni 96642a76fd support cusolver AND OPS cholesky_solve 5 years ago