mindspore-ci-bot
0d5e119fa4
!7483 Add GPU-UniformSampler and nn.UniformCandidateSampler
Merge pull request !7483 from 34bunny/GPU-ucs
5 years ago
mindspore-ci-bot
2c468a358a
!7537 new gpu op for cbg: repeat_elements
Merge pull request !7537 from Peilin/cbg-new-gpu-ops-repeat_elements
5 years ago
lianliguang
8dfc7485c6
update mindspore/ccsrc/backend/kernel_compiler/tbe/tbe_kernel_select/tbe_kernel_select.cc.
fix bug of nz filter
5 years ago
mindspore-ci-bot
93c03ca4df
!7565 Decouple GraphKernel ops from akg op register
Merge pull request !7565 from DeshiChen/1012_gen_json_from_anf
5 years ago
Peilin Wang
bd0b462691
new gpu op for cbg repeat_elements
fixed ci
fixed ci
addressed comments
5 years ago
TFbunny
e18a78feb9
add GPU UniformSampler
5 years ago
mindspore-ci-bot
6cc37db833
!7482 roialign gpu operator output is zero
Merge pull request !7482 from JonathanY/roialign_zero
5 years ago
mindspore-ci-bot
e24b50f559
!7548 Support elu and elugrad with dtype float and half on gpu
Merge pull request !7548 from zhouyuanshen/master
5 years ago
dayschan
f429fe7888
Decouple GraphKernel ops from akg op register
1. Extract op info from AnfNode when generating composite operators.
2. Ignore the op register info when selecting kernel.
3. Update akg submodule.
4. Remove akg gpu op registers that were added for GraphKernel.
5. Add an interface `IsNodeInGraphKernel` into `AnfRuntimeAlgorithm`, next to the `IsGraphKernel`.
6. Sort included files, since the clang-format rule has changed.
TODO: Remove the json generator in GraphKernelExpander, it's ok to call this interface.
5 years ago
zhouyuanshen
2ab0b3d67a
support ops elu and elugrad on gpu
5 years ago
mindspore-ci-bot
fd8ebde400
!7545 GPU reshape add type
Merge pull request !7545 from VectorSL/int64
5 years ago
VectorSL
5c0c363c66
gpu reshape add type
5 years ago
jonwe
4583bc1c57
roialign zero output
5 years ago
mindspore-ci-bot
2744bad8b9
!7440 add CSE for GraphKernel
Merge pull request !7440 from lingyunli63/GraphKernelCSE
5 years ago
lingyunli63
a500a57c72
add GraphKernelCSE
5 years ago
mindspore-ci-bot
9638055a76
!7488 Support asin and acos with dtype float on gpu
Merge pull request !7488 from zhouyuanshen/master
5 years ago
zhouyuanshen
f49bd92b88
support asin and acos with dtype float on gpu
5 years ago
mindspore-ci-bot
6dd56e8d8b
!7470 fix bug of filter kernel info
Merge pull request !7470 from lianliguang/master
5 years ago
yanghaitao
9770be4bab
gpu profiling
5 years ago
William Lian
70c2920615
fix bug of filter kernel info
5 years ago
mindspore-ci-bot
4424873e7e
!7256 GPU add reduce precision
Merge pull request !7256 from VectorSL/reduceprecision
5 years ago
mindspore-ci-bot
3a088bfa09
!7132 fix GPU P.UniformInt bug
Merge pull request !7132 from 34bunny/GPU-fix_UniformInt
5 years ago
mindspore-ci-bot
237270145c
!7297 fix cpu square int32 type
Merge pull request !7297 from baihuawei/fixsquare
5 years ago
mindspore-ci-bot
6152bdaf16
!7332 cpu conv2d support tuple pad
Merge pull request !7332 from baihuawei/conv2dgrad
5 years ago
baihuawei
a534e0a320
cpu conv2d support tuple pad
5 years ago
mindspore-ci-bot
9aba77dfec
!7276 fix cpu conv2d padding
Merge pull request !7276 from baihuawei/conv2d
5 years ago
mindspore-ci-bot
d4d3d286cb
!7227 fix cpu slice with a certain scene
Merge pull request !7227 from baihuawei/fixslice
5 years ago
mindspore-ci-bot
2fb6b0cc5d
!7140 UnsortedSegmentSum null check
Merge pull request !7140 from chenweifeng/unsorted-null-checkout
5 years ago
baihuawei
91468be560
fix cpu square
5 years ago
mindspore-ci-bot
b619af917f
!7166 gpu no broadcast kernel dims exceed
Merge pull request !7166 from chenweifeng/broadcast-grad-dims-exceed
5 years ago
VectorSL
ccab6f88d5
gpu add reduce precision:int64->int32
5 years ago
baihuawei
ff5fb9d93c
fix cpu conv2d padding
5 years ago
TFbunny
e87767ea00
fix casting problem
5 years ago
mindspore-ci-bot
483f1aca9d
!7106 GPU change kernel shape to size_t
Merge pull request !7106 from VectorSL/gpu-size_t
5 years ago
mindspore-ci-bot
97087fca5b
!7143 GPU addn support same out/in
Merge pull request !7143 from VectorSL/addn-fix
5 years ago
mindspore-ci-bot
cfb131b844
!7030 add Meshgrid ops for aicpu
Merge pull request !7030 from yanzhenxiang2020/br_meshgrid
5 years ago
baihuawei
e7928b9c0e
fix cpu slice
5 years ago
mindspore-ci-bot
ff3438d9c2
!7158 [GraphKernel] Promote complex tensor as graph's input and recorrect getitem index for graph kernels fusion.
Merge pull request !7158 from TronZhang/promotion_const_for_gk
5 years ago
mindspore-ci-bot
d0e29996ec
!4262 Add thread pool
Merge pull request !4262 from huanghui/thread-pool
5 years ago
mindspore-ci-bot
b5723dcd81
!7100 fix GPU broadcast
Merge pull request !7100 from baihuawei/fixbroadcast
5 years ago
huanghui
a4b485eb51
thread pool
5 years ago
caifubi
d3b978147f
Ascend Dynamic Shape
5 years ago
tronzhang
c32bf5ac28
promote complex tensor as graph's input and recorrect getitem index for graph kernels fusion.
5 years ago
wilfChen
3eae63e4e9
gpu no broadcast kernel dim exceed
5 years ago
VectorSL
f6d5508942
addn use intermediate results
5 years ago
wilfChen
859e1dbbb3
null check
5 years ago
VectorSL
447a45dbe7
change gpu kernel shape to size_t
5 years ago
mindspore-ci-bot
8df757143a
!5874 optimize cpu kernel ScatterNdUpdate
Merge pull request !5874 from huanghui/optimize-cpu-kernel
5 years ago
baihuawei
de0b4d089f
fix cpu slice
5 years ago
huanghui
87d6b62488
optimize ScatterNdUpdate cpu kernel
5 years ago