i-robot
|
96f56b827e
|
!27203 while inplace op for cpu backend
Merge pull request !27203 from zhuzhongrui/gmres
|
4 years ago |
i-robot
|
e3f4893ab1
|
!27274 refine log of kernel select
Merge pull request !27274 from zyli2020/code_refactor
|
4 years ago |
z00512249
|
a1b8c2f065
|
while inplace op for cpu backend
|
4 years ago |
i-robot
|
61dd69a824
|
!27262 optimize error about BroadcastOpGrad
Merge pull request !27262 from chentangyu/code_cty_master_I4KOX8
|
4 years ago |
lizhenyu
|
b307016fd1
|
refine log of kernel select
|
4 years ago |
i-robot
|
32525fc643
|
!27219 rl discounted return
Merge pull request !27219 from chenweifeng/rl-discounted-return
|
4 years ago |
tacyi139
|
7413775490
|
optimize error about BroadcastOpGrad
|
4 years ago |
i-robot
|
017a2231cd
|
!27216 fix tensorarray
Merge pull request !27216 from VectorSL/fix-tensorarray
|
4 years ago |
wilfChen
|
4997877583
|
rl discounted return
|
4 years ago |
i-robot
|
f5215745f4
|
!27121 Apply Autodiff in Custom Op
Merge pull request !27121 from jiaoy1224/arithmetic
|
4 years ago |
VectorSL
|
9fe3c65da5
|
modify tensorarray
|
4 years ago |
i-robot
|
8ba5109640
|
!27074 Adapt nccl gpu kernel for compatibility.
Merge pull request !27074 from ZPaC/adapt-nccl-gpu-kernel
|
4 years ago |
i-robot
|
520fe19b27
|
!27064 Reimply tensorarray
Merge pull request !27064 from VectorSL/reimpy-tensorarray
|
4 years ago |
ZPaC
|
e1557c2ed0
|
Adapt nccl gpu kernel for compatibility.
|
4 years ago |
Yang Jiao
|
5acce31c72
|
apply autodiff in custom
|
4 years ago |
i-robot
|
22bf786fba
|
!27099 gpu multinomial kernel profiling
Merge pull request !27099 from chenweifeng/multinomial-profiling
|
4 years ago |
i-robot
|
a5e4fd4c71
|
!26945 tag environment bugfix
Merge pull request !26945 from chenweifeng/tag_env_bugfix
|
4 years ago |
wilfChen
|
cf63527a15
|
multinomial profiling
|
4 years ago |
VectorSL
|
20b38e880b
|
update tensor-array
|
4 years ago |
VectorSL
|
cb3d25c8f0
|
add cpu tensor array
|
4 years ago |
yuximiao
|
e99c0a48e6
|
support start profiler in the minddle of training.
|
4 years ago |
i-robot
|
b38600c11a
|
!26895 optimizes the kernel error description of Split, Meshgrid, Select, etc.
Merge pull request !26895 from wangshuide/wsd_master
|
4 years ago |
i-robot
|
bfd190482f
|
!26842 Speed up random normal sampling
Merge pull request !26842 from zichun_ye/random_normal_speed_up
|
4 years ago |
wilfChen
|
2113cb3c43
|
tag_env_bugfix
|
4 years ago |
tacyi139
|
bb935faca9
|
optimizes the kernel error description of Split, Meshgrid, Select, etc.
|
4 years ago |
i-robot
|
c1798df274
|
!26837 tag environment bugfix
Merge pull request !26837 from chenweifeng/tag-environment-bug-fix
|
4 years ago |
i-robot
|
b1deeb425d
|
!26849 bind stream with handle
Merge pull request !26849 from zhujingxuan/master
|
4 years ago |
zhujingxuan
|
30c6fa7f9b
|
bind stream with handle
|
4 years ago |
wangshuide2020
|
6cbe8dd02e
|
optimizes the kernel error description of LSTM, Pad, ReLU, etc.
|
4 years ago |
Zichun Ye
|
8398c07d68
|
update random normal op impl to speed up sampling
|
4 years ago |
i-robot
|
d66f811022
|
!26751 optimizes the kernel error description of Adagrad, Adam, Conv2d, etc.
Merge pull request !26751 from wangshuide/wsd_master
|
4 years ago |
wilfChen
|
ca8aba5c29
|
tag env bugfix
|
4 years ago |
wangshuide2020
|
674e3aa9d6
|
optimizes the kernel error description of Adagrad, Adam, Conv2d, etc.
|
4 years ago |
i-robot
|
e96f33e887
|
!25943 TensorArray
Merge pull request !25943 from VectorSL/tensor-array-v2
|
4 years ago |
i-robot
|
953acc0335
|
!26672 Use GPU mem Allocator and workspace instead of self allocator
Merge pull request !26672 from wuwenbing/master
|
4 years ago |
VectorSL
|
710289a72d
|
add tensor array
|
4 years ago |
wenbean
|
31053edbe4
|
Use Allocator and workspace pre allocat mem in GPU
|
4 years ago |
i-robot
|
30d182ac18
|
!26626 fix reduce ops axis multiple bug in GPU
Merge pull request !26626 from zhangbuxue/fix_reduce_ops_axis_multiple_bug_in_GPU
|
4 years ago |
buxue
|
89a688f3be
|
fix reduce ops axis multiple bug in GPU
|
4 years ago |
hezhenhao1
|
accc6368aa
|
Add support float64 as input type for ReduceProd GPU op.
|
4 years ago |
i-robot
|
a04fdd04c9
|
!26605 Fix gpu mem leak bug, add cuda memmalloc result check
Merge pull request !26605 from wuwenbing/master
|
4 years ago |
i-robot
|
0d11abc7c2
|
!26518 tag environment implement
Merge pull request !26518 from chenweifeng/tag-env-implement
|
4 years ago |
i-robot
|
1a7a04e4c9
|
!25132 [PyNative][MindRT][GPU] Op Lazy Build
Merge pull request !25132 from caifubi/master-pynative-mindrt-gpu-async-build
|
4 years ago |
caifubi
|
38352c1ba8
|
PyNative MindRT Op Lazy Build
|
4 years ago |
wilfChen
|
79b37042aa
|
tag environment implement
|
4 years ago |
wenbean
|
26d4bf6350
|
Fix meme leak bug, add result expect
|
4 years ago |
wenbean
|
13409f519f
|
Unify GPU/CPU ops input/output(col/rolmajor), modify related testcases, add linalg function and testcases
|
4 years ago |
i-robot
|
b5c02a4ee0
|
!26426 gpu environment kernel
Merge pull request !26426 from chenweifeng/gpu-environment-kernel
|
4 years ago |
i-robot
|
f72820404c
|
!26282 Add GPU eigenvalue/eigenvector ops for real and complex
Merge pull request !26282 from wuwenbing/master
|
4 years ago |
wilfChen
|
68260a6a94
|
gpu environment kernel implement
|
4 years ago |