wenbean
|
13409f519f
|
Unify GPU/CPU ops input/output (col/row-major), modify related test cases, add linalg function and test cases
|
4 years ago |
sunsuodong
|
bf208e243e
|
add fp16 kernel
|
4 years ago |
lby
|
63bc0ae7ff
|
Operator whitelist
|
4 years ago |
i-robot
|
b5c02a4ee0
|
!26426 gpu environment kernel
Merge pull request !26426 from chenweifeng/gpu-environment-kernel
|
4 years ago |
i-robot
|
45947b51ba
|
!26312 add grad support dynamic shape
Merge pull request !26312 from wangnan39/add_grad_support_dynamic_shape
|
4 years ago |
i-robot
|
d837e2c969
|
!26429 add cpu op fp64 register to fix bugs of gmres for backend cpu
Merge pull request !26429 from zhuzhongrui/gmres
|
4 years ago |
jin-xiulang
|
f972de90d0
|
Add exchangeKeys ops and getKeys ops for STABLE_PW_ENCRYPT
fix review suggestions
|
4 years ago |
i-robot
|
f72820404c
|
!26282 Add GPU eigenvalue/eigenvector ops for real and complex
Merge pull request !26282 from wuwenbing/master
|
4 years ago |
i-robot
|
8805939be4
|
!26444 optimize matmul broadcast
Merge pull request !26444 from wangyanling/optimizematmul
|
4 years ago |
wilfChen
|
68260a6a94
|
gpu environment kernel implement
|
4 years ago |
王南
|
f082f4ccec
|
add grad support dynamic
|
4 years ago |
z00512249
|
fe9442761c
|
add cpu op fp64 register to fix bugs of gmres for backend cpu
|
4 years ago |
i-robot
|
9d6248194e
|
!26310 MindSpore support load custom aicpu kernels.
Merge pull request !26310 from linqingke/aicpu
|
4 years ago |
wangyanling
|
ac30c5f317
|
optimize matmul broadcast
|
4 years ago |
xiefangqi
|
24ac2382ff
|
mac compile
|
4 years ago |
i-robot
|
5233c73805
|
!25592 Reshape support shape is variable
Merge pull request !25592 from wangnan39/reshape_support_tensor
|
4 years ago |
i-robot
|
ec4cd6933d
|
!26292 Add GPU operator NeighborExchange
Merge pull request !26292 from Cononlly/master
|
4 years ago |
i-robot
|
c7805f0fc7
|
!26294 [MS][LITE]Fix fuzz
Merge pull request !26294 from gongdaguo/master_fuzz
|
4 years ago |
wenbean
|
9b305b231d
|
Add GPU eigenvalues/eigenvectors for symmetric matrix (real and complex)
add cuda ops real to complex (imag is 0j)
cpu/gpu eigh ops already return complex eigenvalues for complex input
and return real eigenvalues for real input
|
4 years ago |
王南
|
1163cfe967
|
reshape support shape is tensor
|
4 years ago |
i-robot
|
dafa363bde
|
!26326 matmul int8 sdot
Merge pull request !26326 from zhaozhenlong/lite/issue/matmul-int8
|
4 years ago |
zhaozhenlong
|
34b46257b8
|
matmul int8 sdot
|
4 years ago |
cononlly
|
8035bafd7c
|
Add NeighborExchange support for GPU
|
4 years ago |
linqingke
|
bef2923acf
|
MindSpore support load custom aicpu ops.
|
4 years ago |
z00512249
|
21699e6855
|
fix bugs of gmres for backend cpu
|
4 years ago |
i-robot
|
c87712f775
|
!26287 Decouple AkgKernelJsonGenerator from backend/kernel_compiler.
Merge pull request !26287 from DeshiChen/1109_genjson
|
4 years ago |
i-robot
|
ed935d14f8
|
!26367 [MS][LITE][develop] add fp16 kernel
Merge pull request !26367 from sunsuodong/add_fp16_kernel
|
4 years ago |
i-robot
|
ca06c555f9
|
!26291 clean code for pclint
Merge pull request !26291 from liangxhao/master_cleancode
|
4 years ago |
gongdaguo
|
51d4299bf3
|
fix fuzz
|
4 years ago |
sunsuodong
|
a7f84e86d0
|
add fp16 kernel
|
4 years ago |
i-robot
|
50bbdfef17
|
!26319 [lite]fix codedex
Merge pull request !26319 from 徐安越/master
|
4 years ago |
i-robot
|
dc5278516c
|
!26130 add cholesky, cho_factor primitive and backend gpu implements
Merge pull request !26130 from zhuzhongrui/gmres
|
4 years ago |
i-robot
|
d1922b3556
|
!26309 [MS][LITE]Fix Int8 Clip
Merge pull request !26309 from gongdaguo/add_in8_2
|
4 years ago |
i-robot
|
73bb099deb
|
!26056 easy using for kernel select log print
Merge pull request !26056 from liubuyu/master
|
4 years ago |
z00512249
|
36032e7ee2
|
add cholesky, cho_factor primitive and backend gpu implements
|
4 years ago |
xuanyue
|
6b6d2bd196
|
fix codedex
|
4 years ago |
gongdaguo
|
a2afb88f0d
|
add int8 clip
|
4 years ago |
i-robot
|
a14a777464
|
!26050 [MS][LITE]Add int8 clip
Merge pull request !26050 from gongdaguo/add_int8_clip
|
4 years ago |
lianghao23
|
775007d755
|
clean code for pclint
|
4 years ago |
i-robot
|
ba3ee7d8c1
|
!26246 [lite]fix fuzz bug and element size overflow
Merge pull request !26246 from 徐安越/master
|
4 years ago |
i-robot
|
4dc5beb839
|
!26266 Fix LessEqual
Merge pull request !26266 from zuochuanyong/fix_less_than_gpu
|
4 years ago |
dayschan
|
7cc4e170cc
|
decouple akg_kernel_json_generator
|
4 years ago |
lby
|
ab0efba450
|
easy using for kernel select log print
|
4 years ago |
xuanyue
|
3cecadc8d9
|
fix fuzz bug and element size overflow
|
4 years ago |
i-robot
|
c8e303f9a7
|
!26253 [MS][LITE][develop] fix vs build and run bug
Merge pull request !26253 from sunsuodong/fix_vs_build
|
4 years ago |
zuochuanyong
|
1f289803e9
|
fix LessEqual bug
|
4 years ago |
sunsuodong
|
83875cf9b8
|
fix vs build
|
4 years ago |
sunsuodong
|
3d1066b6a0
|
add fp16 resize kernel
|
4 years ago |
zhaodezan
|
88b73e340d
|
sync commercial pclint clean
|
4 years ago |
i-robot
|
9a0be9152c
|
!26174 [MS][LITE] add tf unpack parser and allow ConstantOfShape to output tensor without elements
Merge pull request !26174 from XianglongZeng/myms_new
|
4 years ago |