zengzitao
62458b5636
adapt graph kernel for cpu
4 years ago
looop5
120930e1ce
modify indent of comments
4 years ago
tronzhang
e2a0c0d613
support parallel for ascend
4 years ago
hanhuifeng2020
cf0376a6a3
[GraphKernel][Ascend]fix a bug about ToNz
4 years ago
Samuel Batissou
b5757a42d2
Corrected typos
4 years ago
ckey_Dou
d91ff90d96
fix cleancode error
4 years ago
dayschan
32ecd8ee79
GraphKernel supports CPU
only Linux system is supported now.
change the default value of `ENABLE_AKG` to off, and controlled by option `-K`.
the `ENABLE_AKG` is auto enabled when `ENABLE_GPU` or `ENABLE_D` is on.
since now, we can use `ENABLE_AKG` to control the compilation of graphkernel
and akg codes.
fix usage description for option "-K", it should be "[-K on|off]".
LLVM is required by akg for cpu kernels, so AKG for cpu is default disabled now.
4 years ago
Yang Jiao
07a0c24126
update akg/switch off h-fuse
4 years ago
Gaoxiong
b1d36a974b
graph cycle bugfix of horizontal fusion
4 years ago
Yang Jiao
32d3ce532b
cluster standardnormal
4 years ago
r1chardf1d0
92cfa53903
[GraphKernel] fix infer shape
4 years ago
r1chardf1d0
c393deffcb
[GraphKernel] ascend support stitch fusion
4 years ago
i-robot
214556ff0c
!22647 Support horizontal fusion
Merge pull request !22647 from Gaoxiong/master
4 years ago
Gaoxiong
0586709409
support horizontal fustion
4 years ago
zengzitao
7d6c6b17bb
fix master warnings
4 years ago
Zichun Ye
a7d89f6686
add graph kernel userdefine op support
fix code check
4 years ago
tronzhang
134ffa9efe
support tot ops
4 years ago
ms_yan
36a8886ca2
Revert "[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset"
This reverts commit b077aa1cab .
Revert "[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset"
This reverts commit 4e6f7dc97d .
delete pass_registry_test.cc
comment hiai_nlu_model_multi.pb related line
4 years ago
djc
b077aa1cab
[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset
4 years ago
djc
4e6f7dc97d
[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset
4 years ago
i-robot
ff07de80b4
!21603 Fix static-check warnings
Merge pull request !21603 from DeshiChen/0809_clean
4 years ago
dayschan
9add26ad99
Add expanders in c++ code
transplant the op expander code from python to c++, base on LiteGraph.
the c++ expander will be called in priority if it was registered in OpExpanderFactory.
add two examples, BiasAdd and ExpandDims.
remove BiasAdd from python expanders.
since the ExpandDims is also imported by other ops (e.g. BatchNorm), we don't remove it now.
4 years ago
dayschan
fff39e7451
fix static check problems
4 years ago
i-robot
a5388a5227
!21621 fix recompute_fuse
Merge pull request !21621 from lingyunli63/fix_recompute_fuse
4 years ago
zengzitao
b8095efcab
gk fix warnings in master
4 years ago
lingyunli63
5ea00fbabc
fix find_cheap_ops
4 years ago
chenlei_autodiff
0271535429
[GraphKernel] fix bert and add graph kernel ops.
4 years ago
looop5
2fcf970f69
deal with workspace
deal with workspace on cuda
4 years ago
Zichun Ye
22172f18bc
update graph kernel support for argmax/argmin
fix pylint problem
fix conflict
fix op list
fix check warning
fix code based on review comments
update akg commit
fix check warning
4 years ago
tronzhang
94dda0c7c7
speed up bert profermance in ascend for graph kernel
4 years ago
i-robot
ab78bcf16f
!19081 add ops IsNan
Merge pull request !19081 from lingyunli63/add_ops_IsNan
4 years ago
Yang Jiao
582669e41c
add expander checkings
4 years ago
lingyunli63
50a66ae476
isnan isfinite isinf squaresumall identity oneslik
4 years ago
i-robot
e7b8cf2961
!20508 consider case that graph has no tensor input
Merge pull request !20508 from lingyunli63/fix_no_tensor_input
4 years ago
yanglf1121
c30b1e6d06
add graph kernel div, floordiv, mod, floormod, floor
4 years ago
lingyunli63
32571f5a0c
consider no tensor input in graph
4 years ago
wangrao124
7cddde47b0
!215 add ops: LogicalNot, LogicalAnd, LogicalOr, NotEqual, EqualCount, Asinh, Acosh
* add ops: LogicalNot, LogicalAnd, LogicalOr, NotEqual, EqualCount, Asinh, Acosh
4 years ago
huangmengxi
1bd448ef2a
add atan, atan2, expm1
4 years ago
chenlei_autodiff
7d55cef106
[GraphKernel] add sponge ops.
4 years ago
zengzitao
aa019a639f
support complex in gpu about graph_kernel
4 years ago
i-robot
92675f7572
!18747 Add expander for op AddN
Merge pull request !18747 from DeshiChen/0618_expand_addn
4 years ago
dayschan
149dab39c5
Add expander for AddN; update akg submodule
4 years ago
lishanni513
4a64bc6bc5
Enable TensorCore for Bert-Base on GPU, enable akg-conv2d for ssd on GPU
4 years ago
zhengzuohe
448d1cdbb6
Update submodule mindspore/akg
Enable Graph Kernel for W&D Host-Device Mode and ResNext50 on GPU
Enable akg-conv2d in yolov3_darknet on GPU
4 years ago
lingyunli63
3679e99fe8
fix recompute_fuse with reshape user
4 years ago
i-robot
5d5f38c501
!18428 [GraphKernel] Enable Auto-Mixed-Precision & TensorCore for AlexNet on GPU
Merge pull request !18428 from lishanni/master_alexnet
4 years ago
lishanni513
bddfd9ac65
Enable Auto-Mixed-Precision & TensorCore for AlexNet on GPU
4 years ago
lingyunli63
a995bea507
recompute_fuse
4 years ago
i-robot
04f42b9901
!18629 stop mutmul to mul on GPU
Merge pull request !18629 from lingyunli63/stop_matmul_to_mul_GPU
4 years ago
lingyunli63
e810ebb3d0
matmul to mul only work on aicore
4 years ago