Gaoxiong
|
8745e4d3f1
|
GraphKernel performance optimize: cache kernel addr in KernelMod
|
4 years ago |
huangxinjing
|
896daee845
|
[AutoParallel]Fix insert error for the mirror
|
4 years ago |
ckey_Dou
|
e9679ca0bd
|
return default shape when max_shape is empty
|
4 years ago |
zhoufeng
|
f49b195c39
|
extract common as an independent shared library
Signed-off-by: zhoufeng <zhoufeng54@huawei.com>
|
4 years ago |
i-robot
|
dfc6cbb6df
|
!30597 fix codedex warnings
Merge pull request !30597 from zyli2020/master
|
4 years ago |
i-robot
|
6c8f942737
|
!30544 fix codecheck
Merge pull request !30544 from xulei/fix_codecheck
|
4 years ago |
i-robot
|
09f114e52e
|
!30593 correct cublas path in cuda_ops cmakelists
Merge pull request !30593 from jinjiali-kali/cuda_ops
|
4 years ago |
i-robot
|
c4843c4085
|
!29958 upgrade ascend 20220211
Merge pull request !29958 from shenwei41/upgrade_ascend_20220211
|
4 years ago |
shenwei41
|
ff75314719
|
upgrade ascend 20220211
|
4 years ago |
i-robot
|
84ab084fa8
|
!30561 fix copy bug in mac when run mindir
Merge pull request !30561 from zhangbuxue/fix_copy_bug_in_mac_when_run_mindir
|
4 years ago |
jinjiali
|
2c47b286dd
|
correct cublas path in cuda_ops cmakelists
|
4 years ago |
i-robot
|
1de823f1eb
|
!30562 fix cpu matrix_set_diag && matrix_band_part kernel codex && pclint-plus
Merge pull request !30562 from zhuzhongrui/pub_master2
|
4 years ago |
lizhenyu
|
2bd2f8cfca
|
fix codedex warnings
|
4 years ago |
xialingtian
|
da641630cb
|
[feat] [assistant] [I48O93, I48O5Q] Add Sin and Cos operators.
|
4 years ago |
i-robot
|
0341d96dd6
|
!30469 add shard function to support part of the graph executed in auto_parallel under pynative mode
Merge pull request !30469 from wangjun/0223_pp
|
4 years ago |
i-robot
|
f32c92b361
|
!30560 [lite]adjust gather func's in-params' name and synchronize micro
Merge pull request !30560 from 徐安越/master1
|
4 years ago |
i-robot
|
55899ec0c5
|
!30545 fix cpu lu kernel codex && pclint-plus
Merge pull request !30545 from zhuzhongrui/pub_master3
|
4 years ago |
z00512249
|
0de0e0cd4b
|
fix cpu matrix_set_diag && matrix_band_part kernel codex && pclint-plus
|
4 years ago |
xuanyue
|
1173f55061
|
adjust gather func's in-params' name and synchronize micro
|
4 years ago |
xulei
|
4cf320cdbd
|
fix codecheck
|
4 years ago |
buxue
|
26d82b6a76
|
fix copy bug in mac when run mindir
|
4 years ago |
i-robot
|
c554d4a8b1
|
!29398 clean code
Merge pull request !29398 from kisnwang/clean-code
|
4 years ago |
i-robot
|
5e519c4304
|
!30534 remove redundent nullptr-check in TransposedUpdateFusion
Merge pull request !30534 from yuchaojie/ir_fusion2
|
4 years ago |
i-robot
|
f98a396e38
|
!26514 [assistant] [ops] Add new array operator ZerosLike and OnesLike
Merge pull request !26514 from TR-nbu/ZerosLike
|
4 years ago |
i-robot
|
6a3bb4f006
|
!30226 [MSLITE][Bug][Func]Fuzz test.
Merge pull request !30226 from wangshaocong/codex
|
4 years ago |
i-robot
|
b49da6cf95
|
!30505 [MSLITE][CPU] AVX512/256/SSE/NENO Advanced packaging, and Batchnorm Op Refactoring and optimization
Merge pull request !30505 from Greatpan/avx512_batchnorm
|
4 years ago |
i-robot
|
b3c5943bf8
|
!30373 [DynamicShape][GPU]add dynamic shape support of Concat op and its backward for new Network
Merge pull request !30373 from hanhuifeng/dcn_dyn_ops_1
|
4 years ago |
wangjun
|
24d448239c
|
add pynative_parallel
|
4 years ago |
NBUFabio
|
be3c885004
|
[feat] [assistant] [I48O8M] [I48O4B] add new array operator ZerosLike and OnesLike
|
4 years ago |
z00512249
|
0c28100825
|
fix cpu lu kernel codex && pclint-plus
|
4 years ago |
i-robot
|
ac1463a192
|
!30298 Package the cuda operators as a dynamic link library
Merge pull request !30298 from jinjiali-kali/cuda_ops
|
4 years ago |
yuchaojie
|
a3f8382fec
|
remove redundent nullptr-check in TransposedUpdateFusion
|
4 years ago |
i-robot
|
eab84bb7fb
|
!30457 [MS][LITE] fix fuzz bug
Merge pull request !30457 from jianghui58/codex_fuzz_master
|
4 years ago |
i-robot
|
66838d3117
|
!30501 fix the bug of GPU TopK kernel incorrect index
Merge pull request !30501 from zong_shuai/topk_debug_index
|
4 years ago |
i-robot
|
7f2367dddd
|
!28151 [assistant][ops] Add OneHot aicpu operator
Merge pull request !28151 from ZhengPingping/OneHot
|
4 years ago |
i-robot
|
5ff89d56c4
|
!30479 workspace
Merge pull request !30479 from TuDouNi/dynamic_shape_stage3
|
4 years ago |
wangshaocong
|
22ad253b48
|
[MSLITE] FuzzTest.
|
4 years ago |
Zwink
|
cceb9d0b81
|
Update OneHot operator
|
4 years ago |
zong-shuai
|
891f0f87d0
|
debug
|
4 years ago |
greatpanc
|
394a94697d
|
avx512 batchnorm op support
|
4 years ago |
ttudu
|
27f75b5ed8
|
fetch workspace
|
4 years ago |
wYann
|
57cb72e2b7
|
dynamic data sink on Ascend
|
4 years ago |
kswang
|
f26870d437
|
clean code
|
4 years ago |
hanhuifeng2020
|
662c51c019
|
[DynamicShape][GPU]add dynamic shape support of Concat and its backward for DCN
|
4 years ago |
i-robot
|
0ade79cb84
|
!30391 add julia cache and row-major api
Merge pull request !30391 from r1chardf1d0/master
|
4 years ago |
jianghui58
|
429f07a067
|
fix fuzz bug
|
4 years ago |
i-robot
|
243fb6bb0f
|
!29991 dynamic shape
Merge pull request !29991 from TuDouNi/dynamic_shape_stage3
|
4 years ago |
ttudu
|
451ebd1bd1
|
dynamic_shape
|
4 years ago |
i-robot
|
07e6d7608a
|
!28927 parallel calc rms for cpu adafactor
Merge pull request !28927 from kisnwang/add-cpu-adafactor
|
4 years ago |
i-robot
|
48024f5828
|
!30214 cpu profiling support multithread and add free timeline
Merge pull request !30214 from fangzehua/add_profi
|
4 years ago |