i-robot
|
8e496e44f0
|
!22320 support heterogeneous for pynative mode
Merge pull request !22320 from chujinjin/support_heterogeneous_for_pynative
|
4 years ago |
i-robot
|
f7a8b14d0c
|
!26994 update graphkernel cpu code for AkgParallelLaunch
Merge pull request !26994 from ZengZitao/fix_omp
|
4 years ago |
i-robot
|
c5fac5aba4
|
!26811 Support CSRTensor in while loop and subgraph
Merge pull request !26811 from 杨林枫/csr_in_while
|
4 years ago |
i-robot
|
e67f83fdf1
|
!26983 fix FusedMulAddN code trace print
Merge pull request !26983 from yuchaojie/ir_fusion
|
4 years ago |
i-robot
|
84d2e54871
|
!26959 TensorArray CPU
Merge pull request !26959 from VectorSL/tensor-array-cpu
|
4 years ago |
chujinjin
|
d20c9d93d8
|
support pynative heterogeneous
|
4 years ago |
VectorSL
|
cb3d25c8f0
|
add cpu tensor array
|
4 years ago |
i-robot
|
db5ef1136f
|
!26981 Control flow support function type of formal parameter.
Merge pull request !26981 from gaoyong10/runtime_second12
|
4 years ago |
zengzitao
|
28ab0a963a
|
fix omp num_threads by using get_max_threads
|
4 years ago |
i-robot
|
7c90ead837
|
!26975 Supporting start profiler in the middle of training.
Merge pull request !26975 from yuximiao/add_on-off
|
4 years ago |
gaoyong10
|
080aad5e1d
|
control flow support function type of formal parameter.
|
4 years ago |
i-robot
|
8cb8660051
|
!26970 [MS][LITE]support call parameter populate
Merge pull request !26970 from mengyuanli/populate_call_para
|
4 years ago |
yanglf1121
|
343b17c61f
|
support csr in while loop
|
4 years ago |
i-robot
|
acbaff17f7
|
!26960 [MSLITE][DEVELOP] fix bug of some op attr check
Merge pull request !26960 from yangruoqi713/master_fuzz
|
4 years ago |
i-robot
|
6daea6079b
|
!26906 Optimize DynamicRNNGrad fission when batch_size is multiple of 16 or t_size is 1
Merge pull request !26906 from yuchaojie/ir_fusion3
|
4 years ago |
yuchaojie
|
d3d1d5e97b
|
fix FusedMulAddN code trace print
|
4 years ago |
i-robot
|
f513e0bfdd
|
!26743 [GraphKernel] lite adapt for akg build.
Merge pull request !26743 from chenlei_autodiff/lite_callback
|
4 years ago |
yuximiao
|
e99c0a48e6
|
support start profiler in the minddle of training.
|
4 years ago |
mengyuanli
|
0da2c8d145
|
add call parameter and populate call
|
4 years ago |
i-robot
|
b38600c11a
|
!26895 optimizes the kernel error description of Split, Meshgrid, Select, etc.
Merge pull request !26895 from wangshuide/wsd_master
|
4 years ago |
i-robot
|
17fcbaae94
|
!26808 fix mindspore mac compile
Merge pull request !26808 from xulei/fix_mac_compile_components
|
4 years ago |
i-robot
|
8932dddfd9
|
!26871 fix output tensor num bug
Merge pull request !26871 from caifubi/master-pynative-lazy-build-bug
|
4 years ago |
i-robot
|
e195131342
|
!26891 move key world 'trace' to interface DumpSourceLines
Merge pull request !26891 from liubuyu/SBB
|
4 years ago |
i-robot
|
bfd190482f
|
!26842 Speed up random normal sampling
Merge pull request !26842 from zichun_ye/random_normal_speed_up
|
4 years ago |
yangruoqi713
|
2e57f87be2
|
[MSLITE][DEVELOP] fix bug of some op attr check
|
4 years ago |
lby
|
618d4b7dfe
|
move key 'trace' to interface DumpSourceLines
|
4 years ago |
i-robot
|
5f0f595d1b
|
!26907 [MS][LITE][develop] add fp16 kernel
Merge pull request !26907 from sunsuodong/add_fp16_kernel_b
|
4 years ago |
i-robot
|
7ee6e3060d
|
!26637 [MS][LITE][develop] add fp16 kernel
Merge pull request !26637 from sunsuodong/add_fp16_kernel
|
4 years ago |
i-robot
|
60bfef499f
|
!26854 fix avx512 compile option
Merge pull request !26854 from zhaosida/zsd_adam_simd
|
4 years ago |
sunsuodong
|
533a6574b2
|
add fp16 kernel
|
4 years ago |
i-robot
|
cdb618984f
|
!26832 Support ValueNode inputs json generation in CollectFusedJsonWithSingleKernel
Merge pull request !26832 from zichun_ye/akg_json_build
|
4 years ago |
sunsuodong
|
90b1318fcc
|
add fp16 kernel
|
4 years ago |
yuchaojie
|
805541a3a9
|
Optimize DynamicRNNGrad fission when batch_size is multiple of 16 or t_size is 1
|
4 years ago |
xulei
|
78b37afc30
|
fix mindspore mac compile
fix debugger, mindarmour, audio, gnn, grpc, ps/fl.
|
4 years ago |
tacyi139
|
bb935faca9
|
optimizes the kernel error description of Split, Meshgrid, Select, etc.
|
4 years ago |
chenlei_autodiff
|
54227cae81
|
[GraphKernel] lite adapt for akg build.
|
4 years ago |
i-robot
|
591e606de9
|
!26873 move graph_kernel_cluster to directory graph_kernel/core
Merge pull request !26873 from DeshiChen/1127_cluster
|
4 years ago |
i-robot
|
c1798df274
|
!26837 tag environment bugfix
Merge pull request !26837 from chenweifeng/tag-environment-bug-fix
|
4 years ago |
zhaosida
|
7de4c114b3
|
fix avx512 compile option
|
4 years ago |
i-robot
|
f85dea2959
|
!26662 fix lu kernel
Merge pull request !26662 from zhuzhongrui/gmres
|
4 years ago |
i-robot
|
2bd6deaeeb
|
!26682 GraphKernel For CPU
Merge pull request !26682 from ZengZitao/gk_cpu
|
4 years ago |
dayschan
|
4b19d483a3
|
move graph_kernel_cluster to directory graph_kernel/core
|
4 years ago |
caifubi
|
c32e63bfc8
|
output without monad
|
4 years ago |
i-robot
|
b1deeb425d
|
!26849 bind stream with handle
Merge pull request !26849 from zhujingxuan/master
|
4 years ago |
i-robot
|
8c5957c387
|
!26831 fix output tensor num bug
Merge pull request !26831 from caifubi/master-pynative-lazy-build-bug
|
4 years ago |
i-robot
|
16a2535205
|
!26820 optimizes the kernel error description of LSTM, Pad, ReLU, etc.
Merge pull request !26820 from wangshuide/wsd_master_new
|
4 years ago |
zengzitao
|
62458b5636
|
adapt graph kernel for cpu
|
4 years ago |
zhujingxuan
|
30c6fa7f9b
|
bind stream with handle
|
4 years ago |
wangshuide2020
|
6cbe8dd02e
|
optimizes the kernel error description of LSTM, Pad, ReLU, etc.
|
4 years ago |
z00512249
|
872d8b37da
|
fix lu factor and solve for cpu backend
|
4 years ago |