i-robot
c2c1fe0f86
!31377 code clean
Merge pull request !31377 from chenweifeng/code-clean
4 years ago
i-robot
3dd2aa584e
!31345 Clean code
Merge pull request !31345 from VectorSL/code_clean
4 years ago
i-robot
c3d15a079e
!31325 Training supports failover
Merge pull request !31325 from zyli2020/worker_failover_bp
4 years ago
i-robot
35d1782023
!31348 Clean Code
Merge pull request !31348 from tanghuikang/clean_code
4 years ago
wilfChen
94837c8c38
code warning clean
4 years ago
lizhenyu
545707b80c
Training supports failover
4 years ago
i-robot
f52272ae34
!31369 Fix code check 2.0 issue
Merge pull request !31369 from ZPaC/static
4 years ago
tanghuikang
dbcdcc2daa
Clean code
4 years ago
i-robot
34f321d946
!31300 [DynamicShape][Gpu] Support dynamic shape for the DCN network
Merge pull request !31300 from hanhuifeng/dyn_gpu_bugs
4 years ago
ZPaC
42bdf2bc5b
Fix code check 2.0 issue
4 years ago
i-robot
c9cb729cd3
!31130 fix bug of fake quant perchannel
Merge pull request !31130 from cjh9368/clean_static_error_master
4 years ago
i-robot
4f68c93a8b
!31155 clean code
Merge pull request !31155 from zhupuxu/clean_master
4 years ago
VectorSL
83882d4712
clean code
4 years ago
hanhuifeng2020
089ce0386a
[DynamicShape][Gpu] Support dynamic shape for the DCN network
4 years ago
buxue
b7f90e2b16
implement Conv2dBackward and log of pool
4 years ago
i-robot
4b15a46a49
!30858 Refactor cpu kernel factory.
Merge pull request !30858 from TronZhang/refactor_cpu_factory
4 years ago
i-robot
f604a4e1a8
!31023 Fix the bug of returning an error in mirror pad when input_shape is less than 2 or more than 4
Merge pull request !31023 from zong_shuai/mirror_pad
4 years ago
TronZhang
db2931516e
refactor cpu factory
4 years ago
zhupuxu
2ba04fb7e0
clean code
Signed-off-by: zhupuxu <zhupuxu@huawei.com>
4 years ago
Parastoo Ashtari
c61e56cbe2
fix kernel by kernel and mindRT parameter bug
4 years ago
cjh9368
1e4cde75f6
[MS][LITE] fix bug of fakequant perchannel
4 years ago
gaoyong10
a94285147d
Dynamic shape supports control flow.
4 years ago
i-robot
d2c23394d8
!30378 replace set/get shape vector
Merge pull request !30378 from zhupuxu/replace_set_and_get
4 years ago
i-robot
c04faad4ad
!30559 modify gpu opt adapter dynamic shape
Merge pull request !30559 from changzherui/mod_gpu_opt
4 years ago
changzherui
47ef477fef
modify gpu opt adapter dynamic shape
4 years ago
i-robot
df55f87899
!31029 fix attr length bug of Conv2DBackpropFilter
Merge pull request !31029 from zhangbuxue/fix_bug_of_conv2dbackpropfliter
4 years ago
zhupuxu
d0d7c26b47
set min/max shape
Signed-off-by: zhupuxu <zhupuxu@huawei.com>
4 years ago
buxue
c24e9c3e93
fix bug of Conv2DBackpropFilter
4 years ago
i-robot
9075e36081
!30980 remove duplicate op impl
Merge pull request !30980 from zhujingxuan/master
4 years ago
i-robot
2ed51f3cc7
!30660 Add host_shape to CreateDeviceAddress API Input
Merge pull request !30660 from jiaorui/host_shape
4 years ago
zhujingxuan
45e7c257af
remove duplicate op impl
matrix_triangular_solve_gpu_kernel and trsm_solve_gpu_kernel are actually the same.
4 years ago
buxue
50f3ad00ab
develop maxpool3d, avgpool3d and conv3d on CPU (including forward and backward)
4 years ago
zong-shuai
cdcbd9aeb0
debug
4 years ago
jiaorui
ca3a273756
add host_shape to CreateDeviceAddress api
4 years ago
i-robot
e537fb3a61
!30820 Support reference for gpu and cpu for parameter update
Merge pull request !30820 from ZPaC/parse-label-for-parameter-update
4 years ago
i-robot
b90cf43562
!30553 Support dataset reset() to recover after failure
Merge pull request !30553 from h.farahat/reset
4 years ago
ZPaC
04ee8f4dd9
Support reference for gpu and cpu for parameter update
4 years ago
h.farahat
a3dec34833
Dataset failover reset
4 years ago
He Wei
67c528e7fc
Reduce object size of node
1. Remove unused fields, such as AnfNode::interpreted_node_, CNode::inputs_value_;
2. Reduce seen_ size from 64 bits to 32 bits;
3. Use bitset for boolean flags;
4. Use pointer for user data;
5. Store some cold fields as user data, such as AnfNode::kernel_info_.
Object size decreased:
AnfNode: 232 -> 144
ValueNode: 264 -> 176
Parameter: 384 -> 272
CNode: 576 -> 400
4 years ago
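The size-reduction steps listed in the commit above (a narrower marker field, a bitset for boolean flags, user data behind a pointer, cold fields moved into user data) can be illustrated with a minimal sketch. It is not the MindSpore code: FatNode, CompactNode, NodeFlag, UserDataMap and every field name are hypothetical, and the exact sizes depend on the compiler and standard library, so the sketch only relies on the compact layout being smaller.

```cpp
#include <bitset>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <memory>
#include <string>
#include <unordered_map>
#include <utility>

// All names below are hypothetical; they are not the MindSpore classes.
using UserDataMap = std::unordered_map<std::string, std::shared_ptr<void>>;

// "Before": 64-bit marker, one bool per flag, cold data stored inline.
struct FatNode {
  std::uint64_t seen = 0;
  bool is_parameter = false;
  bool has_default = false;
  bool is_isolated = false;
  UserDataMap user_data;              // a full map object in every node
  std::shared_ptr<void> kernel_info;  // rarely used, yet always present
};

// Flag positions for the packed bitset in CompactNode.
enum NodeFlag : std::size_t { kIsParameter = 0, kHasDefault = 1, kIsIsolated = 2 };

// "After": 32-bit marker, packed flags, user data behind a lazily created pointer.
struct CompactNode {
  std::uint32_t seen = 0;
  std::bitset<32> flags;
  std::unique_ptr<UserDataMap> user_data;  // null until first use

  // Cold fields (for example kernel info) are stored under a string key.
  void set_user_data(const std::string &key, std::shared_ptr<void> value) {
    if (!user_data) {
      user_data = std::make_unique<UserDataMap>();
    }
    (*user_data)[key] = std::move(value);
  }

  // Returns nullptr when the key, or the whole map, is absent.
  std::shared_ptr<void> get_user_data(const std::string &key) const {
    if (!user_data) {
      return nullptr;
    }
    auto it = user_data->find(key);
    if (it == user_data->end()) {
      return nullptr;
    }
    return it->second;
  }
};
```

The trade-off in this sketch is that a node which never touches its cold data pays only for one null pointer, while a node that does use it pays an extra allocation and a map lookup.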
hanhuifeng2020
b571e67795
[DynamicShape][GPU] Add dynamic shape support for some ops for new network
4 years ago
i-robot
ff49911889
!30572 delete CpuDynamicshape and GpuDynamicshape
Merge pull request !30572 from TuDouNi/gpu_cpu_delete_dynamic_kernel
4 years ago
ttudu
dac6f5425d
delete GPU/CPUDynamicKernel
4 years ago
Gaoxiong
8745e4d3f1
GraphKernel performance optimization: cache kernel addr in KernelMod
4 years ago
ckey_Dou
e9679ca0bd
return default shape when max_shape is empty
4 years ago
zhoufeng
f49b195c39
extract common as an independent shared library
Signed-off-by: zhoufeng <zhoufeng54@huawei.com>
4 years ago
i-robot
dfc6cbb6df
!30597 fix codedex warnings
Merge pull request !30597 from zyli2020/master
4 years ago
i-robot
6c8f942737
!30544 fix codecheck
Merge pull request !30544 from xulei/fix_codecheck
4 years ago
jinjiali
2c47b286dd
correct cublas path in cuda_ops cmakelists
4 years ago
lizhenyu
2bd2f8cfca
fix codedex warnings
4 years ago
xulei
4cf320cdbd
fix codecheck
4 years ago