i-robot
e537fb3a61
!30820 Support reference for gpu and cpu for parameter update
Merge pull request !30820 from ZPaC/parse-label-for-parameter-update
4 years ago
i-robot
b90cf43562
!30553 Support dataset reset() to recover after failure
Merge pull request !30553 from h.farahat/reset
4 years ago
ZPaC
04ee8f4dd9
Support reference for gpu and cpu for parameter update
4 years ago
h.farahat
a3dec34833
Dataset failover reset
4 years ago
He Wei
67c528e7fc
Reduce object size of node
1. Remove unused fields, such as AnfNode::interpreted_node_, CNode::inputs_value_;
2. Reduce seen_ size from 64 bits to 32 bits;
3. Use bitset for boolean flags;
4. Use pointer for user data;
5. Make some cold fields as user data, such as AnfNode::kernel_info_.
Object size decreased:
AnfNode: 232 -> 144
ValueNode: 264 -> 176
Parameter: 384 -> 272
CNode: 576 -> 400
4 years ago
hanhuifeng2020
b571e67795
[DynamicShape][GPU]add dynamic shape support some ops for new network
4 years ago
i-robot
ff49911889
!30572 delete CpuDynamicshape and GpuDynamicshape
Merge pull request !30572 from TuDouNi/gpu_cpu_delete_dynamic_kernel
4 years ago
ttudu
dac6f5425d
delete GPU/CPUDynamicKernel
4 years ago
Gaoxiong
8745e4d3f1
GraphKernel performance optimize: cache kernel addr in KernelMod
4 years ago
ckey_Dou
e9679ca0bd
return default shape when max_shape is empty
4 years ago
zhoufeng
f49b195c39
extract common as an independent shared library
Signed-off-by: zhoufeng <zhoufeng54@huawei.com>
4 years ago
i-robot
dfc6cbb6df
!30597 fix codedex warnings
Merge pull request !30597 from zyli2020/master
4 years ago
i-robot
6c8f942737
!30544 fix codecheck
Merge pull request !30544 from xulei/fix_codecheck
4 years ago
jinjiali
2c47b286dd
correct cublas path in cuda_ops cmakelists
4 years ago
lizhenyu
2bd2f8cfca
fix codedex warnings
4 years ago
xulei
4cf320cdbd
fix codecheck
4 years ago
i-robot
b3c5943bf8
!30373 [DynamicShape][GPU]add dynamic shape support of Concat op and its backward for new Network
Merge pull request !30373 from hanhuifeng/dcn_dyn_ops_1
4 years ago
i-robot
ac1463a192
!30298 Package the cuda operators as a dynamic link library
Merge pull request !30298 from jinjiali-kali/cuda_ops
4 years ago
zong-shuai
891f0f87d0
debug
4 years ago
hanhuifeng2020
662c51c019
[DynamicShape][GPU]add dynamic shape support of Concat and its backward for DCN
4 years ago
i-robot
2f09bcab3d
!30358 add mul dtype register to support bool && uint_8_t
Merge pull request !30358 from zhuzhongrui/pub_master2
4 years ago
jinjiali
b2d1bad62c
package the cuda operations as a dynamic link library
4 years ago
i-robot
6f88bf7c75
!30324 rename scipy ops matrix_diag_part to matrix_diag_part_V3
Merge pull request !30324 from wuwenbing/master
4 years ago
z00512249
d1e2dfcbd1
add mul dtype register to support bool && uint_8
4 years ago
i-robot
a5242d33e1
!30181 support TensorShape ops
Merge pull request !30181 from lianliguang/change-dynshape-to-tensorshape
4 years ago
wenbean
638ba5b4d1
Fix ops matrix_diag_part different version interplay infershap between ascend and gpu/cpu
4 years ago
lianliguang
5c8af66e53
support TensorShape ops to instead of DynamicShape ops
4 years ago
zhujingxuan
911e4660ce
Rename matrix_triangular_solve to be compatible with tensorflow's MatrixTriangularSolve OP
4 years ago
yanghaoran
739c4b236a
回退 'Pull Request !30120 : Package the cuda operators as a dynamic link library'
4 years ago
i-robot
8e21ffae65
!30120 Package the cuda operators as a dynamic link library
Merge pull request !30120 from jinjiali-kali/cuda_ops_3
4 years ago
i-robot
0efb23fd96
!30121 Optimize eigh ops, backend suppor t 1/2 output,
Merge pull request !30121 from wuwenbing/master
4 years ago
jinjiali
cd7db0f57f
package the cuda operations as a dynamic link library
4 years ago
i-robot
cc2dd686da
!30111 Add OWNERS files for ccsrc subdirectories.
Merge pull request !30111 from marui/master
4 years ago
wenbean
ba3b65b9af
Optimize eigh, backend support 1 or 2 output for GPU/CPU
4 years ago
marui
1d529f2ff3
Add OWNERS files for ccsrc subdirectories
4 years ago
changzherui
83e6ea3d71
add gpu ResetResource
4 years ago
i-robot
705134e01a
!29609 update tensorarray stack for static size
Merge pull request !29609 from VectorSL/static_tensorarray_stack
4 years ago
wenbean
fddedb03c8
Matrix ops move to arrays dir, use scure func
4 years ago
VectorSL
8b45b00f7c
update tensorarry stack for static usage
4 years ago
marui
d47e79b04c
Refactor ccsrc directories and CMakeLists files
4 years ago