ZPaC
78a79a9b5e
Fix comm helper method
4 years ago
lizhenyu
b307016fd1
refine log of kernel select
4 years ago
i-robot
520fe19b27
!27064 Reimply tensorarray
Merge pull request !27064 from VectorSL/reimpy-tensorarray
4 years ago
ZPaC
ae3bae1571
Replace OpenMPI
4 years ago
VectorSL
20b38e880b
update tensor-array
4 years ago
VectorSL
7c7cd34276
move cpu register into cpp kernel
4 years ago
VectorSL
cb3d25c8f0
add cpu tensor array
4 years ago
i-robot
745c1eaff8
!26869 1.Purge not used API. 2.Adapt for collective_init.h
Merge pull request !26869 from ZPaC/dir-of-distributed
4 years ago
ZPaC
2b7429c5d2
1.Purge not used API.
2.Adapt for collective_init.h
4 years ago
wangshuide2020
6cbe8dd02e
optimizes the kernel error description of LSTM, Pad, ReLU, etc.
4 years ago
i-robot
d66f811022
!26751 optimizes the kernel error description of Adagrad, Adam, Conv2d, etc.
Merge pull request !26751 from wangshuide/wsd_master
4 years ago
wangshuide2020
674e3aa9d6
optimizes the kernel error description of Adagrad, Adam, Conv2d, etc.
4 years ago
VectorSL
710289a72d
add tensor array
4 years ago
VectorSL
8160fba758
update cublas error string
4 years ago
i-robot
21ffa1fb7b
!25091 Partial support for multi root graph in online debugger
Merge pull request !25091 from parastooashtari/online_multi_root_graph
4 years ago
sabrinasun
e7d7476a8e
fix dynamic shape dump issue and apply comments from cell dump pr
4 years ago
Parastoo Ashtari
7f682ba2f6
partial support for multi root graph in online debugger
4 years ago
ZPaC
611de83fd8
Fix dynamic load error
4 years ago
i-robot
8072e6d7f7
!26062 add Custom, custom_op_info_register, CustomRegOp to __init__
Merge pull request !26062 from looop5/custom_init_commit
4 years ago
looop5
58e27d87bc
add Custom, custom_op_info_register, CustomRegOp to __init__
4 years ago
Liu_Xuu
255e2c03b4
[MSLITE] add nccl and mpi distribution in tensorrt delegate 1111_05
4 years ago
dayschan
cbb84ff580
Move IsRealKernel and IsRealCNodeKernel from AnfAlgo to AnfUtils
The functions IsOneOfPrimitive and IsOneOfPrimitiveCNode are useful,
so we can move them into anf.cc.
4 years ago
i-robot
11bec4d85e
!25995 Add nvidia collective lib implementation.
Merge pull request !25995 from ZPaC/dir-of-distributed
4 years ago
ZPaC
9e18bad126
Add nvidia collective lib implementation.
4 years ago
looop5
b89d744e80
Custom op supports no reg info
4 years ago
LaiYongqiang
7f251e3f08
add attr kAttrSkipNopOpAddr for nop node hidden in execution order
4 years ago
dayschan
6a26d7f6d9
Move TypeId2String from kernel_compiler/ to ir/dtype_extends.cc
Renamed the function to "TypeIdToString" and used the Type::ToString() function
instead of a TypeId-to-string map.
Changed DtypeToTypeId as well, so the original StringToType can be used.
Added a new interface, StringToTypeId.
4 years ago
i-robot
19b04d3ff3
!24074 Support AOT Operator for GPU/CPU Backend
Merge pull request !24074 from jiaoy1224/pyfunc
4 years ago
Yang Jiao
40b648b873
add aot
4 years ago
ZPaC
4c1ef4cef6
Fix ps cache broadcast error.
4 years ago
i-robot
b271aa7a25
!24969 device address add the key of device
Merge pull request !24969 from limingqi107/new_actor_runtime
4 years ago
limingqi107
be100476d6
device address add the key of device
4 years ago
looop5
46789f260a
Custom operator supports tbe dsl
4 years ago
i-robot
c5585a9451
!24467 Add GraphKernelFlags into namespace mindspore::graphkernel
Merge pull request !24467 from DeshiChen/0918_namespace
4 years ago
i-robot
59b6125fe7
!24500 add ascend memory adapter for ascend memory management
Merge pull request !24500 from laiyongqiang/memory_code_check
4 years ago
dayschan
7502345c8f
Add GraphKernelFlags into namespace mindspore::graphkernel
4 years ago
LaiYongqiang
79599546ee
add ascend memory adapter for ascend memory management
4 years ago
TinaMengtingZhang
f84b27b444
sync code self check from dev 1.1
4 years ago
i-robot
811dd1778b
!23959 fix link error if compiled with -s on when import mindspore
Merge pull request !23959 from yanghaitao/yht_sync_shangyong
4 years ago
i-robot
2138dd1d70
!24136 cherry-pick code from enterprise-r1.0
Merge pull request !24136 from caifubi/master-code-from-enterprise
4 years ago
tanghuikang
3035f9d367
Use one struct instead of three vectors for kernel input, output and workspace
4 years ago
caifubi
8dc2d1b84c
Cherry-pick code from enterprise
4 years ago
lvchangquan
e8264a135d
fix some warnings in master
4 years ago
Margaret_wangrui
10a5ab7f54
Remove StandardNormal side_effect_mem mark, and fix gpu kernel select bug
4 years ago
yanghaitao1
7f216aac41
fix link error if import mindspore
4 years ago
wangshuide2020
e06beb2ed4
add validation of vector size and non-zero validation of denominator for nn gpu operators.
4 years ago
i-robot
e87f0be14f
!23707 fix summary isolation
Merge pull request !23707 from jiangshuqiang/temp
4 years ago
i-robot
38bab297ec
!23732 Fix code self-check issues for online dbg in master
Merge pull request !23732 from TinaMengtingZhang/code_self_check_master
4 years ago
i-robot
1cd9ffb5e2
!23687 can not appear exception in the atexit
Merge pull request !23687 from limingqi107/bug_fix
4 years ago
jiangshuqiang
c2bd061889
fix summary isolation
4 years ago