qujianwei
7e1aebc2c9
delete ut libgraph.so and liberror_manager.so dependency
4 years ago
shenwei41
ff75314719
upgrade ascend 20220211
4 years ago
marui
d47e79b04c
Refactor ccsrc directories and CMakeLists files
4 years ago
i-robot
f2466fbff2
!29443 update ascend stream assign && add PROF log info
Merge pull request !29443 from lyqlola/master
4 years ago
liyiqi
adb33a15b5
update ascend stream assign && add PROF log info
4 years ago
yujianfeng
dcd6f9e491
Support compile cache in ps mode
4 years ago
i-robot
e4438f3028
!28310 dynamic_kernel_mod
Merge pull request !28310 from TuDouNi/dynamic_shape_stage1
4 years ago
ttudu
9373679c04
dynamic_kernel_mod
4 years ago
caifubi
49328ae86d
Modify Hccl Error Log
4 years ago
yuximiao
393d3621a3
fix no paralle stratege if start profiler in the process of trainging
4 years ago
l00591931
62e474aaf1
Enable mindir for layout
4 years ago
i-robot
4252b24335
!26792 malloc ts memory for label
Merge pull request !26792 from zhoufeng/change-label-memory-type
4 years ago
zhoufeng
881179fa10
malloc ts memory for label
Signed-off-by: zhoufeng <zhoufeng54@huawei.com>
4 years ago
i-robot
3d0f9d8aae
!26683 Enable compile cache feature to load hyper parameter data from python
Merge pull request !26683 from LiangZhibo/mindir
4 years ago
i-robot
cfc6ea32ff
!24714 replace rtmemcpyxx to acl memcpy
Merge pull request !24714 from jjfeing/br_replace_rtmemcpyxx_with_acl_api
4 years ago
l00591931
21df240f23
Enable mindir to load initialize weight from python
4 years ago
jjfeing
05485d991c
replace api with acl api
4 years ago
i-robot
ce00ee1ad1
!25367 use acl api to control profiling
Merge pull request !25367 from yanghaitao/yht_condation_start_profiler
4 years ago
i-robot
9d6248194e
!26310 MindSpore support load custom aicpu kernels.
Merge pull request !26310 from linqingke/aicpu
4 years ago
yanghaitao1
c94aa6b872
use profiler acl api instead
4 years ago
linqingke
bef2923acf
MindSpore support load custom aicpu ops.
4 years ago
yao_yf
501b978d16
find data parallel common group in auto parallel
4 years ago
ougongchang
9229f1c1ff
profiler support to collect parallel strategy info
If SetNodeOutputType functions forcibly splits into multiple functions, the readability decreases, so it blocks lizard scans
4 years ago
baihuawei
e59d07899b
fix reset8p pynative performance
4 years ago
LaiYongqiang
dc7988f4bd
log improvement
4 years ago
i-robot
cb307e24cf
!25153 refactor device loop control
Merge pull request !25153 from laiyongqiang/adjust_kernel_refactory
4 years ago
LaiYongqiang
9bfb2d99fa
refactor device loop control
4 years ago
lby
6872e67131
split compile ang gen kernel mod
4 years ago
i-robot
e920a1c07e
!24593 Continue execution when saving and loading mindir failed
Merge pull request !24593 from YuJianfeng/master
4 years ago
yujianfeng
d384db6c01
Continue execution when saving and loading mindir failed
4 years ago
lby
3e9fd763c3
delete old build process
4 years ago
caifubi
f092e623e0
Compile isolation for Profiling and Dump
4 years ago
i-robot
2c692bf7de
!22450 insert the overflow check operators according to the "gradients" scope name.
Merge pull request !22450 from guoqi/overflow-check-master
4 years ago
guoqi
8fccec4c20
insert overflow check operaters according to the 'gradients' scope
4 years ago
gaojing
fa02606348
step train modified
4 years ago
baihuawei
a9694a9230
ascend add nontask sink mode
4 years ago
ms_yan
36a8886ca2
Revert "[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset"
This reverts commit b077aa1cab .
Revert "[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset"
This reverts commit 4e6f7dc97d .
delete pass_registry_test.cc
comment hiai_nlu_model_multi.pb related line
4 years ago
djc
b077aa1cab
[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset
4 years ago
djc
4e6f7dc97d
[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset
4 years ago
yanghaitao1
8fc11cb676
adapt delete libms_profiler_fwk.a
4 years ago
caifubi
dfe0e94466
Fix PyNative get_rank_id/get_rank_size
4 years ago
i-robot
6afcd815d2
!21362 add pynative profiling codes based on ascend and gpu
Merge pull request !21362 from lvchangquan/profiling_refactor
4 years ago
zhoufeng
03a56f2bb0
alltoall exception handle
Signed-off-by: zhoufeng <zhoufeng54@huawei.com>
4 years ago
lby
a5029f061c
ascend kernel build refactory
4 years ago
lvchangquan
e8d9803258
add profiling codes based on ascend and gpu in pynative mode
4 years ago
lby
e6cdf098db
op tiling compute interface replace
4 years ago
baihuawei
41de02a58c
ascend support nontask sink
4 years ago
i-robot
69c5021bb5
!20995 pyfunc cpu kernel
Merge pull request !20995 from chenweifeng/cpu-dynamic-input
4 years ago
wilfChen
d6fffdad6e
support dynamic inputs & outputs
4 years ago
yanghaoran
0364650eae
Upgrade Ascend packages 28 Jul 21, with testcases removed
4 years ago