yuximiao
e99c0a48e6
support start profiler in the minddle of training.
4 years ago
casgj
ce3916700d
cleancode in profiler.
4 years ago
i-robot
ce00ee1ad1
!25367 use acl api to control profiling
Merge pull request !25367 from yanghaitao/yht_condation_start_profiler
4 years ago
yanghaitao1
c94aa6b872
use profiler acl api instead
4 years ago
mohammad
5c8ab5f60c
add MD Profiler Save()
4 years ago
Cathy Wong
cdf3126459
MD Profiler Start Stop Support
4 years ago
gaojing
4a4299d9c3
fix the error msg not clear when profiler path have chinese.
4 years ago
ougongchang
34f6b50b17
Fix get profiling job id fail when there are more than one JOB dirs in profiler output path
4 years ago
huangbingjian
d7e97dd74a
change EXCEPTION level to CRITICAL level
4 years ago
casgj
bf3dac8461
Fix profiler calling analysis twice.
4 years ago
gaojing
e4b5d77b8e
fix the wrong value of average flops.
4 years ago
casgj
92052be1b6
fix the error message that profiler does not support pynative mode.
4 years ago
gaojing
b78a13dd78
fix the error message that profiler does not support pynative mode.
4 years ago
buxue
5418a45752
support cpu profiling and code check for cpu ops
4 years ago
i-robot
bc521674cc
!23473 modify the master build alarm
Merge pull request !23473 from zangqx/profiling_gpu_permission
4 years ago
臧庆香
ec03a76c66
modify the master build alarm
4 years ago
i-robot
dc40225638
!23400 remove profiler if compiled with -s on
Merge pull request !23400 from yanghaitao/yht_remove_profiler
4 years ago
yanghaitao1
177f3f75bf
remove profiler if compiled with -s on
4 years ago
i-robot
d7388b40ab
!23295 add dump and profiling warning log when task is not sink
Merge pull request !23295 from baihuawei/fixlog
4 years ago
baihuawei
e1e11b9a47
fix some bugs
4 years ago
zyhStack
55170ef7d9
Fix the problem with the default on communication performance switch
4 years ago
yanghaitao1
83302ad23c
remove profiling parameters in set_context function
4 years ago
gaojing
4f41b868bb
Raise error information for the case that the Profiler output path was not exists
4 years ago
i-robot
7d6ff9d098
!23009 Modify the judge of multi devices training logic
Merge pull request !23009 from 张毅辉/Judge_multi_devices_training_logic
4 years ago
zhangyihui
3a5b1f83f3
Modify and judge multi card training logic
4 years ago
zhangyihui
6a36171a0e
transform device_id to rank_id for cpu_profiler
4 years ago
i-robot
5f9e9d96ec
!22542 Bugfix for scope-level flops data cannot display
Merge pull request !22542 from gzhcv/CommunicationOpNotOverlapped
4 years ago
gzhcv
edb1b4798e
Bugfix for scope-level flops data cannot display
4 years ago
yanghaitao1
c581f9fb88
fix import 'Model' error for Profiler api. Should from 'mindspore' but not 'mindspore.train'
4 years ago
zhangyihui
283f0c73cd
Make hccl_info folder with rank_id suffix
4 years ago
zhangyihui
3e5cb3b506
fix bugs for device_id_to_rank_id
4 years ago
i-robot
6301361570
!22249 Analysis of overlapping time of communication operator and computation operator
Merge pull request !22249 from gzhcv/CommunicationOpNotOverlapped
4 years ago
zhangyihui
3d19949eb4
device_id to rank_id
4 years ago
i-robot
ad4b85e125
!21929 Fix code check.
Merge pull request !21929 from yuximiao/fix_static
4 years ago
gzhcv
2c99884d83
Add cluster bottleneck analyse feature
5 years ago
ms_yan
36a8886ca2
Revert "[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset"
This reverts commit b077aa1cab .
Revert "[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset"
This reverts commit 4e6f7dc97d .
delete pass_registry_test.cc
comment hiai_nlu_model_multi.pb related line
4 years ago
djc
b077aa1cab
[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset
4 years ago
djc
4e6f7dc97d
[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset
4 years ago
yuximiao
fd44b00f0a
fix code check
4 years ago
Cathy Wong
a2cbd4b5fa
MD Profiling Analyze: Search for device trace file
MinddataProfilingAnalyzer() - remove device_target input parm
4 years ago
i-robot
ec1cde222e
!21120 Make the parameter profile_communication effective only in mutil devices training
Merge pull request !21120 from 张毅辉/profile_communication
4 years ago
zhangyihui
110d7204a3
Make the parameter profile_communication effective only in multi card training
4 years ago
gaojing
310841bd51
profiler cleancode
4 years ago
zhangyihui
0362673ba4
Fix the bug that when output_path is set to None an err will report
4 years ago
zhangyihui
63a7add107
Environment variables are mainly set for the convenience of cloud profiler
4 years ago
gzhcv
001985ca25
Fit Gpu LoopCount for profiler module
4 years ago
yanghaitao1
09aa4cf4d8
add memory profiling option to profiler
4 years ago
mindspore-ci-bot
1c13e935ad
!17558 Cluster slow network query
From: @zyhstack
Reviewed-by:
Signed-off-by:
4 years ago
zhangyihui
3eab41bc36
cluster profiler for slow net
5 years ago
mindspore-ci-bot
7666f97f58
!17126 Add AICORE Flops profiling
From: @gzhcv
Reviewed-by:
Signed-off-by:
4 years ago