mindspore-ci-bot
8c909a6ccd
!722 validate train_id before getting dataset_graph
Merge pull request !722 from luopengting/fix_lineage
5 years ago
luopengting
3eb45eff84
validate train_id before getting dataset_graph
5 years ago
wangshuide2020
0dd6875c43
fix the problem of the redundant code.
5 years ago
luopengting
b9c978830a
clean redundant code after removing lineage public APIs:
1. clean redundant code in lineage parsing and querier
2. delete get_summary_lineage()
3. modify related st and ut to use data_manager
5 years ago
liangyongxiong
6f6be0f512
return unknown state on mindinsight startup when gunicorn error.log is flooded by exceptional process
5 years ago
luopengting
401466e338
place the user-defined parameters behind, if two types of parameters have the same 'importance'
5 years ago
ougongchang
0e0082ef9c
copy security headers to avoid the global varibale is updated
5 years ago
luopengting
6d0c75ae99
sort the returned data in descending order of 'importance' and ascending order of 'name'
5 years ago
mindspore-ci-bot
3291effb9b
!638 optimize some names of variables and functions, and level of some log
Merge pull request !638 from luopengting/optimizer_open
5 years ago
luopengting
b0d3da8415
modify names of some variables and functions, modify some level of log
5 years ago
yelihua
53a4d914f4
fix the redundant code and change log level
5 years ago
yelihua
e57f07bef5
fix the bug for tensor history
5 years ago
Li Hongzhang
c2210200fc
redefine reload interval
5 years ago
mindspore-ci-bot
06a404bdf3
!608 add gpu timeline restful
Merge pull request !608 from gzhcv/bugfix
5 years ago
mindspore-ci-bot
5e6932f0f6
!601 Add the summary loading switch mechanism
Merge pull request !601 from LiHongzhang/fix_caching
5 years ago
yelihua
50e1400505
add debugger module
5 years ago
Li Hongzhang
990800239b
add summary loading switch mechanism
5 years ago
gzhcv
2d5a9a5c7f
add gpu timeline restful
5 years ago
luopengting
b274c774ed
add optimizer:
1. add functions for calculating target buckets and params importances
2. add restful api for target buckets, importances and metadata
5 years ago
Li Hongzhang
f72b4a0a1b
remove sysmetric
5 years ago
Li Hongzhang
4b1524883a
remove lineage public APIs
5 years ago
wangshuide2020
9daf2ae128
kill children processes of worker before worker has been killed by gunicorn master.
5 years ago
wangshuide2020
d3b65356df
remove redundant data to save memory and simplify the tensorcontainer.
5 years ago
ougongchang
c1c5a0fcf1
fix the pylint error in run.py
5 years ago
mindspore-ci-bot
d8cbf988a0
!525 Add gpu profiler.
Merge pull request !525 from yuximiao/yuximiao_gpu_profiler
5 years ago
yuximiao
1460ab4ab1
gpu profiler
5 years ago
ougongchang
e13cceb0ce
set stdout to None and set console output to False when start mindinsight in subprocess
If the env or gunicorn has some error, there will not be any error in console
or error log file when we set the subprocss stdout to devnull.
So I set it to None it will inhert its parent process stdout.
When gunicorn cause error, it will show error in console.
5 years ago
Li Hongzhang
41e47d225d
optional print ssh server ip
5 years ago
Li Hongzhang
6bb9c11cd9
show all accessible ip addresses when started
5 years ago
wangshuide2020
e66e41006c
1. add the limitation of the number of tag in tensor visualization; 2. update the max step per tensor tag to 20; 3. support query one train_job in the interface of train_jobs.
5 years ago
Li Hongzhang
3da4d71dff
add the resource monitor api
- collect_cpu
- collect_mem
- collect_npu
5 years ago
wangshuide2020
7877f33b70
Use multiple processes to calc events.
1. To accelerate summary file parsing, multiple processes are used. As the first step to mindinsight parsing performance optimization, we only made changes to _load_single_file function.
2. This PR will imporve summary parsing throughput dramatically (about cpu_count times)
3. Changes are mainly about _load_single_file function
In the future, a more global concurrent computing framework is needed for mindinsight. See the gitee wiki doc for details.
5 years ago
wangshuide2020
e8ffeb70ef
Support tensor visualization. 1.Tensor display in a table, it can support no more than two dimensions tensor visualization; 2.Tensor histogram visualization for all step in cache.
5 years ago
zhangyunshu
5871887c0b
profiler: fixed error msg in timeline rest api
5 years ago
mindspore-ci-bot
449929adac
!370 update the display name of aicore detail execution time
Merge pull request !370 from chenchao99/profiler_analyser
5 years ago
mindspore-ci-bot
e0f08cf394
!374 Profiler: fixed the issue of using unverified dir path
Merge pull request !374 from zhangyunshu/zys_timeline_fixed_fuzz
5 years ago
chenchao99
0d8808e5b3
update the display name of aicore detail execution time
5 years ago
zhangyunshu
a5c474c211
profiler: fixed timeline fuzz issue
5 years ago
askmiao
21eda46fa3
use the custom response
5 years ago
mindspore-ci-bot
6f953c0cfe
!282 add path_prefix argument for start command
Merge pull request !282 from liangyongxiong/path-prefix
5 years ago
liangyongxiong
f2f80b177a
add path_prefix argument for start command
5 years ago
yuximiao
70857c7b5f
adapt profiling source path change
5 years ago
yelihua
15065c1095
enable to show the fp and bp point
5 years ago
yuximiao
98b5030b3b
add minddata proposer
5 years ago
mindspore-ci-bot
3956a90a39
!287 Change the method to find the step trace files.
Merge pull request !287 from yelihua/dev_profiler
5 years ago
yelihua
82ed137d4f
change the method to find step trace files
5 years ago
zhangyunshu
50b4b0b97a
Add new feature for Profiler:
Collect timeline information after training finished.
Support timeline display on web UI.
5 years ago
root
449bec1fa2
featrue for proposer
5 years ago
chenchao99
ee3f241205
add feature of minddata pipeline stage analysing function
5 years ago
mindspore-ci-bot
d200233c48
!265 Add Dataset profiling.
Merge pull request !265 from yuximiao/master
5 years ago