ougongchang
e93365c664
Add a note that summary only supports Linux systems
5 years ago
Li Hongzhang
869ca261bc
rm unused params for SummaryRecord
5 years ago
Li Hongzhang
9050f2ad64
forkserver multiprocessing context
5 years ago
wanyiming
3d354d76fd
mod_callback
5 years ago
Li Hongzhang
de43c11e2e
fix several issues
- handle collection for multiple trains
- how many tensors to collect when sunk
- change loglevel for get_learning_rate
- update calculation of `max_file_size`
- fix how collect_tensor_freq is counted
5 years ago
Li Hongzhang
fd03ed8341
fix not-exit issue and docs issue
- fix writer pool not exiting when max_file_size is too small
- fix API docs to illustrate `collect_tensor_freq` and `max_file_size`
5 years ago
Li Hongzhang
05dd17687a
max_file_size include metadata length and drop last step
5 years ago
Li Hongzhang
88dcd90889
limit summary to avoid exhausting the disk
5 years ago
Li Hongzhang
89462e9c3b
check disk space before writing and remove unused mode value
5 years ago
ougongchang
0ee568b733
Update the API documentation of SummaryCollector and SummaryRecord.
Add more detailed notes for SummaryCollector and SummaryRecord,
because using them incorrectly can cause problems.
5 years ago
mindspore-ci-bot
bc42685436
!2770 Capture the time before handing over to the process pool to avoid time flips
Merge pull request !2770 from LiHongzhang/capture_time
5 years ago
Li Hongzhang
22dea2fc18
SummaryRecord register close atexit
5 years ago
Li Hongzhang
299469babb
address the importance of closing the SummaryRecord and illustrate how
5 years ago
Li Hongzhang
f9c6d12bc4
capture the time before handing over to the process pool to ensure time order
5 years ago
Li Hongzhang
97d8673018
warn when values duplicate and set mode to 'eval' to avoid extra recording
5 years ago
chenzomi
a834a6308e
change some comment name in the whole project
5 years ago
ougongchang
939cd29d7e
Add a callback named SummaryCollector and delete SummaryStep callback
I added a SummaryCollector to help users automatically collect information
such as the network, loss, learning rate and so on.
It can also collect the train lineage and eval lineage information that is
collected by the TrainLineage callback and EvalLineage callback in
MindInsight.
I also added some unit tests for SummaryCollector to keep the code correct.
5 years ago
Li Hongzhang
0921c1e538
enhance the SummaryRecord with set_mode and add_value
5 years ago
mindspore-ci-bot
373832d030
!2193 fix log level too high: a step having no summary record is normal
Merge pull request !2193 from wenkai/wk1_log_level_0617
5 years ago
wenkai
a2bad5c72d
fix log level too high: a step having no record is normal.
5 years ago
Li Hongzhang
d31e14f593
fix having too many processes and no attribute of '_closed'
1. When initializing SummaryRecord, if a check fails, self._closed is not set,
which later leads to the AttributeError "'SummaryRecord' object has no
attribute '_closed'".
2. There may be too many processes for handling summary adapting.
See issue #I1K6K7
5 years ago
Li Hongzhang
ccf49b7c0e
shape is a tuple, not an integer
5 years ago
李鸿章
e798fe2de7
mimic numpy behavior when min == max and give warnings when too large
5 years ago
李鸿章
324195280c
summary: handle a tensor that is not an image
6 years ago
shijianning
f2230fb070
fix pylint warnings
6 years ago
李鸿章
32c1d558f4
SummaryRecord as context manager
6 years ago
guozhijian
1b3a0277b7
Revert 'Pull Request !713 : Use a resident process to write summary files'
6 years ago
李鸿章
69d3abfdd3
reduce summaries for dead steps (step % flush_step > 0)
6 years ago
李鸿章
e0890e6acc
use a resident process for ordered and fast writing summary
6 years ago
李鸿章
abbdf78897
reduce expensive calculation and memory usage
6 years ago
李鸿章
2ac60a1ad4
style changes and nonfunctional modifications
6 years ago
wenkai
ab04b3dc4b
fix np.histogram(bins='auto') sometimes computing a very small bin width and a very large number of buckets, which leads to errors or long compute times.
6 years ago
candanzg
7e8e44ef56
fixed summary graph bug
Signed-off-by: candanzg <zhangshucheng@huawei.com>
6 years ago
mindspore-ci-bot
f9849bf5aa
!464 Supplement summary note
Merge pull request !464 from candanzg/summary_event_notes
6 years ago
candanzg
a91f82d79f
Supplement summary log
Signed-off-by: candanzg <zhangshucheng@huawei.com>
6 years ago
wenkai
cce61d462c
histogram python dev
6 years ago
zhunaipan
930a1fb0a8
initial version
Signed-off-by: leonwanghui <leon.wanghui@huawei.com>
6 years ago