DeepLabv3 is a semantic segmentation architecture that improves upon DeepLabv2 with several modifications. To handle the problem of segmenting objects at multiple scales, modules are designed that employ atrous convolution, in cascade or in parallel, to capture multi-scale context by adopting multiple atrous rates.
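As a rough illustration of the parallel variant, the following MindSpore sketch applies several atrous (dilated) convolutions to the same feature map and concatenates the results. The class name, channel sizes, and rates here are illustrative assumptions, not this repository's implementation (which lives in src/deeplabv3.py).

```python
import mindspore.nn as nn
import mindspore.ops as ops

class MultiRateAtrous(nn.Cell):
    """Parallel 3x3 atrous convolutions; each rate widens the receptive field."""
    def __init__(self, in_ch=2048, out_ch=256, rates=(6, 12, 18)):
        super().__init__()
        # One branch per atrous rate; all branches see the same input features.
        self.branch1 = nn.Conv2d(in_ch, out_ch, 3, pad_mode="same", dilation=rates[0])
        self.branch2 = nn.Conv2d(in_ch, out_ch, 3, pad_mode="same", dilation=rates[1])
        self.branch3 = nn.Conv2d(in_ch, out_ch, 3, pad_mode="same", dilation=rates[2])
        self.concat = ops.Concat(axis=1)

    def construct(self, x):
        # Multi-scale context: same input, different effective receptive fields.
        return self.concat((self.branch1(x), self.branch2(x), self.branch3(x)))
```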
Paper: Chen L. C., Papandreou G., Schroff F., et al. Rethinking Atrous Convolution for Semantic Image Segmentation. 2017.
The overall network architecture of DeepLabv3 is shown below:
Dataset used: VOC2012
20 classes. The train/val data has 11,530 images containing 27,450 ROI-annotated objects and 6,929 segmentations. Note that the color map must be removed from the annotations before training.
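A minimal Python sketch of this preprocessing step is given below. The directory layout and output folder are assumptions for illustration; the repository's own data pipeline may handle this differently.

```python
# Hedged sketch: strip the color map (palette) from VOC2012 annotation PNGs.
# Paths are assumptions for illustration.
import os
import numpy as np
from PIL import Image

def remove_color_map(ann_dir, out_dir):
    os.makedirs(out_dir, exist_ok=True)
    for name in os.listdir(ann_dir):
        if not name.endswith(".png"):
            continue
        # Palette-mode PNGs already store a class index per pixel; going through
        # a numpy array keeps the indices and drops the color map.
        labels = np.array(Image.open(os.path.join(ann_dir, name)))
        Image.fromarray(labels.astype(np.uint8)).save(os.path.join(out_dir, name))

remove_color_map("VOCdevkit/VOC2012/SegmentationClass",
                 "VOCdevkit/VOC2012/SegmentationClassRaw")
```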
The mixed precision training method accelerates the deep learning neural network training process by using both the single-precision and half-precision data formats, and maintains the network precision achieved by the single-precision training at the same time. Mixed precision training can accelerate the computation process, reduce memory usage, and enable a larger model or batch size to be trained on specific hardware.
For FP16 operators, if the input data type is FP32, the MindSpore backend will automatically handle it with reduced precision. Users can check the reduced-precision operators by enabling the INFO log and then searching for 'reduce precision'.
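As a rough sketch of how mixed precision can be enabled through MindSpore's Model wrapper; the toy network, loss, and optimizer below are placeholders, not this repository's model, and the optimizer values simply mirror the defaults listed later in this README.

```python
import mindspore.nn as nn
from mindspore import context
from mindspore.train import Model

context.set_context(mode=context.GRAPH_MODE, device_target="Ascend")

# Placeholder network; the real model is defined in src/deeplabv3.py.
net = nn.Dense(4, 2)
loss = nn.SoftmaxCrossEntropyWithLogits(sparse=True)
opt = nn.Momentum(net.trainable_params(), learning_rate=0.0014, momentum=0.97)

# amp_level="O3" runs the network in FP16; FP32 inputs to FP16 operators are
# then handled with reduced precision by the backend, as described above.
model = Model(net, loss_fn=loss, optimizer=opt, amp_level="O3")
```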
.
└─DeeplabV3
│ README.md
│ eval.py
│ train.py
├─scripts
│ run_distribute_train.sh # launch distributed training on the Ascend platform (8p)
│ run_eval.sh # launch evaluation on the Ascend platform
│ run_standalone_train.sh # launch standalone training on the Ascend platform (1p)
└─src
│ config.py # parameter configuration
│ deeplabv3.py # network definition
│ ei_dataset.py # data preprocessing for EI
│ losses.py # customized loss function
│ md_dataset.py # data preprocessing
│ miou_precision.py # miou metrics
│ __init__.py
│
├─backbone
│ resnet_deeplab.py # backbone network definition
│ __init__.py
│
└─utils
adapter.py # dataset adapter
custom_transforms.py # random data augmentation transforms
file_io.py # file operation module
__init__.py
The major parameters in train.py and config.py are as follows:
learning_rate Learning rate, default is 0.0014.
weight_decay Weight decay, default is 5e-5.
momentum Momentum, default is 0.97.
crop_size Image crop size [height, width] during training, default is 513.
eval_scales The scales to resize images for evaluation, default is [0.5, 0.75, 1.0, 1.25, 1.5, 1.75].
output_stride The ratio of input to output spatial resolution, default is 16.
ignore_label Ignore label value, default is 255.
seg_num_classes Number of semantic classes, including the background class (foreground classes + 1 background class in the PASCAL VOC 2012 dataset), default is 21.
fine_tune_batch_norm Fine tune the batch norm parameters or not, default is False.
atrous_rates Atrous rates for atrous spatial pyramid pooling, default is None.
decoder_output_stride The ratio of input to output spatial resolution when employing a decoder to refine segmentation results, default is None.
image_pyramid Input scales for multi-scale feature extraction, default is None.
epoch_size Epoch size, default is 6.
batch_size Batch size of the input dataset, default is 2.
enable_save_ckpt Whether to save checkpoints, default is true.
save_checkpoint_steps Save checkpoint steps, default is 1000.
save_checkpoint_num Save checkpoint numbers, default is 1.
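For reference, the following is a minimal sketch of how these defaults might be grouped in src/config.py. The EasyDict structure is an assumption for illustration; only the field names and values come from the list above.

```python
# Hypothetical sketch of src/config.py; structure assumed, values from the list above.
from easydict import EasyDict as ed

config = ed({
    "learning_rate": 0.0014,
    "weight_decay": 5e-5,
    "momentum": 0.97,
    "crop_size": 513,
    "eval_scales": [0.5, 0.75, 1.0, 1.25, 1.5, 1.75],
    "output_stride": 16,
    "ignore_label": 255,
    "seg_num_classes": 21,
    "fine_tune_batch_norm": False,
    "atrous_rates": None,
    "decoder_output_stride": None,
    "image_pyramid": None,
    "epoch_size": 6,
    "batch_size": 2,
    "enable_save_ckpt": True,
    "save_checkpoint_steps": 1000,
    "save_checkpoint_num": 1,
})
```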
You can start training using Python or shell scripts. The usage of the shell script is as follows:
sh scripts/run_distribute_train.sh RANK_TABLE_FILE DATA_PATH (CKPT_PATH)
Notes:
RANK_TABLE_FILE can refer to Link, and device_ip can be found in /etc/hccn.conf on the Ascend server.
# training example
python:
python train.py --dataset_url DATA_PATH
shell:
sh scripts/run_distribute_train.sh RANK_TABLE_FILE DATA_PATH (CKPT_PATH)
Notes:
If you are running a fine-tuning or evaluation task, prepare the corresponding checkpoint file.
The training result will be stored in the example path. Checkpoints will be stored at ./LOG0/chec_deeplabv3-* by default, and the training log will be redirected to ./log.txt, as shown below.
epoch: 1 step: 732, loss is 0.11594
Epoch time: 78748.379, per step time: 107.378
epoch: 2 step: 732, loss is 0.092868
Epoch time: 160917.911, per step time: 36.631
You can start evaluation using Python or shell scripts. The usage of the shell script is as follows:
sh scripts/run_eval.sh DEVICE_ID DATA_PATH PRETRAINED_CKPT_PATH
# eval example
python:
python eval.py --device_id DEVICE_ID --dataset_url DATA_DIR --checkpoint_url PATH_CHECKPOINT
shell:
sh scripts/run_eval.sh DEVICE_ID DATA_PATH PRETRAINED_CKPT_PATH
The checkpoint can be produced during the training process.
The evaluation result will be stored in the example path; you can find results like the following in log.txt.
mIoU = 0.65049
| Parameters | DeeplabV3 |
|---|---|
| Model Version | |
| Resource | Ascend 910; CPU 2.60 GHz, 56 cores; memory 314 GB |
| Uploaded Date | 08/24/2020 (month/day/year) |
| MindSpore Version | 0.6.0-beta |
| Training Parameters | src/config.py |
| Optimizer | Momentum |
| Loss Function | SoftmaxCrossEntropy |
| outputs | probability |
| Loss | 0.98 |
| Accuracy | mIoU:65% |
| Total time | 5mins |
| Params (M) | 94M |
| Checkpoint for Fine tuning | 100M |
| Parameters | DeeplabV3 |
|---|---|
| Model Version | |
| Resource | Ascend 910 |
| Uploaded Date | 08/24/2020 (month/day/year) |
| MindSpore Version | 0.6.0-beta |
| Dataset | voc2012/val |
| batch_size | 2 |
| outputs | probability |
| Accuracy | mIoU:65% |
| Total time | 10mins |
| Model for inference | 97M (.GEIR file) |
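The GEIR file can be produced from a trained checkpoint with MindSpore's export utility. The sketch below is an assumption about this step: the network constructor and checkpoint file name are hypothetical placeholders, while the input shape follows the batch_size and crop_size defaults above.

```python
import numpy as np
from mindspore import Tensor
from mindspore.train.serialization import load_checkpoint, load_param_into_net, export

from src.deeplabv3 import DeepLabV3  # hypothetical constructor name

net = DeepLabV3(num_classes=21)  # seg_num_classes default from config
load_param_into_net(net, load_checkpoint("deeplabv3.ckpt"))  # hypothetical file
inp = Tensor(np.zeros([2, 3, 513, 513], dtype=np.float32))   # batch_size=2, crop_size=513
export(net, inp, file_name="deeplabv3", file_format="GEIR")
```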
We use random operations in custom_transforms.py for data preprocessing.
Please check the official homepage.