mindspore2022

584 MB

Tree: 052d8e2c99

Author	SHA1	Message	Date
zhupuxu	523c763a02	clean code Signed-off-by: zhupuxu <zhupuxu@huawei.com>	4 years ago
zhupuxu	c0099abd03	pass optimizer Signed-off-by: zhupuxu <zhupuxu@huawei.com>	4 years ago
dayschan	2ac8c65327	Add GraphKernelPassManager to manage the passes of GraphKernel Refactor the original "PassManager" class, and derive the "GraphKernelPassManager" GraphKernel's ir files are dumped into a new sub-directory "graph_kernel" in the original "verbose_ir_files" All GraphKernel's passes are divided into 3 levels, and controlled by the flag "opt_level" by default. when the opt_level is greaterequal to the pass's level, this pass will run. The default "opt_level" is 2 when GraphKernel is enabled. Levels: 1. Basic features, like cluster, splitter, and some preprocess, postprocess. 2. All stable features, mainly includes the optimization passes. 3. Experimental features, like stitch-fusion, parallel-fusion. The two flags "enable_pass" and "disable_pass" are available in this commit. User can manually enable some passes when it's disabled by "opt_level", or disable the enabled passes, by specifying that pass in this format: "stage_id.pass_id" or "stage_name.pass_name", multiple passes are separated by comma(",") the stage/pass index and stage/pass name can be found from the ir filename. e.g. "--enable_pass=cluster.graph_kernel_expander,1.1,1.2" Others: 1. the pass "tensor_promotion" is not useful, remove it. 2. put the pass "InsertPadOps" before "ArithmeticSimplify".	4 years ago
jjfeing	eeb9153c9d	fix code check 2.0	5 years ago
huanghui	de843b45b6	add circle check in ub fusion	5 years ago
huanghui	fa6c23358a	Move some ir files which backend optpass dumped to the fold: verbose_ir_files	5 years ago
huanghui	b7519b7418	unify save_graphs_path	5 years ago
dayschan	37a48f6aac	GraphKernel supports GPU 1. Update akg submodule 2. Refactor akg_kernel_build, akg_ascend_kernel_build, akg_gpu_kernel_build 3. Add akg_kernel_json_decoder to support converting kernel_json to AnfNode. 4. Add GraphKernel Cost Model. (mindspore/_extends/graph_kernel) 5. Add some GraphKernel passes to GpuSession, move these passes to backend/optimizer/graph_kernel. 6. Add global id for ir files. 7. Fix bug in ConstInputToAttr.	5 years ago
fary86	fcbb3e0edc	Refactor ms_context implementation	5 years ago
zhoufeng	663278112f	optimize code compile performance Signed-off-by: zhoufeng <zhoufeng54@huawei.com>	5 years ago
liubuyu	d81862a916	decoupling core and context	5 years ago
liubuyu	43c79eb853	mindspore path adjust	5 years ago

12 Commits (052d8e2c9940aa8a7b147b881958642bfcc33fe7)