mindspore2022

History

mindspore-ci-bot 84607e3a51 !15890 fix an allreduce calculate bug in pynative mode From: @lvchangquan Reviewed-by: Signed-off-by:		5 years ago
..
distribution	GPU supports p2p nccl interfaces	5 years ago

mpi	code warning clean	5 years ago

blocking_queue.cc	add push opt logic	5 years ago

blocking_queue.h	add input data type check for ps cache mode	5 years ago

cuda_common.h	add more dtypes support for gatherdgrad and other bugfix	5 years ago

cuda_driver.cc	fix an allreduce bug with two streams sync problem	5 years ago

cuda_driver.h	fix an allreduce bug with two streams sync problem	5 years ago

cuda_env_checker.cc	search nvcc in entire PATH	5 years ago

cuda_env_checker.h	search nvcc in entire PATH	5 years ago

gpu_bucket.cc	add op atomic clean to clear input addr in launch allreduce	5 years ago

gpu_bucket.h	add op atomic clean to clear input addr in launch allreduce	5 years ago

gpu_buffer_mgr.cc	add error log when set device id failed	5 years ago

gpu_buffer_mgr.h	add push opt logic	5 years ago

gpu_common.h	addtensor size limitation to 2G	5 years ago

gpu_device_address.cc	use host_type vs deprecated type_id	5 years ago

gpu_device_address.h	fix device memory leak	5 years ago

gpu_device_manager.cc	fix an allreduce bug with two streams sync problem	5 years ago

gpu_device_manager.h	fix an allreduce bug with two streams sync problem	5 years ago

gpu_event.cc	PyNative AllReduce Bucket	5 years ago

gpu_event.h	PyNative AllReduce Bucket	5 years ago

gpu_kernel_build.cc	move the akg kernel build timer into AkgKernelBuilder::AkgKernelParallelBuild, so that it can time the Ascend kernel builder	5 years ago

gpu_kernel_build.h	add hardware abstract layer	5 years ago

gpu_kernel_runtime.cc	fix an allreduce bug with two streams sync problem	5 years ago

gpu_kernel_runtime.h	fix an allreduce bug with two streams sync problem	5 years ago

gpu_launch_kernel.cc	add hardware abstract layer	5 years ago

gpu_launch_kernel.h	add op atomic clean to clear input addr in launch allreduce	5 years ago

gpu_launch_mul.cc	add op_mul fusion based on allreduce fusion in pynative mode	5 years ago

gpu_launch_mul.h	add op atomic clean to clear input addr in launch allreduce	5 years ago

gpu_memory_allocator.cc	Refactor ms_context implementation	5 years ago

gpu_memory_allocator.h	Unified code style	6 years ago

gpu_memory_copy_manager.cc	fix an allreduce bug with two streams sync problem	5 years ago

gpu_memory_copy_manager.h	fix an allreduce bug with two streams sync problem	5 years ago

gpu_memory_manager.cc	add the continue memory alloc of communication kernel for actor runtime	5 years ago

gpu_memory_manager.h	profiler memory	5 years ago

gpu_stream_assign.cc	fix an allreduce bug with two streams sync problem	5 years ago

gpu_stream_assign.h	fix_consecutive_allreduce_bug	5 years ago

kernel_info_setter.cc	IR operators of GPU and CPU are unified as batchnorm	5 years ago

kernel_info_setter.h	fix graph output address set in the one time memory application scenarios	5 years ago

queue_common.h	add trace for gpu error/excpt log	5 years ago

readme.md	mindspore path adjust	6 years ago

trt_loader.cc	tensor-rt library dynamic loadg	5 years ago

trt_loader.h	gpu inference mixed precision	5 years ago

readme.md

gpu

阿对对队

C++ Python Text C Unity3D Asset other

314202276@qq.com 5518576+mindspore_ci@user.noreply.gitee.com tommylike@qq.com zhaozhenlong1@huawei.com zhoufeng54@huawei.com sunsuodong@huawei.com wangkaisheng2@huawei.com yangruoqi@huawei.com shiliang10@huawei.com xiefangqi2@huawei.com caifubi1@huawei.com lingqiaomin.huawei.com chenweifeng720@huawei.com fuzhiye@huawei.com liubuyu1@huawei.com changzherui1@huawei.com huanghui44@huawei.com guozhijian@huawei.com yaoyifan1@huawei.com zhaoting23@huawei.com liuxiao93@huawei.com peixu.ren1@huawei.com xuanyue@huawei.com lizhenyu13@huawei.com yuchaojie1@huawei.com

readme.md

Contributors (25+) All

Contributors (25+)
All