chujinjin
d20c9d93d8
support pynative heterogeneous
4 years ago
parastooashtari
1a59dc37bf
add graph execution order history to dump
4 years ago
ZPaC
4c1ef4cef6
Fix ps cache broadcast error.
4 years ago
maning202007
8ee33007b1
add const to the data_ptr definition
Fix pclint for debugger
fix bugs
fix duplicated code issue
Fix the depth of method over 4 issue
fix comments
fix CI errors
remove redundancy arguments
4 years ago
i-robot
d31932febb
!24882 Fix ps testcase.
Merge pull request !24882 from gaoyong10/fix_bug
4 years ago
i-robot
caae6523a3
!24861 codedex fix
Merge pull request !24861 from limingqi107/new_actor_runtime
4 years ago
i-robot
093aa3b7a0
!24867 Fix AllGather Recompute Cycled Graph for GPU
Merge pull request !24867 from Cononlly/master
4 years ago
cononlly
1b119ced2b
Add a pass for allgather_recompute
4 years ago
gaoyong10
75260c5905
Fix ps error.
4 years ago
limingqi107
2985c6bccb
codedex fix
4 years ago
i-robot
ac9f5c5ede
!23992 code clean
Merge pull request !23992 from chenweifeng/code-clean-1
4 years ago
dayschan
7502345c8f
Add GraphKernelFlags into namespace mindspore::graphkernel
4 years ago
gaoyong10
a2bd6b0f66
Fix error when parameter is used as output.
4 years ago
i-robot
18960015b0
!24229 add dynamic shape attr pass
Merge pull request !24229 from hwjiaorui/dynamic-shape-pass
4 years ago
i-robot
0eccf93423
!23757 Add Support for AlltoAll in GPU
Merge pull request !23757 from Cononlly/master
4 years ago
hwjiaorui
eaa3dc57b7
add dynamic shape attr pass
4 years ago
cononlly
a8088525d5
Add AlltoAll GPU operator and IR pass and tests
4 years ago
wilfChen
2e65c4019f
replace #if to #ifdef
4 years ago
dayschan
73ecb0c4ab
Add namespace mindspore::graphkernel
* change the graphkernel's passes code(backend/optimizer/graph_kernel/*) to the
new namespace `mindspore::graphkernel`, to decouple from `mindspore::opt`.
* change the original `mindspore::opt::graphkernel` to `mindspore::graphkernel::inner` (graph_kernel/model)
* change the original `mindspore::opt::expanders` to `mindspore::graphkernel::expanders` (graph_kernel/expanders)
TODO: modify graph_kernel_flags, kernel_compiler/akg/
4 years ago
i-robot
e87f0be14f
!23707 fix summary isolation
Merge pull request !23707 from jiangshuqiang/temp
4 years ago
jiangshuqiang
c2bd061889
fix summary isolation
4 years ago
caifubi
0683b7fd75
clean codex
4 years ago
huanghui
ba66c0d491
add security isolate for save_graphs
4 years ago
sabrinasun
220245f592
add security isolation to online and offline debugger
4 years ago
liangzelang
1832d7c152
Use rtMemcpy trans data in Ascend instead of device -> host -> device
4 years ago
ZPaC
8f0a0682b8
Fix static check
4 years ago
gaoyong10
e7f6b034cf
Fix double output for single device address
4 years ago
i-robot
3f9fed78c4
!21860 PyNative kerenl parallel build in FIRST step
Merge pull request !21860 from caifubi/master-kernel-parallel-build-simple
4 years ago
caifubi
537fce0ee1
PyNative Kernel Parallel Build
1. Create Tensor and DeviceAddress for output before Launch.
2. Push Launch/Build Task to Queue and execute togather.
4 years ago
Parastoo Ashtari
bf034bddb5
Apply comments on tensor stat online and offline debugger
4 years ago
zuochuanyong
8fa68ebd98
fix Conv3D precision under fp16
4 years ago
ms_yan
36a8886ca2
Revert "[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset"
This reverts commit b077aa1cab .
Revert "[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset"
This reverts commit 4e6f7dc97d .
delete pass_registry_test.cc
comment hiai_nlu_model_multi.pb related line
4 years ago
djc
b077aa1cab
[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset
4 years ago
djc
4e6f7dc97d
[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset
4 years ago
zjun
35aab6144d
Fix pynative memory leak
Signed-off-by: zjun <zhangjun0@huawei.com>
4 years ago
yelihua
72e6058265
get rank id when set hccl env for single card train
4 years ago
limingqi107
5766234426
code review of gpu backend
4 years ago
wangjunbao
f9d99e97d2
fix ci warning for not handling function retrun of RDR
4 years ago
Margaret_wangrui
f9a064e464
Add ref user to UpdateState to ensure the order
4 years ago
limingqi107
1958b436b7
disable mindRT in control flow
4 years ago
maning202007
2b3d215ef8
Fix the forever loop for multigraph on gpu
4 years ago
kswang
bfab67a206
optimize node get target
4 years ago
chendongsheng
ecc8e379e8
fixed log error
4 years ago
kswang
3247c00555
optimize heter memcpy
5 years ago
chendongsheng
7d0d8f2a92
fixed ps data_parallel case result is error
4 years ago
i-robot
1cdaa12cfd
!18093 [Debugger] Add root graph id to sub graph's pb file
Merge pull request !18093 from TinaMengtingZhang/add_graph_id
4 years ago
i-robot
4861711676
!18107 dump and offline debug fixes
Merge pull request !18107 from john_tzanakakis/jt_bug_fixes
4 years ago
i-robot
eaac4f47b3
!18058 Update graph input shape
Merge pull request !18058 from chenweifeng/graph-dynamic
4 years ago
John Tzanakakis
ac1847ffac
fix iter 0 and iter 1 being dumped in dir 0, make op_debug_mode optional for sync mode, read input files for offline debugger
4 years ago
TinaMengtingZhang
dd6884eb6f
add root graph id to pb file
4 years ago