tanghuikang
b6035cf1a1
Ascend swap support communication op
4 years ago
tanghuikang
fd6f97bab4
Fix some MemScheduler bugs
4 years ago
jjfeing
c6520f878f
status record in vm
4 years ago
limingqi107
19330b6f82
unified runtime support graph sink output update
4 years ago
hwjiaorui
4f566c0dd2
move mindir
4 years ago
i-robot
18960015b0
!24229 add dynamic shape attr pass
Merge pull request !24229 from hwjiaorui/dynamic-shape-pass
4 years ago
hwjiaorui
eaa3dc57b7
add dynamic shape attr pass
4 years ago
zhaosida
85e00ba3ab
self code check
4 years ago
tanghuikang
65542fdb21
Set output tensor NeedWait after WaitLockedInputs to avoid deadlock
4 years ago
i-robot
e87f0be14f
!23707 fix summary isolation
Merge pull request !23707 from jiangshuqiang/temp
4 years ago
jiangshuqiang
c2bd061889
fix summary isolation
4 years ago
He Wei
bffa1e6a39
Optimize ordered_map/set performance
1. Add compact_set and use it for node_users;
2. Optimize OrderedMap for shared_ptr;
3. Optimize OrderedSet for shared_ptr;
4. Use better hash for pointer type;
5. Fix a bug about 'erase' in session_basic.cc.
4 years ago
i-robot
fb58cfa273
!23414 update pynative profiling codes and delete some useless code
Merge pull request !23414 from lvchangquan/master
4 years ago
i-robot
6f09891501
!22092 [executor] Add mem scheduler
Merge pull request !22092 from kisnwang/add-mem-extend-cache
4 years ago
lvchangquan
b908f40d94
delete some useless codes and update pynative profiling codes
4 years ago
i-robot
b6eb0d8b93
!23474 revert
Merge pull request !23474 from zhaosida/heter_update
4 years ago
zhaosida
b045668801
Revert "update graph output"
This reverts commit 6ee00680b6 .
4 years ago
kswang
a0dc465041
add mem scheduler
4 years ago
huanghui
ba66c0d491
add security isolate for save_graphs
4 years ago
i-robot
dc7296ba1e
!23134 fix graph output not update
Merge pull request !23134 from zhaosida/heter_update
4 years ago
hwjiaorui
759eb57ac5
error-log modify
4 years ago
zhaosida
6ee00680b6
update graph output
4 years ago
yelihua
7c3994e48e
use Common::CreatePrefixPath instead of Common::GetRealPath
4 years ago
buxue
575554b2de
modify the interface to get realpath
4 years ago
i-robot
516a74f985
!22915 310 support cond graph
Merge pull request !22915 from zhoufeng/310-support-cond-graph
4 years ago
i-robot
38e15f1409
!23022 Fix bug of dynamic shape with ms function
Merge pull request !23022 from JoyLvliang/fix_bug_of_dynamic_shape_with_ms_function
4 years ago
zhoufeng
5a09648667
310 support cond graph
Signed-off-by: zhoufeng <zhoufeng54@huawei.com>
4 years ago
7347157+joylvliang@user.noreply.gitee.com
8f08bc281c
fix_bug_of_dynamic_shape_with_ms_function
4 years ago
caifubi
d8da8a55de
Disable async build pynative_synchronize is on
4 years ago
i-robot
d397fa22ec
!22797 Fix wide&deep allreduce_fusion_index bug in PyNative mode
Merge pull request !22797 from caifubi/master-pynative-allreduce-fusion-index
4 years ago
caifubi
e4354ccc30
Fix Wide&deep AllReduce Fusion Index bug
4 years ago
i-robot
1927d0eb75
!22760 Fix RunOp Graph Cache bug
Merge pull request !22760 from caifubi/master-fix-runop-cache
4 years ago
caifubi
4e82bedb7e
Fix RunOp Graph Cache bug
4 years ago
zjun
b5dfe05dd0
Use cache for mix
77b4f906af
use cell input cast
83fb06b93c
time 115 ok
15288c220c
Add ostringstream
4713a6ca57
4 years ago
i-robot
0f9c1c0d98
!22079 fix bug that find used kernel of parameter
Merge pull request !22079 from hwjiaorui/create_output_tensor
4 years ago
gaoyong10
e7f6b034cf
Fix double output for single device address
4 years ago
hwjiaorui
7a1c90ce27
fix usedbyrealkernel bug
4 years ago
caifubi
537fce0ee1
PyNative Kernel Parallel Build
1. Create Tensor and DeviceAddress for output before Launch.
2. Push Launch/Build Task to Queue and execute togather.
4 years ago
lvchangquan
bab311f0c7
update pynative profiling codes in fp and bp
4 years ago
ms_yan
36a8886ca2
Revert "[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset"
This reverts commit b077aa1cab .
Revert "[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset"
This reverts commit 4e6f7dc97d .
delete pass_registry_test.cc
comment hiai_nlu_model_multi.pb related line
4 years ago
djc
b077aa1cab
[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset
4 years ago
djc
4e6f7dc97d
[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset
4 years ago
yelihua
72e6058265
get rank id when set hccl env for single card train
4 years ago
yelihua
a6dc9a0a07
get rank id when set hccl env for single card train
4 years ago
hwjiaorui
23b1fb7f3f
opt code and add childgraph manager
4 years ago
zhousiyi
94e5fe6242
keep tuple_getitem as much as possible to reduce the number of func graphs
shrink output of func_graph other than set unused to dead node
4 years ago
hwjiaorui
1fa30a665f
tensor device address setting
4 years ago
kswang
4a456ca954
clean code
4 years ago
i-robot
4c3c3ceee6
!21157 add AdamWeightDecay to optSet
Merge pull request !21157 from kisnwang/update-need-lock-tensors
4 years ago
i-robot
eb5b2c478e
!21230 fix python kernel execute lock
Merge pull request !21230 from chenweifeng/python-kernel-sync-execute
4 years ago