i-robot
3f9fed78c4
!21860 PyNative kerenl parallel build in FIRST step
Merge pull request !21860 from caifubi/master-kernel-parallel-build-simple
4 years ago
i-robot
9c4fe0a085
!22422 log modify
Merge pull request !22422 from liubuyu/master
4 years ago
caifubi
537fce0ee1
PyNative Kernel Parallel Build
1. Create Tensor and DeviceAddress for output before Launch.
2. Push Launch/Build Task to Queue and execute togather.
4 years ago
i-robot
ebe75e403b
!22444 Apply comments on tensor stat online and offline PR
Merge pull request !22444 from parastooashtari/tensor_level_info_online
4 years ago
lby
e099f7fbd5
log modify
4 years ago
i-robot
c9a574dbf7
!22426 fix dump graph with full name with scope
Merge pull request !22426 from jjfeing/master
4 years ago
Parastoo Ashtari
bf034bddb5
Apply comments on tensor stat online and offline debugger
4 years ago
i-robot
01ade5857d
!22123 update pynative profiling codes in fp and bp
Merge pull request !22123 from lvchangquan/profiling_formal
4 years ago
jjfeing
d76664c1b7
dump error graph with full_name_with_scope
4 years ago
i-robot
438169e0b9
!22365 RDR adapts for CPU dynamic memory allocation
Merge pull request !22365 from liangyongxiong/fix
4 years ago
i-robot
8e39dd4ec7
!22173 ascend add nontask sink mode
Merge pull request !22173 from baihuawei/graph_mode_nonsink_part3-2
4 years ago
i-robot
23a5c64ce0
!22218 Add graph kernel userdefine op support
Merge pull request !22218 from zichun_ye/graph_kernel_userdefine
4 years ago
lvchangquan
bab311f0c7
update pynative profiling codes in fp and bp
4 years ago
liangyongxiong
9f6b015032
RDR adapts for CPU dynamic memory allocation
4 years ago
i-robot
49d84b3a87
!22247 error log rectification
Merge pull request !22247 from hwjiaorui/error-log
4 years ago
i-robot
8a5ef35d64
!22105 pynative mode bug fix
Merge pull request !22105 from liubuyu/master
4 years ago
baihuawei
a9694a9230
ascend add nontask sink mode
4 years ago
Zichun Ye
a7d89f6686
add graph kernel userdefine op support
fix code check
4 years ago
lby
a04da35956
fix case core dump in pynative mode
4 years ago
hwjiaorui
12b4940e8a
log rectification
4 years ago
i-robot
2e0ae45a67
!22322 clean code
Merge pull request !22322 from hwjiaorui/clean-code-master
4 years ago
i-robot
f88445dcf0
!22260 Avoid printing meaningless error message
Merge pull request !22260 from tanghuikang/tbe_em
4 years ago
hwjiaorui
2008a2a78c
clean code
4 years ago
i-robot
d2a42f131f
!21963 fix a profiling bug to improve performance
Merge pull request !21963 from lvchangquan/master
4 years ago
i-robot
6279b1a81f
!22111 dump task error data support ms_om_path
Merge pull request !22111 from jjfeing/master
4 years ago
tanghuikang
9d08583307
Avoid printing meaningless error message
4 years ago
i-robot
d4e62bd7df
!22165 add error string of curandStatus
Merge pull request !22165 from hanhuifeng/droupout_log
4 years ago
baihuawei
6eec288c39
opt ascend single op mode runtime code
4 years ago
hanhuifeng2020
f84b80525b
add error string for curandStatus
4 years ago
ms_yan
36a8886ca2
Revert "[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset"
This reverts commit b077aa1cab .
Revert "[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset"
This reverts commit 4e6f7dc97d .
delete pass_registry_test.cc
comment hiai_nlu_model_multi.pb related line
4 years ago
djc
b077aa1cab
[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset
4 years ago
djc
4e6f7dc97d
[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset
4 years ago
jjfeing
b4e67caf22
support ms_om_path env
4 years ago
i-robot
09a1a7f1f2
!21924 Add subdirectory for glog and ir_dump
Merge pull request !21924 from huanghui/add-submodule-for-dfx-files
4 years ago
i-robot
36f0b3c353
!22062 upgrade ascend package 19 Aug 21
Merge pull request !22062 from yanghaoran/upgrade_ascend_0819
4 years ago
yanghaitao1
8fc11cb676
adapt delete libms_profiler_fwk.a
4 years ago
i-robot
db1995562e
!21917 Opt kernel runtime performance in vm non_task sink mode.
Merge pull request !21917 from liangzelang/opt_runtime
4 years ago
huanghui
1630dcb0c8
add subdirectory for log and ir_dump
4 years ago
i-robot
5e718c5676
!21973 GetRankId failed in PyNative mode
Merge pull request !21973 from caifubi/master-hccl-get-rank-id
4 years ago
caifubi
dfe0e94466
Fix PyNative get_rank_id/get_rank_size
4 years ago
Parastoo Ashtari
6d93ab5e35
Fixed old runtime GPU dump issue for multigraph
4 years ago
lvchangquan
136bd94f77
fix a profiling bug to improve performance
4 years ago
liangzelang
3e4ffd2025
opt runtime
4 years ago
chujinjin
34096bf879
add sync control for pynative
4 years ago
i-robot
16d5427743
!21863 update the method for get rank_id
Merge pull request !21863 from yelihua/new-dev
4 years ago
i-robot
8b4ef20958
!21733 fix DestroyHccl must be called before FreeDeviceMemory
Merge pull request !21733 from jjfeing/master
4 years ago
yelihua
a6dc9a0a07
get rank id when set hccl env for single card train
4 years ago
i-robot
07906235d0
!21759 unified runtime fix the bug of the old and new runtime coexistence of dynamic shape
Merge pull request !21759 from limingqi107/bug_fix
4 years ago
i-robot
6afcd815d2
!21362 add pynative profiling codes based on ascend and gpu
Merge pull request !21362 from lvchangquan/profiling_refactor
4 years ago
limingqi107
2824d80592
unified runtime fix the bug of the old and new runtime coexistence of dynamic shape
4 years ago