caifubi
537fce0ee1
PyNative Kernel Parallel Build
1. Create Tensor and DeviceAddress for output before Launch.
2. Push Launch/Build Task to Queue and execute together.
4 years ago
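The two steps in the commit above can be pictured with a minimal sketch. It is not MindSpore code: `OpTaskQueue`, `push`, `flush`, and the dict used as an output placeholder are all hypothetical names, and running the build tasks on a thread pool is an assumption about what "execute together" means.

```python
from concurrent.futures import ThreadPoolExecutor


class OpTaskQueue:
    """Hypothetical queue that defers kernel build and launch (illustration only)."""

    def __init__(self):
        self._pending = []  # entries are (build_fn, launch_fn, output placeholder)

    def push(self, name, build_fn, launch_fn):
        # Step 1: the output placeholder exists before anything is launched,
        # so the caller can keep a handle to it right away.
        output = {"name": name, "value": None}
        # Step 2: only record the build/launch work; nothing runs yet.
        self._pending.append((build_fn, launch_fn, output))
        return output

    def flush(self):
        pending, self._pending = self._pending, []
        # Assumption: queued build tasks are independent, so they run in parallel.
        with ThreadPoolExecutor() as pool:
            kernels = list(pool.map(lambda task: task[0](), pending))
        # Launches run in submission order and fill the placeholders.
        for kernel, (_, launch_fn, output) in zip(kernels, pending):
            output["value"] = launch_fn(kernel)


queue = OpTaskQueue()
out = queue.push("add", lambda: (lambda a, b: a + b), lambda kernel: kernel(1, 2))
queue.flush()
print(out["value"])
```

The placeholder is what lets the caller continue before the kernel has been built; its value is only valid after `flush()`.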
ms_yan
36a8886ca2
Revert "[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset"
This reverts commit b077aa1cab.
Revert "[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset"
This reverts commit 4e6f7dc97d.
delete pass_registry_test.cc
comment hiai_nlu_model_multi.pb related line
4 years ago
djc
4e6f7dc97d
[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset
4 years ago
zjun
35aab6144d
Fix pynative memory leak
Signed-off-by: zjun <zhangjun0@huawei.com>
4 years ago
wangjunbao
f9d99e97d2
fix ci warning for not handling function return of RDR
4 years ago
buxue
733f765063
add security compilation options
4 years ago
ZPaC
a9a0f590e6
Fix master static check
4 years ago
chujinjin
ac3670ef91
modify log level for pynative ascend
4 years ago
kswang
42155e7e5c
add updated parameter for graph
4 years ago
TinaMengtingZhang
2fa05b66a1
change device_id to rank_id in dump path
4 years ago
baihuawei
e594463960
fix cpu lstm and precision issues
4 years ago
TinaMengtingZhang
4926d74570
unify dir path
dump hccl and config json files to dir
update filename for sync dump except cpu dump
update testcases
4 years ago
mindspore-ci-bot
6f5c74eadb
!15765 Format Transform pass on CPU
From: @zuochuanyong
Reviewed-by: @zhaizhiqiang,@guoqi1024,@zhoufeng54
Signed-off-by: @zhaizhiqiang
5 years ago
ZPaC
12f95b51f4
Add server code part2
5 years ago
zuochuanyong
e7ea343738
add format transform pass on cpu
5 years ago
He Wei
121a6a28d9
[auto-monad] Enforce order of execution for Loads user nodes in frontend
5 years ago
mindspore-ci-bot
fa46d3e6a0
!15359 [MS][RDR] collect backend graph and graph execution order
From: @luopengting
Reviewed-by: @yelihua,@ouwenchang
Signed-off-by: @ouwenchang
5 years ago
luopengting
0cb9a2d218
enhance RDR support on CPU
1. collect backend graph and graph execution order
2. add trigger point when CPUKernelRuntime::Run() raises "Launch kernel failed"
5 years ago
tanghuikang
c88528990b
PyNative support dynamic shape on cpu
5 years ago
buxue
71866950cf
change log level from info to warning when using dump in PyNative mode
5 years ago
kswang
2a48b2ecb8
reconstruct session code
5 years ago
tanghuikang
c8a14ba016
Clean code
5 years ago
mindspore-ci-bot
33edd67261
!14667 add dump data function for CPU
From: @zhangbuxue
Reviewed-by:
Signed-off-by:
5 years ago
buxue
ab59f62147
add dump data function for CPU
5 years ago
kswang
97a97e02db
extract load input
5 years ago
simson
aa7b49fe32
clear memory after run op in CPU
5 years ago
baihuawei
a1c20e568a
add cpu mixed precision
5 years ago
huanghui
a2ba47e18a
1. Add unique id for .dat and .dot file to avoid overwriting
2. Dump the end graph in gpu session and cpu session
5 years ago
chendongsheng
db0a6f1e19
replace ps-lite
5 years ago
He Wei
7d9a783993
[auto-monad] Support side-effects by auto-monad
The basic idea is to exploit data dependencies to control the execution order
of side-effect operations while keeping the semantics of ANF unchanged.
The ControlDepend primitive is removed and there are two primitives added:
1. UpdateState:
```
a = Assign(para, value)
```
becomes:
```
a = Assign(para, value, u)
u = UpdateState(u, a)
```
2. Load:
```
x = Add(para, value)
```
becomes:
```
p = Load(para, u)
x = Add(p, value)
u = UpdateState(u, p)
```
5 years ago
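The two rewrites described in the auto-monad commit above can be mimicked on a toy statement list. This is a rough sketch, not the MindSpore pass: the `(result, op, args)` tuple IR, the `thread_state` function, the `SIDE_EFFECT_OPS` set, and the `p1`, `p2`, ... naming of loaded values are all assumptions made for illustration. Like the commit message, it reuses the single name `u` for the state instead of renaming it at each step.

```python
# Hypothetical mini-IR: each statement is (result, op, args).
SIDE_EFFECT_OPS = {"Assign"}  # assumption: only Assign writes a parameter


def thread_state(stmts, params, state="u"):
    """Thread a state value through statements that write or read parameters."""
    out = []
    load_count = 0
    for result, op, args in stmts:
        if op in SIDE_EFFECT_OPS:
            # UpdateState rule: the write takes the state as an extra input,
            # and the state is then updated from the write's result.
            out.append((result, op, tuple(args) + (state,)))
            out.append((state, "UpdateState", (state, result)))
        elif any(arg in params for arg in args):
            # Load rule: each parameter read goes through Load(para, u),
            # and the state is updated from every loaded value.
            new_args, loaded = [], []
            for arg in args:
                if arg in params:
                    load_count += 1
                    name = f"p{load_count}"
                    out.append((name, "Load", (arg, state)))
                    loaded.append(name)
                    new_args.append(name)
                else:
                    new_args.append(arg)
            out.append((result, op, tuple(new_args)))
            for name in loaded:
                out.append((state, "UpdateState", (state, name)))
        else:
            # Pure statements that touch no parameter are left unchanged.
            out.append((result, op, tuple(args)))
    return out


program = [
    ("a", "Assign", ("para", "value")),
    ("x", "Add", ("para", "value")),
]
for stmt in thread_state(program, params={"para"}):
    print(stmt)
```

Because every write and every read now consumes and produces `u`, the read of `para` in `Add` depends on the preceding `Assign` through data flow alone, which is the ordering that the removed ControlDepend primitive used to express.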
yitongh
2078b7156f
Add support for CPU memory reuse
5 years ago
lvliang
8984cc9c03
pynative-support-dynamic-op-run-in-gpu
5 years ago
mindspore-ci-bot
a554913704
!9548 fix cpu pynative mode mixed precision error
From: @chujinjin
Reviewed-by: @jjfeing,@kisnwang
Signed-off-by: @kisnwang
5 years ago
chujinjin
02c8052e89
fix cpu pynative mixed precision error
5 years ago
kswang
25f97679fd
print trace info when init/run kernel failed
5 years ago
chujinjin
701ab0d05f
add cpu pynative mode
5 years ago
mindspore-ci-bot
dabb82ec7a
!8846 host cpu support dynamic shape
From: @liubuyu
Reviewed-by: @jjfeing
Signed-off-by:
5 years ago
liubuyu
9f5ab8f76f
host cpu support dynamic shape
5 years ago
kswang
3aa875a19f
add trace info for cpu
5 years ago
lizhenyu
b8991ee149
add ps mode consistency check
5 years ago
kswang
11989b5e30
enable async run
5 years ago
ZPaC
0b49f0fb57
change PS dir
5 years ago
John Tzanakakis
b0a7ebdeb0
enable debugger by default and set correct log message severity
5 years ago
Zhang Qinghua
c0070d3d49
Use the unified Execute function to run Graph or Single Op Graph.
5 years ago
yujianfeng
9aa6d203f1
Fix output device address setting for inputs of depend node
5 years ago
kswang
756bb6d53f
async run graph
5 years ago
zhoufeng
663278112f
optimize code compile performance
Signed-off-by: zhoufeng <zhoufeng54@huawei.com>
5 years ago
liubuyu
d81862a916
decoupling core and context
5 years ago
ZPaC
4281f38075
Delete hard code in pull kernel.
5 years ago
z00505269
87668d6ea2
remove predict
5 years ago