LaiYongqiang
77aa07b0f4
fix data dump & refnode bug and disable MindRT in PS mode
4 years ago
i-robot
c3339edf63
!27297 Bug Fix of Ascend MindRT
Merge pull request !27297 from hwjiaorui/mindrt-bug-fix
4 years ago
i-robot
258809d98f
!27067 Copy ref_param back
Merge pull request !27067 from liangzelang/dev_mindrt
4 years ago
i-robot
0380df75e7
!27299 Fix run condition for branch id.
Merge pull request !27299 from gaoyong10/runtime_second12
4 years ago
hwjiaorui
9f838e4684
mindrt ascend
4 years ago
gaoyong10
a081772b5a
Fix run condition for branch id.
4 years ago
i-robot
62d455e818
!27264 Bug Fix of Ascend MindRT
Merge pull request !27264 from hwjiaorui/mindrt-bug-fix
4 years ago
i-robot
10797a0df7
!27240 Weight params in subgraphs should be updated.
Merge pull request !27240 from liangzelang/fix_weight_input_in_subgraph_bug
4 years ago
i-robot
76f1ca3d9f
!26595 PyNative Ascend MindRT
Merge pull request !26595 from caifubi/master-pynative-mindrt-lazy-build-with-ascend
4 years ago
i-robot
04895d54a9
!27239 Add stack actor for return actor
Merge pull request !27239 from gaoyong10/runtime_second12
4 years ago
hwjiaorui
d408691320
bug fix of ascend mindrt
4 years ago
gaoyong10
fc4b8a3caf
Add stack actor for return actor
4 years ago
caifubi
ba5516335e
PyNative Ascend MindRT
4 years ago
liangzelang
e1244589c5
weight params in subgraphs should be updated.
4 years ago
i-robot
2ae6aec58e
!27209 Fix issue ReadTensor returns empty on cloud when file exists
Merge pull request !27209 from parastooashtari/offline_dbg_bug
4 years ago
i-robot
8bc3b734fa
!27092 Fix call node output and alloc device mem for Int64 value.
Merge pull request !27092 from gaoyong10/runtime_second12
4 years ago
Parastoo Ashtari
c9ed3651d8
use S_ISREG instead of DT_REG for files
4 years ago
gaoyong10
38c4375711
Fix call node output and alloc device mem for Int64 value.
4 years ago
i-robot
5a5a97f78d
!27084 Fix GPU mindrt sink True core dumped and dump multi graph error issue
Merge pull request !27084 from parastooashtari/gpu_dump_sink_issue
4 years ago
liangzelang
db778fe42d
Copy ref-param back
4 years ago
Parastoo Ashtari
9c036d1229
Fix GPU sink_mode True and Dump multi graph ERROR issue
4 years ago
i-robot
00c8e9a964
!26742 统一运行时场景支持输入数据shape动态时shape推导
Merge pull request !26742 from chengbin/master
4 years ago
limingqi107
57824d2879
unified runtime control flow add log
4 years ago
i-robot
acd04621fc
!27012 Fetch total front node in kernel graph.
Merge pull request !27012 from gaoyong10/runtime_second14
4 years ago
i-robot
c0d4e5cf0f
!27034 unified runtime control flow optimize the error info
Merge pull request !27034 from limingqi107/bug_fix4
4 years ago
i-robot
8e496e44f0
!22320 support heterogeneous for pynative mode
Merge pull request !22320 from chujinjin/support_heterogeneous_for_pynative
4 years ago
gaoyong10
ca677c0f16
Fetch total front node in kernel graph.
4 years ago
ckey_Dou
ab248e923c
On Ascend the update of dynamic input for graph is Done in
'LoadInputData', which is replaced by DataPrepareActor::PrepareData on
GPU(MindRT). This PR do the update of dynamic input in PrepareData and will
work for all platforms when they switch to MindRT.
4 years ago
limingqi107
7a93e2ca21
unified runtime control flow optimize the error info
4 years ago
chujinjin
d20c9d93d8
support pynative heterogeneous
4 years ago
limingqi107
312b26080b
unified runtime fix execution timeout and no data source actor of control flow
4 years ago
i-robot
db5ef1136f
!26981 Control flow support function type of formal parameter.
Merge pull request !26981 from gaoyong10/runtime_second12
4 years ago
gaoyong10
080aad5e1d
control flow support function type of formal parameter.
4 years ago
xulei
78b37afc30
fix mindspore mac compile
fix debugger, mindarmour, audio, gnn, grpc, ps/fl.
4 years ago
gaoyong10
a7611d0063
control flow support call to call
4 years ago
He Wei
41dcac9c49
Replace std::unordered_map/set with robin-hood-hashing
Robin-hood-hashing (https://github.com/martinus/robin-hood-hashing )
is considered faster then std::unordered_map/set,
so we use it to improve mindspore performance.
1. robin_hood head file in `third_party/robin_hood/include`;
2. In `utils/hash_map.h` and `utils/hash_set.h`, we define:
- mindspore::HashMap as an alias of robin_hood::unordered_map;
- mindspore::HashSet as an alias of robin_hood::unordered_set;
3. Replace:
- `#include <unordered_map>` --> `#include "utils/hash_map.h"`;
- `#include <unordered_set>` --> `#include "utils/hash_set.h"`;
- `std::unordered_map` --> `mindspore::HashMap`;
- `std::unordered_set` --> `mindspore::HashSet`;
- `map.insert(std::pair(key, value))` --> `map.emplace(key, value)`;
- `[] (const std::pair<K, V> &p) {..} ` --> `[] (const auto &p) {..} `;
4. Fix issues found by switch to robin_hood:
- AnfNodeConfig hash and equal;
- Fix a bug in `Slice::operator==()`;
- Fix a bug in `CNode::HasPrimalAttr()`;
- Fix map.erase() usage bugs: `map.erase(iter++)` --> `iter = map.erase(iter)`;
- Fix some iterator invalidated problem;
5. Some std::unordered_map/set can not replace by robin_hood:
- As parameter of functions that exposed to python by pybind11;
- Use bad hash that cause robin_hood::map over_flow, such as AbstractBasePtrListHasher;
6. Update cpp unit tests;
7. Add build option '-F' to enable robin_hood, default on.
4 years ago
limingqi107
e17a533369
fix the bug of graph and pynative shared the weight
4 years ago
limingqi107
a7f761c215
unified runtime support the control flow of subgraph sink
4 years ago
i-robot
3269c9b881
!26335 Support MindSpore on MacOS
Merge pull request !26335 from xulei/ms_mac_compile_br
4 years ago
i-robot
8e41c6813f
!26401 fix the weight output of exit actor.
Merge pull request !26401 from gaoyong10/runtime_second8
4 years ago
i-robot
21ffa1fb7b
!25091 Partial support for multi root graph in online debugger
Merge pull request !25091 from parastooashtari/online_multi_root_graph
4 years ago
xiefangqi
24ac2382ff
mac compile
4 years ago
gaoyong10
c64f62c309
fix the weight output of exit actor.
4 years ago
i-robot
9e6009dee2
!26355 modify kernel thread num equal to 23
Merge pull request !26355 from 范吉斌/fix_face
4 years ago
Parastoo Ashtari
7f682ba2f6
partial support for multi root graph in online debugger
4 years ago
limingqi107
d8b1b69eab
unified runtime optimize control flow
4 years ago
fan-jibin
0c941ac1eb
fix cpu thread num
4 years ago
gaoyong10
add5551ecf
fix code for testcase
4 years ago
i-robot
390b3c2efa
!26212 Control flow support untail call.
Merge pull request !26212 from gaoyong10/runtime_second8
4 years ago
gaoyong10
f7e3eef63d
control flow support untail call
4 years ago