vddong
91d3a7543f
[feat][assistant][I3T96F] add new data operator SpeechCommands
4 years ago
i-robot
4252b24335
!26792 malloc ts memory for label
Merge pull request !26792 from zhoufeng/change-label-memory-type
4 years ago
zengzitao
62458b5636
adapt graph kernel for cpu
4 years ago
i-robot
04132e0c50
!26766 fix error log and move some function to inner
Merge pull request !26766 from lianliguang/master
4 years ago
lianliguang
5864d6a006
modify some error log
4 years ago
zhoufeng
881179fa10
malloc ts memory for label
Signed-off-by: zhoufeng <zhoufeng54@huawei.com>
4 years ago
i-robot
3520f1427d
!26701 MD C++ UT: Remove old tests part 3
Merge pull request !26701 from hetshah/hs_remove_old_tests
4 years ago
hetshah
098ca49dd3
remove instances of datasetiterator from old tests and move coverage to new method
4 years ago
i-robot
3d0f9d8aae
!26683 Enable compile cache feature to load hyper parameter data from python
Merge pull request !26683 from LiangZhibo/mindir
4 years ago
hesham
cf3ff77cfd
AutoTune support non-sink mode
Add time sts to iterator tracing
4 years ago
i-robot
d251800c62
!26705 Add the check of function return None.
Merge pull request !26705 from Margaret_wangrui/return_none
4 years ago
i-robot
cfc6ea32ff
!24714 replace rtmemcpyxx to acl memcpy
Merge pull request !24714 from jjfeing/br_replace_rtmemcpyxx_with_acl_api
4 years ago
Margaret_wangrui
f9a384456a
Add the check of function return None.
4 years ago
l00591931
21df240f23
Enable mindir to load initialize weight from python
4 years ago
He Wei
41dcac9c49
Replace std::unordered_map/set with robin-hood-hashing
Robin-hood-hashing (https://github.com/martinus/robin-hood-hashing )
is considered faster than std::unordered_map/set,
so we use it to improve mindspore performance.
1. robin_hood header file in `third_party/robin_hood/include`;
2. In `utils/hash_map.h` and `utils/hash_set.h`, we define:
- mindspore::HashMap as an alias of robin_hood::unordered_map;
- mindspore::HashSet as an alias of robin_hood::unordered_set;
3. Replace:
- `#include <unordered_map>` --> `#include "utils/hash_map.h"`;
- `#include <unordered_set>` --> `#include "utils/hash_set.h"`;
- `std::unordered_map` --> `mindspore::HashMap`;
- `std::unordered_set` --> `mindspore::HashSet`;
- `map.insert(std::pair(key, value))` --> `map.emplace(key, value)`;
- `[] (const std::pair<K, V> &p) {..} ` --> `[] (const auto &p) {..} `;
4. Fix issues found by switching to robin_hood:
- AnfNodeConfig hash and equal;
- Fix a bug in `Slice::operator==()`;
- Fix a bug in `CNode::HasPrimalAttr()`;
- Fix map.erase() usage bugs: `map.erase(iter++)` --> `iter = map.erase(iter)`;
- Fix some iterator invalidation problems;
5. Some std::unordered_map/set cannot be replaced by robin_hood:
- When used as parameters of functions that are exposed to Python by pybind11;
- When used with a bad hash that causes robin_hood::map overflow, such as AbstractBasePtrListHasher;
6. Update cpp unit tests;
7. Add build option '-F' to enable robin_hood, default on.
4 years ago
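The `map.erase(iter++)` --> `iter = map.erase(iter)` change listed in the commit above can be sketched as follows. This is only an illustration under stated assumptions, not the MindSpore code: the `sketch` namespace, `EraseOddValues`, and `MakeSample` are hypothetical names, and `HashMap` is aliased to `std::unordered_map` here so the example builds without the third-party robin_hood header. The point is that erasing returns the next valid iterator, so the loop does not rely on an iterator that the container may have invalidated.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <unordered_map>

namespace sketch {
// Hypothetical stand-in for the alias described in the commit message.
// Assumption: std::unordered_map is used here so the sketch is self-contained.
template <typename K, typename V>
using HashMap = std::unordered_map<K, V>;

// Removes every entry whose value is odd and returns how many were removed.
inline std::size_t EraseOddValues(HashMap<std::string, int> *map) {
  assert(map != nullptr);
  std::size_t removed = 0;
  for (auto iter = map->begin(); iter != map->end();) {
    if (iter->second % 2 != 0) {
      // erase() returns the iterator following the removed element,
      // so we continue from it instead of from a stale iterator.
      iter = map->erase(iter);
      ++removed;
    } else {
      ++iter;
    }
  }
  return removed;
}

// Builds a small sample map using emplace(key, value) rather than
// insert(std::pair(key, value)), mirroring item 3 of the commit message.
inline HashMap<std::string, int> MakeSample() {
  HashMap<std::string, int> m;
  m.emplace("a", 1);
  m.emplace("b", 2);
  m.emplace("c", 3);
  m.emplace("d", 4);
  return m;
}
}  // namespace sketch
```

Calling `sketch::EraseOddValues(&m)` on the map returned by `sketch::MakeSample()` removes the entries for "a" and "c" and leaves "b" and "d".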
i-robot
310bbcebe1
!26539 [GraphKernel] support lite_adapter for graph kernel.
Merge pull request !26539 from chenlei_autodiff/lite_callback
4 years ago
jjfeing
05485d991c
replace api with acl api
4 years ago
chenlei_autodiff
4c0d5dcfe7
[GraphKernel] support lite_adapter for graph kernel.
4 years ago
i-robot
70363899e7
!24080 [assistant][ops] Add new audio operator ComputeDeltas
Merge pull request !24080 from Rainfor/compute_detlas
4 years ago
i-robot
c6821bde0a
!21648 [assistant][ops] Add new data loading operator YesNoDataset
Merge pull request !21648 from 杨旭华/YesNoDataset
4 years ago
vddong
f465cd0242
[feat][assistant][I3T96L]add new dataset operator YesNo
4 years ago
i-robot
55463892e4
!26402 add mem offload strategy
Merge pull request !26402 from kisnwang/add-mem-offload-strategy
4 years ago
i-robot
3c39afad11
!24547 [MS][crowdfunding]New operator implementation, AutoAugment
Merge pull request !24547 from yangwm/autoaugment
4 years ago
i-robot
ce00ee1ad1
!25367 use acl api to control profiling
Merge pull request !25367 from yanghaitao/yht_condation_start_profiler
4 years ago
zhuchuang
4c3931f5c9
[feat] [assistant] [I3CKEE] add new audio operator ComputeDeltas
4 years ago
i-robot
9d6248194e
!26310 MindSpore support load custom aicpu kernels.
Merge pull request !26310 from linqingke/aicpu
4 years ago
kswang
7c312bd38c
add mem offload strategy
4 years ago
despicablemme
09897ce7ec
[feat] [assistant] [I40GZP] add new data op AutoAugment
4 years ago
yanghaitao1
c94aa6b872
use profiler acl api instead
4 years ago
linqingke
bef2923acf
MindSpore support load custom aicpu ops.
4 years ago
wangkc123
706f9e2cbb
[feat] [assistant] [I3CKEH] add new audio operator SlidingWindowCmn
4 years ago
i-robot
58b69a05ee
!20576 [assistant][ops] add new dataset loading operator LJSpeechDataset
Merge pull request !20576 from 杨旭华/LJSpeechDataset
4 years ago
i-robot
6bdd38399a
!25811 fault_recover_by_mirror_group
Merge pull request !25811 from yao_yf/fault_recover_by_mirror_group
4 years ago
mohammad
5c8ab5f60c
add MD Profiler Save()
4 years ago
vddong
6fe2d35ae2
[feat][assistant][I3T96D] add new dataset loading operator LJSpeechDataset
4 years ago
yao_yf
501b978d16
find data parallel common group in auto parallel
4 years ago
i-robot
04da5c2808
!26140 Add callback support to BatchOp Datasets to enable Add/Remove workers
Merge pull request !26140 from h.farahat/cb_batch
4 years ago
i-robot
e154dca4b1
!22193 [assistant][ops] Add new loader operator DBpedia
Merge pull request !22193 from 杨旭华/DBpedia
4 years ago
i-robot
0f07408425
!26034 F.grad support sens_param and fix graph_mode bug
Merge pull request !26034 from zhang_sss/grad
4 years ago
i-robot
6c587dc2d3
!25391 Support to profiling parallel strategy
Merge pull request !25391 from ougongchang/profiling_stategy
4 years ago
Carry955
0d26c38693
[feat][assistant][I40GXT] add new loader DBpedia
4 years ago
i-robot
7a73bae5c3
!26036 add output strategy for matmul operator
Merge pull request !26036 from yangzhenzhang/add-output-strategy-for-op-init
4 years ago
hesham
2662b6d5c3
Add callback to Batch op
4 years ago
zhang__sss
0dd68aa7eb
grad
4 years ago
ougongchang
9229f1c1ff
profiler support to collect parallel strategy info
Forcibly splitting the SetNodeOutputType function into multiple functions would reduce readability, so the lizard scan is blocked for it.
4 years ago
i-robot
b910870ecc
!19986 [assistant][ops] Add new dataset loading operator AGNEWS
Merge pull request !19986 from 杨旭华/AGNEWS
4 years ago
zetongzhao
2de8142c44
profiler test
4 years ago
Megalomania
c34bbabed9
[feat] [assistant] [I3T96M] add new loader AGNEWS
4 years ago
yangzhenzhang
8431ba616c
add output strategy for op init
4 years ago
i-robot
8bf7e28fa6
!25410 add dump flag for fusion nodes
Merge pull request !25410 from yuchaojie/ir_fusion3
4 years ago