i-robot
58fc9a6d81
!26985 fix the negative axis problem of reduce_eliminate
Merge pull request !26985 from huangbingjian/reduce_eliminate
4 years ago
i-robot
4b089a9661
!19466 Fix some scope error
Merge pull request !19466 from huanghui/fix-clone-same-scope
4 years ago
huangbingjian
66309632c1
fix the negative axis problem of reduce_eliminate
4 years ago
i-robot
53804eec0a
!26776 recompute state slice optimize
Merge pull request !26776 from yao_yf/add_transformer_slice_activation_config_opt
4 years ago
i-robot
e195131342
!26891 move key world 'trace' to interface DumpSourceLines
Merge pull request !26891 from liubuyu/SBB
4 years ago
lby
618d4b7dfe
move key 'trace' to interface DumpSourceLines
4 years ago
yao_yf
59934198aa
recompute output slice optimize
4 years ago
lianliguang
0dc5d7f4f9
modify some eroor log and remove some useless code
4 years ago
huanghui
c4d61629b5
fix wrong scope and debug info
4 years ago
i-robot
4a82477626
!26794 transformer_slice_activation_config_fix
Merge pull request !26794 from yao_yf/add_transformer_slice_activation_config_fix
4 years ago
yao_yf
facb0995f4
slice recompute state fix
4 years ago
i-robot
ed79ba11af
!26475 Support fused node corresponding code print
Merge pull request !26475 from huanghui/fused-debug-infos
4 years ago
i-robot
7dcdc7d0fb
!26400 [Fallback] add fallback testcases
Merge pull request !26400 from huangbingjian/eval_numpy
4 years ago
huanghui
767caad833
Support fused node corresponding code print
4 years ago
i-robot
3d0f9d8aae
!26683 Enable compile cache feature to load hyper parameter data from python
Merge pull request !26683 from LiangZhibo/mindir
4 years ago
huangbingjian
00f3fa99e2
add fallback testcases
4 years ago
i-robot
b472850a75
!26594 Replace std::unordered_map/set with robin-hood-hashing
Merge pull request !26594 from hewei/use_robin_hood
4 years ago
i-robot
519f14a909
!26006 slice recompute activation
Merge pull request !26006 from yao_yf/add_transformer_slice_activation_config
4 years ago
l00591931
21df240f23
Enable mindir to load initialize weight from python
4 years ago
He Wei
41dcac9c49
Replace std::unordered_map/set with robin-hood-hashing
Robin-hood-hashing (https://github.com/martinus/robin-hood-hashing )
is considered faster then std::unordered_map/set,
so we use it to improve mindspore performance.
1. robin_hood head file in `third_party/robin_hood/include`;
2. In `utils/hash_map.h` and `utils/hash_set.h`, we define:
- mindspore::HashMap as an alias of robin_hood::unordered_map;
- mindspore::HashSet as an alias of robin_hood::unordered_set;
3. Replace:
- `#include <unordered_map>` --> `#include "utils/hash_map.h"`;
- `#include <unordered_set>` --> `#include "utils/hash_set.h"`;
- `std::unordered_map` --> `mindspore::HashMap`;
- `std::unordered_set` --> `mindspore::HashSet`;
- `map.insert(std::pair(key, value))` --> `map.emplace(key, value)`;
- `[] (const std::pair<K, V> &p) {..} ` --> `[] (const auto &p) {..} `;
4. Fix issues found by switch to robin_hood:
- AnfNodeConfig hash and equal;
- Fix a bug in `Slice::operator==()`;
- Fix a bug in `CNode::HasPrimalAttr()`;
- Fix map.erase() usage bugs: `map.erase(iter++)` --> `iter = map.erase(iter)`;
- Fix some iterator invalidated problem;
5. Some std::unordered_map/set can not replace by robin_hood:
- As parameter of functions that exposed to python by pybind11;
- Use bad hash that cause robin_hood::map over_flow, such as AbstractBasePtrListHasher;
6. Update cpp unit tests;
7. Add build option '-F' to enable robin_hood, default on.
4 years ago
zhousiyi
3d8435ad45
Replace shortcircuit eval with deferred GetResult from AsyncAbstract when use
4 years ago
i-robot
e6ad3f42b1
!26524 clean code, fix cyclomatic complexity problems
Merge pull request !26524 from huangbingjian/clean_code_master
4 years ago
i-robot
3269c9b881
!26335 Support MindSpore on MacOS
Merge pull request !26335 from xulei/ms_mac_compile_br
4 years ago
yao_yf
188d39da83
slice_activation_in_recompute
slice recompute activation
4 years ago
huangbingjian
e623173965
clean code
4 years ago
i-robot
32fe511d76
!26198 After the env_getitem nodes are eliminated, eliminate the remaining env_setitem with updatestate.
Merge pull request !26198 from Margaret_wangrui/env_updatestate
4 years ago
xiefangqi
24ac2382ff
mac compile
4 years ago
i-robot
d36bdb54cd
!26330 Enable pynative ad to support function return none
Merge pull request !26330 from LiangZhibo/ad
4 years ago
Margaret_wangrui
d4e13790c9
After the env_getitem nodes are eliminated, eliminate the remaining env_setitem with updatestate
4 years ago
l00591931
cb43b38ccd
Enable pynative ad to return none in function output
4 years ago
huanghui
35cb09a536
compiler support dump flag
4 years ago
i-robot
9916e5844d
!26170 Fix cyclomatic complexity
Merge pull request !26170 from hewei/fix_ccn
4 years ago
i-robot
0f07408425
!26034 F.grad support sens_param and fix graph_mode bug
Merge pull request !26034 from zhang_sss/grad
4 years ago
He Wei
c7910cbd37
Fix cyclomatic complexity
4 years ago
zhang__sss
0dd68aa7eb
grad
4 years ago
l00591931
9e3da312e7
refactor load mindir
4 years ago
zhang__sss
8bae4d9d50
grad
4 years ago
He Wei
fd8bbb4d96
Optimize isa() and type id related code.
1. Let 'isa<T>()' calls 'IsSameTypeId()' when T is final,
and we add 'final' mark to some final classes derived from Base;
2. Generate type id at compile time using constexpr hash function,
remove TypeIdManager and related code;
3. Add a tool script 'scripts/check_tid.sh' to check tid uniqueness;
4. Improve type search in 'proto_exporter.cc' and 'convert_utils_py.cc';
5. Add some missed MS_DECLARE_PARENT;
6. Ensure calls of 'parent_t::IsFromTypeId()' are all inlined.
4 years ago
i-robot
6f413ee29d
!25155 Support ops, nn and numpy namespaces, and add test case.
Merge pull request !25155 from 张清华/opt_fallback
4 years ago
Zhang Qinghua
6a79e262ae
Support ops, nn and numpy namespaces, and add test case.
4 years ago
i-robot
6fa152ef1a
!25150 Add Send and Receive to recompute black list
Merge pull request !25150 from YuJianfeng/master
4 years ago
i-robot
8946a953c6
!25097 If the umonad only used by Load and one other updatestate, no need to replace
Merge pull request !25097 from Margaret_wangrui/consecutive_updatestate_eliminate_2
4 years ago
i-robot
ae87f31dea
!25088 Optimize the comment and log descriptions for the python source code
Merge pull request !25088 from zhunaipan/master
4 years ago
zhunaipan
8ce4e62725
optimize the comment and log description
修改: ops/operations/_inner_ops.py
修改: ops/operations/_quant_ops.py
修改: ops/operations/array_ops.py
修改: ops/operations/comm_ops.py
修改: ops/operations/math_ops.py
修改: ops/operations/quantum_ops.py
修改: ops/operations/rl_ops.py
修改: ops/operations/sponge_ops.py
修改: ops/operations/sponge_update_ops.py
修改: train/__init__.py
修改: common/tensor.py
修改: train/serialization.py
修改: ccsrc/pipeline/jit/parse/parse.h
修改: explainer/benchmark/_attribution/metric.py
修改: ops/composite/multitype_ops/_constexpr_utils.py
修改: ops/operations/comm_ops.py
修改: RELEASE.md
修改: mindspore/_extends/parse/standard_method.py
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/concat_offset_cpu_kernel.cc
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/dynamic_shape_cpu_kernel.cc
修改: mindspore/ccsrc/frontend/parallel/ops_info/reshape_info.cc
修改: mindspore/ccsrc/frontend/parallel/ops_info/tile_info.cc
修改: mindspore/ccsrc/frontend/parallel/ops_info/transpose_info.cc
修改: mindspore/ccsrc/frontend/parallel/strategy.h
修改: mindspore/common/tensor.py
修改: mindspore/core/abstract/prim_arrays.cc
修改: mindspore/core/abstract/prim_nn.cc
修改: mindspore/core/ops/conv2d.cc
修改: mindspore/core/ops/logical_and.h
修改: mindspore/core/ops/logical_not.h
修改: mindspore/core/ops/logical_or.h
修改: mindspore/core/ops/reduce_all.h
修改: mindspore/core/ops/reduce_any.h
修改: mindspore/lite/src/runtime/kernel/arm/fp32_grad/sgd.cc
修改: mindspore/nn/layer/quant.py
修改: mindspore/nn/optim/sgd.py
修改: mindspore/nn/sparse/sparse.py
修改: mindspore/numpy/array_creations.py
修改: mindspore/numpy/array_ops.py
修改: mindspore/numpy/logic_ops.py
修改: mindspore/numpy/math_ops.py
修改: mindspore/ops/operations/_inner_ops.py
修改: mindspore/ops/operations/array_ops.py
修改: mindspore/ops/operations/rl_ops.py
修改: mindspore/train/_utils.py
修改: tests/ut/python/model/test_lenet_core_after_exception.py
修改: mindspore/_extends/parse/standard_method.py
修改: mindspore/ops/operations/rl_ops.py
修改: mindspore/core/abstract/prim_nn.cc
修改: mindspore/core/ops/conv2d.cc
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/ctcloss_cpu_kernel.cc
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/fl/fused_pull_weight_kernel.h
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/fl/fused_push_weight_kernel.h
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/mkldnn/conv2d_grad_filter_cpu_kernel.cc
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/mkldnn/conv2d_grad_input_cpu_kernel.cc
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/ps/sparse_apply_ftrl_ps_kernel.cc
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/ps/sparse_apply_lazy_adam_ps_kernel.cc
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/rolling_cpu_kernel.cc
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/scatter_arithmetic_cpu_kernel.cc
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/split_cpu_kernel.cc
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/update_cache_cpu_kernel.cc
修改: mindspore/ccsrc/backend/kernel_compiler/gpu/arrays/split_gpu_kernel.h
修改: mindspore/ccsrc/backend/kernel_compiler/gpu/math/broadcast_gpu_kernel.h
修改: mindspore/ccsrc/backend/kernel_compiler/gpu/nn/conv2d_grad_input_gpu_kernel.h
修改: mindspore/ccsrc/fl/server/server.cc
修改: mindspore/ccsrc/frontend/optimizer/ad/kpynative.cc
修改: mindspore/ccsrc/frontend/optimizer/irpass/incorporate_getitem.h
修改: mindspore/ccsrc/frontend/optimizer/irpass/inline.h
修改: mindspore/ccsrc/minddata/dataset/core/device_tensor.cc
修改: mindspore/ccsrc/minddata/dataset/core/tensor.cc
修改: mindspore/ccsrc/minddata/dataset/engine/datasetops/source/emnist_op.cc
修改: mindspore/ccsrc/minddata/dataset/engine/datasetops/source/mnist_op.cc
修改: mindspore/ccsrc/minddata/dataset/engine/datasetops/source/qmnist_op.cc
修改: mindspore/ccsrc/minddata/dataset/engine/ir/datasetops/dataset_node.cc
修改: mindspore/ccsrc/minddata/dataset/engine/opt/pre/epoch_ctrl_pass.cc
修改: mindspore/ccsrc/minddata/dataset/kernels/image/lite_image_utils.cc
修改: mindspore/ccsrc/pipeline/jit/action.cc
修改: mindspore/ccsrc/pipeline/jit/static_analysis/evaluator.cc
修改: mindspore/ccsrc/runtime/device/ascend/executor/tiling/op_tiling_adapter.cc
修改: mindspore/compression/quant/quant_utils.py
修改: mindspore/core/abstract/prim_nn.cc
修改: mindspore/dataset/engine/validators.py
修改: mindspore/lite/micro/coder/opcoders/nnacl/fp32/affine_fp32_coder.cc
修改: mindspore/lite/micro/coder/opcoders/nnacl/int8/affine_int8_coder.cc
修改: mindspore/lite/src/runtime/kernel/ascend310/src/custom_kernel.cc
修改: mindspore/lite/src/runtime/kernel/opencl/kernel/matmul.cc
修改: mindspore/lite/src/runtime/kernel/opencl/kernel/strassen.cc
修改: mindspore/lite/tools/common/graph_util.h
修改: mindspore/lite/tools/optimizer/fisson/fisson_util.cc
修改: mindspore/ops/composite/math_ops.py
修改: mindspore/ops/operations/_inner_ops.py
修改: mindspore/ops/operations/array_ops.py
修改: mindspore/ops/operations/math_ops.py
修改: mindspore/ops/operations/other_ops.py
修改: mindspore/boost/boost_cell_wrapper.py
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/update_cache_cpu_kernel.cc
修改: mindspore/ccsrc/common/trans.cc
修改: mindspore/ccsrc/frontend/parallel/cache_embedding/cache_embedding.cc
修改: mindspore/ccsrc/frontend/parallel/ops_info/gather_info.cc
修改: mindspore/lite/src/common/log_util.h
修改: mindspore/nn/wrap/loss_scale.py
修改: mindspore/parallel/nn/moe.py
修改: tests/mindspore_test_framework/mindspore_test.py
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/split_cpu_kernel.cc
修改: mindspore/lite/tools/common/graph_util.h
修改: mindspore/ccsrc/frontend/parallel/ops_info/gather_info.cc
修改: mindspore/core/ops/conv2d.cc
修改: tests/ut/python/model/test_lenet_core_after_exception.py
4 years ago
i-robot
ef7bc4ddda
!25079 Change dir hash to file hash in bprop mindir
Merge pull request !25079 from YuJianfeng/bprop_mindir
4 years ago
yujianfeng
ffc269d29a
Add Send and Receive to recompute black list
4 years ago
i-robot
8cf8e4ddd3
!21778 support tensor slice when slice result should be caculated in runtime mode
Merge pull request !21778 from lianliguang/support-dynamic-slice
4 years ago
yujianfeng
42079ac087
Change dir hash to file hash in bprop mindir
4 years ago
Margaret_wangrui
e6c4dbdc67
If the umonad only used by Load and one other updatestate, no need to replace
4 years ago
i-robot
cf54d0c158
!24897 set abstract in TupleListGetitemDependReorder
Merge pull request !24897 from huangbingjian/abstract_master
4 years ago