tacyi139
bb935faca9
optimizes the kernel error description of Split, Meshgrid, Select, etc.
4 years ago
i-robot
30d182ac18
!26626 fix reduce ops axis multiple bug in GPU
Merge pull request !26626 from zhangbuxue/fix_reduce_ops_axis_multiple_bug_in_GPU
4 years ago
buxue
89a688f3be
fix reduce ops axis multiple bug in GPU
4 years ago
hezhenhao1
accc6368aa
Add support float64 as input type for ReduceProd GPU op.
4 years ago
hezhenhao1
cb13783059
Add fp64 as input type fot GPU op of ReduceMin and ReduceMean.
4 years ago
i-robot
7f4d32800d
!25633 Solve Operator Memory Leakage
Merge pull request !25633 from zong_shuai/FixBug
4 years ago
zong-shuai
07dc66892e
OperateMemLeak
4 years ago
i-robot
475777af3f
!25341 Add concat op dtype.
Merge pull request !25341 from liangchenghui/add_concat_dtype
4 years ago
liangchenghui
7a6607a6a8
Add concat op dtype.
4 years ago
wangshuide2020
1c7e94862e
support -1 for the args of size of slice operator.
4 years ago
zhunaipan
8ce4e62725
optimize the comment and log description
修改: ops/operations/_inner_ops.py
修改: ops/operations/_quant_ops.py
修改: ops/operations/array_ops.py
修改: ops/operations/comm_ops.py
修改: ops/operations/math_ops.py
修改: ops/operations/quantum_ops.py
修改: ops/operations/rl_ops.py
修改: ops/operations/sponge_ops.py
修改: ops/operations/sponge_update_ops.py
修改: train/__init__.py
修改: common/tensor.py
修改: train/serialization.py
修改: ccsrc/pipeline/jit/parse/parse.h
修改: explainer/benchmark/_attribution/metric.py
修改: ops/composite/multitype_ops/_constexpr_utils.py
修改: ops/operations/comm_ops.py
修改: RELEASE.md
修改: mindspore/_extends/parse/standard_method.py
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/concat_offset_cpu_kernel.cc
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/dynamic_shape_cpu_kernel.cc
修改: mindspore/ccsrc/frontend/parallel/ops_info/reshape_info.cc
修改: mindspore/ccsrc/frontend/parallel/ops_info/tile_info.cc
修改: mindspore/ccsrc/frontend/parallel/ops_info/transpose_info.cc
修改: mindspore/ccsrc/frontend/parallel/strategy.h
修改: mindspore/common/tensor.py
修改: mindspore/core/abstract/prim_arrays.cc
修改: mindspore/core/abstract/prim_nn.cc
修改: mindspore/core/ops/conv2d.cc
修改: mindspore/core/ops/logical_and.h
修改: mindspore/core/ops/logical_not.h
修改: mindspore/core/ops/logical_or.h
修改: mindspore/core/ops/reduce_all.h
修改: mindspore/core/ops/reduce_any.h
修改: mindspore/lite/src/runtime/kernel/arm/fp32_grad/sgd.cc
修改: mindspore/nn/layer/quant.py
修改: mindspore/nn/optim/sgd.py
修改: mindspore/nn/sparse/sparse.py
修改: mindspore/numpy/array_creations.py
修改: mindspore/numpy/array_ops.py
修改: mindspore/numpy/logic_ops.py
修改: mindspore/numpy/math_ops.py
修改: mindspore/ops/operations/_inner_ops.py
修改: mindspore/ops/operations/array_ops.py
修改: mindspore/ops/operations/rl_ops.py
修改: mindspore/train/_utils.py
修改: tests/ut/python/model/test_lenet_core_after_exception.py
修改: mindspore/_extends/parse/standard_method.py
修改: mindspore/ops/operations/rl_ops.py
修改: mindspore/core/abstract/prim_nn.cc
修改: mindspore/core/ops/conv2d.cc
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/ctcloss_cpu_kernel.cc
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/fl/fused_pull_weight_kernel.h
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/fl/fused_push_weight_kernel.h
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/mkldnn/conv2d_grad_filter_cpu_kernel.cc
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/mkldnn/conv2d_grad_input_cpu_kernel.cc
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/ps/sparse_apply_ftrl_ps_kernel.cc
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/ps/sparse_apply_lazy_adam_ps_kernel.cc
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/rolling_cpu_kernel.cc
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/scatter_arithmetic_cpu_kernel.cc
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/split_cpu_kernel.cc
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/update_cache_cpu_kernel.cc
修改: mindspore/ccsrc/backend/kernel_compiler/gpu/arrays/split_gpu_kernel.h
修改: mindspore/ccsrc/backend/kernel_compiler/gpu/math/broadcast_gpu_kernel.h
修改: mindspore/ccsrc/backend/kernel_compiler/gpu/nn/conv2d_grad_input_gpu_kernel.h
修改: mindspore/ccsrc/fl/server/server.cc
修改: mindspore/ccsrc/frontend/optimizer/ad/kpynative.cc
修改: mindspore/ccsrc/frontend/optimizer/irpass/incorporate_getitem.h
修改: mindspore/ccsrc/frontend/optimizer/irpass/inline.h
修改: mindspore/ccsrc/minddata/dataset/core/device_tensor.cc
修改: mindspore/ccsrc/minddata/dataset/core/tensor.cc
修改: mindspore/ccsrc/minddata/dataset/engine/datasetops/source/emnist_op.cc
修改: mindspore/ccsrc/minddata/dataset/engine/datasetops/source/mnist_op.cc
修改: mindspore/ccsrc/minddata/dataset/engine/datasetops/source/qmnist_op.cc
修改: mindspore/ccsrc/minddata/dataset/engine/ir/datasetops/dataset_node.cc
修改: mindspore/ccsrc/minddata/dataset/engine/opt/pre/epoch_ctrl_pass.cc
修改: mindspore/ccsrc/minddata/dataset/kernels/image/lite_image_utils.cc
修改: mindspore/ccsrc/pipeline/jit/action.cc
修改: mindspore/ccsrc/pipeline/jit/static_analysis/evaluator.cc
修改: mindspore/ccsrc/runtime/device/ascend/executor/tiling/op_tiling_adapter.cc
修改: mindspore/compression/quant/quant_utils.py
修改: mindspore/core/abstract/prim_nn.cc
修改: mindspore/dataset/engine/validators.py
修改: mindspore/lite/micro/coder/opcoders/nnacl/fp32/affine_fp32_coder.cc
修改: mindspore/lite/micro/coder/opcoders/nnacl/int8/affine_int8_coder.cc
修改: mindspore/lite/src/runtime/kernel/ascend310/src/custom_kernel.cc
修改: mindspore/lite/src/runtime/kernel/opencl/kernel/matmul.cc
修改: mindspore/lite/src/runtime/kernel/opencl/kernel/strassen.cc
修改: mindspore/lite/tools/common/graph_util.h
修改: mindspore/lite/tools/optimizer/fisson/fisson_util.cc
修改: mindspore/ops/composite/math_ops.py
修改: mindspore/ops/operations/_inner_ops.py
修改: mindspore/ops/operations/array_ops.py
修改: mindspore/ops/operations/math_ops.py
修改: mindspore/ops/operations/other_ops.py
修改: mindspore/boost/boost_cell_wrapper.py
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/update_cache_cpu_kernel.cc
修改: mindspore/ccsrc/common/trans.cc
修改: mindspore/ccsrc/frontend/parallel/cache_embedding/cache_embedding.cc
修改: mindspore/ccsrc/frontend/parallel/ops_info/gather_info.cc
修改: mindspore/lite/src/common/log_util.h
修改: mindspore/nn/wrap/loss_scale.py
修改: mindspore/parallel/nn/moe.py
修改: tests/mindspore_test_framework/mindspore_test.py
修改: mindspore/ccsrc/backend/kernel_compiler/cpu/split_cpu_kernel.cc
修改: mindspore/lite/tools/common/graph_util.h
修改: mindspore/ccsrc/frontend/parallel/ops_info/gather_info.cc
修改: mindspore/core/ops/conv2d.cc
修改: tests/ut/python/model/test_lenet_core_after_exception.py
4 years ago
i-robot
8ea7a420e6
!24957 fix the broadcast size problem of meshgrid for gpu.
Merge pull request !24957 from wangshuide/wsd_master
4 years ago
hezhenhao1
ebcdbdc64a
Support int64 as input type for GPU OP TensorScatterUpdate.
4 years ago
wangshuide2020
8276309cb7
fix the broadcast size problem of meshgrid for gpu.
4 years ago
zhangyihui
a94b3dbcfe
clean up the static alarms of the second batch of operator groups
4 years ago
i-robot
25812728d8
!24496 Remove the scalar constraint of SliceGrad in CPU/GPU.
Merge pull request !24496 from hezhenhao1/slice
4 years ago
i-robot
4c8854ac02
!24403 Clean up the static alarms of the first batch of operator groups
Merge pull request !24403 from 张毅辉/static_alarms_of_operator_group_for_the_first_batch
4 years ago
i-robot
2c1d3baace
!24269 add vector size check, input shape check and divide by zero check for gpu operators.
Merge pull request !24269 from wangshuide/wsd_master
4 years ago
Peilin Wang
59631807be
initial commit: revert back to explicit implementation
initial commit revert variadic template
add missing instantiations fix ci
fix ci
4 years ago
hezhenhao1
5505d4b774
Remove the scalar constraint of SliceGrad in CPU/GPU.
4 years ago
zhangyihui
27a80a75c0
Clean up the first batch of static alarms of operator group
4 years ago
wangshuide2020
7a1862a6e6
add vector size check, input shape check and divide by zero check for gpu operators.
4 years ago
i-robot
e81476d1b0
!24257 erase warning
Merge pull request !24257 from zong_shuai/erase_warning
4 years ago
zong_shuai
417096ddcc
erase_warning_1
4 years ago
wangshuide2020
a35a1fe67d
add vector size check, nullptr check and clean code for gpu operators.
4 years ago
wangshuide2020
e06beb2ed4
add validation of vector size and non-zero validation of denominator for nn gpu operators.
4 years ago
Peilin Wang
2f7da9be3a
initial commit: variadic template slice kernel
fix ci
4 years ago
zhouyaqiang
dad375abb9
add gpu complex ops
4 years ago
i-robot
36be3f9c4f
!22601 impl batch_to_space and space_to_batch
Merge pull request !22601 from zong_shuai/impl_batch_to_space
4 years ago
zong_shuai
6ae3bc6dfe
space_to_batch
4 years ago
zong_shuai
a6fb09d744
space_to_batch
4 years ago
zong_shuai
5b9ba82ae4
space_batch
4 years ago
zong_shuai
b518a4ed33
modified commenting
4 years ago
zong_shuai
0388f8bccd
batchtospace
4 years ago
zong_shuai
4af395639f
clang-format
4 years ago
zong_shuai
a0424b6842
spacetobatch
4 years ago
zong_shuai
bc1a50147c
modified clang-format
4 years ago
zong_shuai
5a861816fd
spacetobatch modified cpplint
4 years ago
zong_shuai
5dfe1f0c1b
spacetobatch
4 years ago
zong_shuai
2b4273e6f7
spacetobatch
4 years ago
zong_shuai
6c2b64c0b7
spacetobatch
4 years ago
zong_shuai
971c38a0d2
batchtospace
4 years ago
zong_shuai
48732fb104
batchtospace
4 years ago
zong_shuai
f518b9e9fd
batchtospace
4 years ago
Peilin Wang
ecb3e6332e
initial commit: fixed python class
more python doc fix
type support for neg for sub grad
fix typos
add uint8 and int8 to neg for sub grad
change value to updates in python class
add int8 support for gathernd for sub grad
max grad
grad for max/min
after testing
add grad for min
fix ci
fix ci
4 years ago
djc
b077aa1cab
[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset
4 years ago
djc
4e6f7dc97d
[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset
4 years ago
Peilin Wang
6a1b1495d9
initial commit: add nullptr exception in GetDeviceAddress
all cudnn functions now use the new GetPossiblyNullDeviceAddress
fix batchnorm
fix ci
fix nll loss
fix cast and concat
fix cast: skip kernel if null input and output
fix ci
fix concat: allow null input
fix concat: allow for null inputs
4 years ago
zhou_lili
69a915d4eb
code review of gpu-array
4 years ago
i-robot
22e9299c17
!20885 add dtypes & fft kernels for SPONGE
Merge pull request !20885 from huangmengxi/sponge_ccsrc
4 years ago