wangshuide2020
6cbe8dd02e
optimizes the kernel error description of LSTM, Pad, ReLU, etc.
4 years ago
wangshuide2020
674e3aa9d6
optimizes the kernel error description of Adagrad, Adam, Conv2d, etc.
4 years ago
zhunaipan
8ce4e62725
optimize the comment and log description
modified:   ops/operations/_inner_ops.py
modified:   ops/operations/_quant_ops.py
modified:   ops/operations/array_ops.py
modified:   ops/operations/comm_ops.py
modified:   ops/operations/math_ops.py
modified:   ops/operations/quantum_ops.py
modified:   ops/operations/rl_ops.py
modified:   ops/operations/sponge_ops.py
modified:   ops/operations/sponge_update_ops.py
modified:   train/__init__.py
modified:   common/tensor.py
modified:   train/serialization.py
modified:   ccsrc/pipeline/jit/parse/parse.h
modified:   explainer/benchmark/_attribution/metric.py
modified:   ops/composite/multitype_ops/_constexpr_utils.py
modified:   ops/operations/comm_ops.py
modified:   RELEASE.md
modified:   mindspore/_extends/parse/standard_method.py
modified:   mindspore/ccsrc/backend/kernel_compiler/cpu/concat_offset_cpu_kernel.cc
modified:   mindspore/ccsrc/backend/kernel_compiler/cpu/dynamic_shape_cpu_kernel.cc
modified:   mindspore/ccsrc/frontend/parallel/ops_info/reshape_info.cc
modified:   mindspore/ccsrc/frontend/parallel/ops_info/tile_info.cc
modified:   mindspore/ccsrc/frontend/parallel/ops_info/transpose_info.cc
modified:   mindspore/ccsrc/frontend/parallel/strategy.h
modified:   mindspore/common/tensor.py
modified:   mindspore/core/abstract/prim_arrays.cc
modified:   mindspore/core/abstract/prim_nn.cc
modified:   mindspore/core/ops/conv2d.cc
modified:   mindspore/core/ops/logical_and.h
modified:   mindspore/core/ops/logical_not.h
modified:   mindspore/core/ops/logical_or.h
modified:   mindspore/core/ops/reduce_all.h
modified:   mindspore/core/ops/reduce_any.h
modified:   mindspore/lite/src/runtime/kernel/arm/fp32_grad/sgd.cc
modified:   mindspore/nn/layer/quant.py
modified:   mindspore/nn/optim/sgd.py
modified:   mindspore/nn/sparse/sparse.py
modified:   mindspore/numpy/array_creations.py
modified:   mindspore/numpy/array_ops.py
modified:   mindspore/numpy/logic_ops.py
modified:   mindspore/numpy/math_ops.py
modified:   mindspore/ops/operations/_inner_ops.py
modified:   mindspore/ops/operations/array_ops.py
modified:   mindspore/ops/operations/rl_ops.py
modified:   mindspore/train/_utils.py
modified:   tests/ut/python/model/test_lenet_core_after_exception.py
modified:   mindspore/_extends/parse/standard_method.py
modified:   mindspore/ops/operations/rl_ops.py
modified:   mindspore/core/abstract/prim_nn.cc
modified:   mindspore/core/ops/conv2d.cc
modified:   mindspore/ccsrc/backend/kernel_compiler/cpu/ctcloss_cpu_kernel.cc
modified:   mindspore/ccsrc/backend/kernel_compiler/cpu/fl/fused_pull_weight_kernel.h
modified:   mindspore/ccsrc/backend/kernel_compiler/cpu/fl/fused_push_weight_kernel.h
modified:   mindspore/ccsrc/backend/kernel_compiler/cpu/mkldnn/conv2d_grad_filter_cpu_kernel.cc
modified:   mindspore/ccsrc/backend/kernel_compiler/cpu/mkldnn/conv2d_grad_input_cpu_kernel.cc
modified:   mindspore/ccsrc/backend/kernel_compiler/cpu/ps/sparse_apply_ftrl_ps_kernel.cc
modified:   mindspore/ccsrc/backend/kernel_compiler/cpu/ps/sparse_apply_lazy_adam_ps_kernel.cc
modified:   mindspore/ccsrc/backend/kernel_compiler/cpu/rolling_cpu_kernel.cc
modified:   mindspore/ccsrc/backend/kernel_compiler/cpu/scatter_arithmetic_cpu_kernel.cc
modified:   mindspore/ccsrc/backend/kernel_compiler/cpu/split_cpu_kernel.cc
modified:   mindspore/ccsrc/backend/kernel_compiler/cpu/update_cache_cpu_kernel.cc
modified:   mindspore/ccsrc/backend/kernel_compiler/gpu/arrays/split_gpu_kernel.h
modified:   mindspore/ccsrc/backend/kernel_compiler/gpu/math/broadcast_gpu_kernel.h
modified:   mindspore/ccsrc/backend/kernel_compiler/gpu/nn/conv2d_grad_input_gpu_kernel.h
modified:   mindspore/ccsrc/fl/server/server.cc
modified:   mindspore/ccsrc/frontend/optimizer/ad/kpynative.cc
modified:   mindspore/ccsrc/frontend/optimizer/irpass/incorporate_getitem.h
modified:   mindspore/ccsrc/frontend/optimizer/irpass/inline.h
modified:   mindspore/ccsrc/minddata/dataset/core/device_tensor.cc
modified:   mindspore/ccsrc/minddata/dataset/core/tensor.cc
modified:   mindspore/ccsrc/minddata/dataset/engine/datasetops/source/emnist_op.cc
modified:   mindspore/ccsrc/minddata/dataset/engine/datasetops/source/mnist_op.cc
modified:   mindspore/ccsrc/minddata/dataset/engine/datasetops/source/qmnist_op.cc
modified:   mindspore/ccsrc/minddata/dataset/engine/ir/datasetops/dataset_node.cc
modified:   mindspore/ccsrc/minddata/dataset/engine/opt/pre/epoch_ctrl_pass.cc
modified:   mindspore/ccsrc/minddata/dataset/kernels/image/lite_image_utils.cc
modified:   mindspore/ccsrc/pipeline/jit/action.cc
modified:   mindspore/ccsrc/pipeline/jit/static_analysis/evaluator.cc
modified:   mindspore/ccsrc/runtime/device/ascend/executor/tiling/op_tiling_adapter.cc
modified:   mindspore/compression/quant/quant_utils.py
modified:   mindspore/core/abstract/prim_nn.cc
modified:   mindspore/dataset/engine/validators.py
modified:   mindspore/lite/micro/coder/opcoders/nnacl/fp32/affine_fp32_coder.cc
modified:   mindspore/lite/micro/coder/opcoders/nnacl/int8/affine_int8_coder.cc
modified:   mindspore/lite/src/runtime/kernel/ascend310/src/custom_kernel.cc
modified:   mindspore/lite/src/runtime/kernel/opencl/kernel/matmul.cc
modified:   mindspore/lite/src/runtime/kernel/opencl/kernel/strassen.cc
modified:   mindspore/lite/tools/common/graph_util.h
modified:   mindspore/lite/tools/optimizer/fisson/fisson_util.cc
modified:   mindspore/ops/composite/math_ops.py
modified:   mindspore/ops/operations/_inner_ops.py
modified:   mindspore/ops/operations/array_ops.py
modified:   mindspore/ops/operations/math_ops.py
modified:   mindspore/ops/operations/other_ops.py
modified:   mindspore/boost/boost_cell_wrapper.py
modified:   mindspore/ccsrc/backend/kernel_compiler/cpu/update_cache_cpu_kernel.cc
modified:   mindspore/ccsrc/common/trans.cc
modified:   mindspore/ccsrc/frontend/parallel/cache_embedding/cache_embedding.cc
modified:   mindspore/ccsrc/frontend/parallel/ops_info/gather_info.cc
modified:   mindspore/lite/src/common/log_util.h
modified:   mindspore/nn/wrap/loss_scale.py
modified:   mindspore/parallel/nn/moe.py
modified:   tests/mindspore_test_framework/mindspore_test.py
modified:   mindspore/ccsrc/backend/kernel_compiler/cpu/split_cpu_kernel.cc
modified:   mindspore/lite/tools/common/graph_util.h
modified:   mindspore/ccsrc/frontend/parallel/ops_info/gather_info.cc
modified:   mindspore/core/ops/conv2d.cc
modified:   tests/ut/python/model/test_lenet_core_after_exception.py
4 years ago
wilfChen
54761ecccc
codedex
4 years ago
zhangyihui
a94b3dbcfe
clean up the static alarms of the second batch of operator groups
4 years ago
i-robot
4c8854ac02
!24403 Clean up the static alarms of the first batch of operator groups
Merge pull request !24403 from 张毅辉/static_alarms_of_operator_group_for_the_first_batch
4 years ago
i-robot
2c1d3baace
!24269 add vector size check, input shape check and divide by zero check for gpu operators.
Merge pull request !24269 from wangshuide/wsd_master
4 years ago
markuskunej
abdba421e5
added GetReductionInt to common_utils.h and replaced duplicated code in all loss with reduction gpu op kernels (nll loss, kl div loss, and binary cross entropy)
4 years ago
zhangyihui
27a80a75c0
Clean up the first batch of static alarms of operator group
4 years ago
wangshuide2020
7a1862a6e6
add vector size check, input shape check and divide by zero check for gpu operators.
4 years ago
i-robot
e81476d1b0
!24257 erase warning
Merge pull request !24257 from zong_shuai/erase_warning
4 years ago
zong_shuai
417096ddcc
erase_warning_1
4 years ago
wangshuide2020
a35a1fe67d
add vector size check, nullptr check and clean code for gpu operators.
4 years ago
wangshuide2020
e06beb2ed4
add validation of vector size and non-zero validation of denominator for nn gpu operators.
4 years ago
liangchenghui
7614129eda
Add isnan/isinf operator bprop.
4 years ago
i-robot
b49174cd40
!23096 Generalize gpu PadOp to support more than 4 dimensions
Merge pull request !23096 from Peilin/pad-bugfix
4 years ago
Peilin Wang
d7b23ca4b8
fix pad
remove 4d error testcase
fix ci
add 4d error nn back
4 years ago
i-robot
63114a3dfd
!22385 Fix bug in GPU conv3dtranspose
Merge pull request !22385 from fanrb/fix_conv3dtrans
4 years ago
simson
8a0087bceb
fix precision error of resizebilineargrad
4 years ago
fan1997
84a540e743
Fix bug in conv3dtranspose gpu
4 years ago
i-robot
f536d88570
!22821 modify resizebilineargrad input type
Merge pull request !22821 from Simson/opinfer
4 years ago
simson
f00e22342b
modify resizebilineargrad input type
4 years ago
i-robot
5e6287bec1
!22585 Add grad implementation of AdaptiveAvgPool2D
Merge pull request !22585 from zuochuanyong/adaptive_avgpool2d_grad
4 years ago
zuochuanyong
068191f222
add AdaptiveAvgPool2DGrad op
4 years ago
simson
7a2fbdda85
modify resizebilinear infer type
4 years ago
djc
b077aa1cab
[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset
4 years ago
djc
4e6f7dc97d
[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset
4 years ago
Peilin Wang
6a1b1495d9
initial commit: add nullptr exception in GetDeviceAddress
all cudnn functions now use the new GetPossiblyNullDeviceAddress
fix batchnorm
fix ci
fix nll loss
fix cast and concat
fix cast: skip kernel if null input and output
fix ci
fix concat: allow null input
fix concat: allow for null inputs
4 years ago
i-robot
22e9299c17
!20885 add dtypes & fft kernels for SPONGE
Merge pull request !20885 from huangmengxi/sponge_ccsrc
4 years ago
huangmengxi
e32297dc6b
add dtypes for sponge
4 years ago
danishfarid
92d9bc7ccd
fix for async mem_init bilinearResize_grad
fix - typo
4 years ago
Peilin Wang
594571fd4c
initial commit: fix 11 dts tickets
fix ci
4 years ago
zuochuanyong
1d565f9f8a
support MaxPool3DGrad on GPU
4 years ago
i-robot
d16e9bc3f6
!20369 GPU fix maxpoolgrad
Merge pull request !20369 from VectorSL/fix-maxpool
4 years ago
i-robot
fb33ba2b47
!19941 [MS][GPU] resizeBilinearGrad - Op FP16 fix
Merge pull request !19941 from danishfarid/resizeBilinearFix
4 years ago
VectorSL
a3590bca46
fix maxpool grad
4 years ago
danishfarid
aa37923aa5
first commit
typo fix
sep paths for fp32 and fp16 without fp32 copy
template dec fix
added 0 init for output for fp32 path
4 years ago
buxue
2b2efb0a75
fix prelu weight grad accuracy error fp16 on GPU
4 years ago
buxue
5bf41bfbd2
improve PReLU forward and implement backward on GPU
4 years ago
i-robot
8e043090be
!18472 Implement UNet3d on GPU
Merge pull request !18472 from likesen/master
4 years ago
likesen
99a995b432
Implement UNet3d on GPU
4 years ago
zuochuanyong
e890c2a2ae
fix accuracy error when input H is not equal to W
4 years ago
i-robot
26c7d274c9
!18441 Fix conv3d cudnn algorithm error
Merge pull request !18441 from tom_chen/conv3d
4 years ago
i-robot
6c33e0b710
!18392 fix the exception when occur error and replace magic number with const value.
Merge pull request !18392 from wangshuide/wsd_master
4 years ago
i-robot
9085be08b9
!18163 Support ConvTranspose3D on GPU
Merge pull request !18163 from likesen/master
4 years ago
tom__chen
35f6a1af56
fix conv3d cudnn algorithm error
4 years ago
Li Kesen
7d94095730
Support Conv3dTranspose for GPU
4 years ago
wangshuide2020
30690b1f27
fix the exception when occur error and replace magic number with const value.
4 years ago
buxue
d50d46013b
code security check
4 years ago
markuskunej
2fece8a7c2
added nll_loss_grad for gpu
4 years ago