tacyi139
7413775490
optimize error about BroadcastOpGrad
4 years ago
wilfChen
cf63527a15
multinomial profiling
4 years ago
i-robot
bfd190482f
!26842 Speed up random normal sampling
Merge pull request !26842 from zichun_ye/random_normal_speed_up
4 years ago
zhujingxuan
30c6fa7f9b
bind stream with handle
4 years ago
Zichun Ye
8398c07d68
update random normal op impl to speed up sampling
4 years ago
wenbean
31053edbe4
Use Allocator and workspace to pre-allocate memory in GPU
4 years ago
wenbean
26d4bf6350
Fix memory leak bug, add expected result
4 years ago
wenbean
13409f519f
Unify GPU/CPU ops input/output (col/row major), modify related testcases, add linalg function and testcases
4 years ago
i-robot
b5c02a4ee0
!26426 gpu environment kernel
Merge pull request !26426 from chenweifeng/gpu-environment-kernel
4 years ago
wilfChen
68260a6a94
gpu environment kernel implement
4 years ago
wenbean
9b305b231d
Add GPU eigenvalues/eigenvectors for symmetric matrix (real and complex)
add CUDA ops real to complex (imag is 0j)
CPU/GPU eigh ops already return complex eigenvalues for complex input
and return real eigenvalues for real input
4 years ago
z00512249
36032e7ee2
add cholesky, cho_factor primitives and GPU backend implementations
4 years ago
hezhenhao1
d61b089f6b
Support int32 as input type for Abs of GPU op, float64 as input type for IsFinite GPU op.
4 years ago
zhujingxuan
fb1805de30
add GPU trsm 2d matrix support
4 years ago
i-robot
ba0e1a810e
!25457 add broadcast GPU float64 registration
Merge pull request !25457 from zhujingxuan/master
4 years ago
zhujingxuan
7441353d9e
add unit_diagonal option for solve triangular
4 years ago
zhujingxuan
8f45ddf39d
add float64 registration
4 years ago
zhujingxuan
28987d787d
add trsm
4 years ago
i-robot
0495ed8630
!25119 add cholesky factorization for gpu backend
Merge pull request !25119 from zhuzhongrui/pub_master
4 years ago
z00512249
a125654fbc
add cholesky && lu factorization for gpu backend
4 years ago
zhunaipan
8ce4e62725
optimize the comment and log description
modified: ops/operations/_inner_ops.py
modified: ops/operations/_quant_ops.py
modified: ops/operations/array_ops.py
modified: ops/operations/comm_ops.py
modified: ops/operations/math_ops.py
modified: ops/operations/quantum_ops.py
modified: ops/operations/rl_ops.py
modified: ops/operations/sponge_ops.py
modified: ops/operations/sponge_update_ops.py
modified: train/__init__.py
modified: common/tensor.py
modified: train/serialization.py
modified: ccsrc/pipeline/jit/parse/parse.h
modified: explainer/benchmark/_attribution/metric.py
modified: ops/composite/multitype_ops/_constexpr_utils.py
modified: ops/operations/comm_ops.py
modified: RELEASE.md
modified: mindspore/_extends/parse/standard_method.py
modified: mindspore/ccsrc/backend/kernel_compiler/cpu/concat_offset_cpu_kernel.cc
modified: mindspore/ccsrc/backend/kernel_compiler/cpu/dynamic_shape_cpu_kernel.cc
modified: mindspore/ccsrc/frontend/parallel/ops_info/reshape_info.cc
modified: mindspore/ccsrc/frontend/parallel/ops_info/tile_info.cc
modified: mindspore/ccsrc/frontend/parallel/ops_info/transpose_info.cc
modified: mindspore/ccsrc/frontend/parallel/strategy.h
modified: mindspore/common/tensor.py
modified: mindspore/core/abstract/prim_arrays.cc
modified: mindspore/core/abstract/prim_nn.cc
modified: mindspore/core/ops/conv2d.cc
modified: mindspore/core/ops/logical_and.h
modified: mindspore/core/ops/logical_not.h
modified: mindspore/core/ops/logical_or.h
modified: mindspore/core/ops/reduce_all.h
modified: mindspore/core/ops/reduce_any.h
modified: mindspore/lite/src/runtime/kernel/arm/fp32_grad/sgd.cc
modified: mindspore/nn/layer/quant.py
modified: mindspore/nn/optim/sgd.py
modified: mindspore/nn/sparse/sparse.py
modified: mindspore/numpy/array_creations.py
modified: mindspore/numpy/array_ops.py
modified: mindspore/numpy/logic_ops.py
modified: mindspore/numpy/math_ops.py
modified: mindspore/ops/operations/_inner_ops.py
modified: mindspore/ops/operations/array_ops.py
modified: mindspore/ops/operations/rl_ops.py
modified: mindspore/train/_utils.py
modified: tests/ut/python/model/test_lenet_core_after_exception.py
modified: mindspore/_extends/parse/standard_method.py
modified: mindspore/ops/operations/rl_ops.py
modified: mindspore/core/abstract/prim_nn.cc
modified: mindspore/core/ops/conv2d.cc
modified: mindspore/ccsrc/backend/kernel_compiler/cpu/ctcloss_cpu_kernel.cc
modified: mindspore/ccsrc/backend/kernel_compiler/cpu/fl/fused_pull_weight_kernel.h
modified: mindspore/ccsrc/backend/kernel_compiler/cpu/fl/fused_push_weight_kernel.h
modified: mindspore/ccsrc/backend/kernel_compiler/cpu/mkldnn/conv2d_grad_filter_cpu_kernel.cc
modified: mindspore/ccsrc/backend/kernel_compiler/cpu/mkldnn/conv2d_grad_input_cpu_kernel.cc
modified: mindspore/ccsrc/backend/kernel_compiler/cpu/ps/sparse_apply_ftrl_ps_kernel.cc
modified: mindspore/ccsrc/backend/kernel_compiler/cpu/ps/sparse_apply_lazy_adam_ps_kernel.cc
modified: mindspore/ccsrc/backend/kernel_compiler/cpu/rolling_cpu_kernel.cc
modified: mindspore/ccsrc/backend/kernel_compiler/cpu/scatter_arithmetic_cpu_kernel.cc
modified: mindspore/ccsrc/backend/kernel_compiler/cpu/split_cpu_kernel.cc
modified: mindspore/ccsrc/backend/kernel_compiler/cpu/update_cache_cpu_kernel.cc
modified: mindspore/ccsrc/backend/kernel_compiler/gpu/arrays/split_gpu_kernel.h
modified: mindspore/ccsrc/backend/kernel_compiler/gpu/math/broadcast_gpu_kernel.h
modified: mindspore/ccsrc/backend/kernel_compiler/gpu/nn/conv2d_grad_input_gpu_kernel.h
modified: mindspore/ccsrc/fl/server/server.cc
modified: mindspore/ccsrc/frontend/optimizer/ad/kpynative.cc
modified: mindspore/ccsrc/frontend/optimizer/irpass/incorporate_getitem.h
modified: mindspore/ccsrc/frontend/optimizer/irpass/inline.h
modified: mindspore/ccsrc/minddata/dataset/core/device_tensor.cc
modified: mindspore/ccsrc/minddata/dataset/core/tensor.cc
modified: mindspore/ccsrc/minddata/dataset/engine/datasetops/source/emnist_op.cc
modified: mindspore/ccsrc/minddata/dataset/engine/datasetops/source/mnist_op.cc
modified: mindspore/ccsrc/minddata/dataset/engine/datasetops/source/qmnist_op.cc
modified: mindspore/ccsrc/minddata/dataset/engine/ir/datasetops/dataset_node.cc
modified: mindspore/ccsrc/minddata/dataset/engine/opt/pre/epoch_ctrl_pass.cc
modified: mindspore/ccsrc/minddata/dataset/kernels/image/lite_image_utils.cc
modified: mindspore/ccsrc/pipeline/jit/action.cc
modified: mindspore/ccsrc/pipeline/jit/static_analysis/evaluator.cc
modified: mindspore/ccsrc/runtime/device/ascend/executor/tiling/op_tiling_adapter.cc
modified: mindspore/compression/quant/quant_utils.py
modified: mindspore/core/abstract/prim_nn.cc
modified: mindspore/dataset/engine/validators.py
modified: mindspore/lite/micro/coder/opcoders/nnacl/fp32/affine_fp32_coder.cc
modified: mindspore/lite/micro/coder/opcoders/nnacl/int8/affine_int8_coder.cc
modified: mindspore/lite/src/runtime/kernel/ascend310/src/custom_kernel.cc
modified: mindspore/lite/src/runtime/kernel/opencl/kernel/matmul.cc
modified: mindspore/lite/src/runtime/kernel/opencl/kernel/strassen.cc
modified: mindspore/lite/tools/common/graph_util.h
modified: mindspore/lite/tools/optimizer/fisson/fisson_util.cc
modified: mindspore/ops/composite/math_ops.py
modified: mindspore/ops/operations/_inner_ops.py
modified: mindspore/ops/operations/array_ops.py
modified: mindspore/ops/operations/math_ops.py
modified: mindspore/ops/operations/other_ops.py
modified: mindspore/boost/boost_cell_wrapper.py
modified: mindspore/ccsrc/backend/kernel_compiler/cpu/update_cache_cpu_kernel.cc
modified: mindspore/ccsrc/common/trans.cc
modified: mindspore/ccsrc/frontend/parallel/cache_embedding/cache_embedding.cc
modified: mindspore/ccsrc/frontend/parallel/ops_info/gather_info.cc
modified: mindspore/lite/src/common/log_util.h
modified: mindspore/nn/wrap/loss_scale.py
modified: mindspore/parallel/nn/moe.py
modified: tests/mindspore_test_framework/mindspore_test.py
modified: mindspore/ccsrc/backend/kernel_compiler/cpu/split_cpu_kernel.cc
modified: mindspore/lite/tools/common/graph_util.h
modified: mindspore/ccsrc/frontend/parallel/ops_info/gather_info.cc
modified: mindspore/core/ops/conv2d.cc
modified: tests/ut/python/model/test_lenet_core_after_exception.py
4 years ago
lizhenyu
29982ecdd7
some bugfix for parameter server cache
4 years ago
zhangyihui
a94b3dbcfe
clean up the static alarms of the second batch of operator groups
4 years ago
i-robot
4c8854ac02
!24403 Clean up the static alarms of the first batch of operator groups
Merge pull request !24403 from 张毅辉/static_alarms_of_operator_group_for_the_first_batch
4 years ago
zhangyihui
27a80a75c0
Clean up the first batch of static alarms of operator group
4 years ago
wangshuide2020
7a1862a6e6
add vector size check, input shape check and divide by zero check for gpu operators.
4 years ago
wangshuide2020
a35a1fe67d
add vector size check, nullptr check and clean code for gpu operators.
4 years ago
i-robot
bb9597e570
!23811 add validation of vector size and non-zero validation of denominator for nn gpu operators.
Merge pull request !23811 from wangshuide/wsd_master_new
4 years ago
wangshuide2020
e06beb2ed4
add validation of vector size and non-zero validation of denominator for nn gpu operators.
4 years ago
zhouyaqiang
f76cb53cfe
Add complex ops and bprop of real, conj, imag ops
4 years ago
zhouyaqiang
dad375abb9
add gpu complex ops
4 years ago
Peilin Wang
ecb3e6332e
initial commit: fixed python class
more python doc fix
type support for neg for sub grad
fix typos
add uint8 and int8 to neg for sub grad
change value to updates in python class
add int8 support for gathernd for sub grad
max grad
grad for max/min
after testing
add grad for min
fix ci
fix ci
4 years ago
ms_yan
36a8886ca2
Revert "[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset"
This reverts commit b077aa1cab.
Revert "[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset"
This reverts commit 4e6f7dc97d.
delete pass_registry_test.cc
comment hiai_nlu_model_multi.pb related line
4 years ago
djc
4e6f7dc97d
[feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset
4 years ago
zong_shuai
f1eb2fe6bf
extend broadcast_gpu_kernel with truncate_div and truncate_mod
4 years ago
zong_shuai
ebe1a2d7f5
extend broadcast_gpu_kernel with truncate_div and truncate_mod
4 years ago
zong_shuai
336fabe0e6
extend broadcast_gpu_kernel with truncate_div and truncate_mod
4 years ago
zong_shuai
1c6dd3543f
implement truncatediv and truncatemod
4 years ago
zong_shuai
ef72e70cb0
implement truncatediv and truncatemod
4 years ago
zong_shuai
4f7a27319b
implement truncatediv and truncatemod
4 years ago
zong_shuai
ce116f7887
implement truncatediv and truncatemod
4 years ago
zong_shuai
ee03495ff5
implement truncatediv and truncatemod
4 years ago
zong_shuai
9ea9ba917b
implement truncatediv and truncatemod
4 years ago
i-robot
22e9299c17
!20885 add dtypes & fft kernels for SPONGE
Merge pull request !20885 from huangmengxi/sponge_ccsrc
4 years ago
huangmengxi
e32297dc6b
add dtypes for sponge
4 years ago
zhou_lili
9838029fb9
code clean of gpu-math
4 years ago
Peilin Wang
594571fd4c
initial commit: fix 11 dts tickets
fix ci
4 years ago
Peilin Wang
922bcf603c
bugfix, needs a device sync
4 years ago
i-robot
b9c178e6b7
!16616 GPU index_add op remove cuda device sync
Merge pull request !16616 from tom_chen/index_add
4 years ago
wilfChen
d68069a617
parameter-without-user
4 years ago