2268 Commits (v1.8.2)

Author SHA1 Message Date
  megvii-mge c42ce93705 feat(mge/third_party): update cutlass version 3 years ago
  温娟 9902ccfcb0 chore(release): bump version 3 years ago
  Megvii Engine Team 8e5410e41f feat(cuda): add fp16 compute 16 kernel 3 years ago
  Megvii Engine Team 472e2f9655 refactor(cuda): depthwish large kernel 3 years ago
  Megvii Engine Team e698ec20c2 feat(cuda): float16 depthwise large kernel conv compute fp32 3 years ago
  Megvii Engine Team 48406382ce feat(cuda): support float16 depthwise large kernel conv 4 years ago
  Megvii Engine Team 7042f76b34 perf(cuda): speedup conv backward data with small feature map and large filter size 4 years ago
  Megvii Engine Team 87a2aeebb1 perf(cuda): speedup chanwise conv with small feature map and large filter size 4 years ago
  Megvii Engine Team 2293385e93 feat(mge): add conv padding mode 4 years ago
  Megvii Engine Team afe9c4b50d feat(dnn/cuda): add implicit bmm kernels for large kernel depthwise convolution backward filter opr 3 years ago
  Megvii Engine Team e8a169292f feat(dnn/cuda): add heuristic rule for implicit batched gemm large kernel dwconv2d kernels 4 years ago
  Megvii Engine Team 38067472d2 fix(dnn/cuda): fix ci 4 years ago
  Megvii Engine Team 1da58ae17a feat(dnn/cuda): add implicit bmm large kernel dwconv2d dgrad kernels 4 years ago
  Megvii Engine Team 96050073a2 feat(dnn/cuda): add implicit bmm large kernel dwconv2d fprop impl 4 years ago
  温娟 19fe2e94e7 chore(release): bump version 4 years ago
  Megvii Engine Team 1add4517ad test(trace): test subtensor on unknown shape 4 years ago
  Megvii Engine Team 54eef55871 fix(trace): assume result is not scalar when shape is valid 4 years ago
  Megvii Engine Team 84d99d1cc4 fix(traced_module): fix Module compatible issue and traced module getattr check 4 years ago
  Megvii Engine Team 275b63114d fix(imperative): fix use collections error from python3.10 4 years ago
  Megvii Engine Team 95ac055538 feat(dnn,mgb,imperative): add diag opr implement 4 years ago
  Megvii Engine Team 39d77fb55a feat(arm): add arm rnn_cell/lstm_cell/lstm optimized kernel 4 years ago
  Megvii Engine Team 3ddc32d3e3 feat(android/whl): support android whl 4 years ago
  Megvii Engine Team f509b1be9b fix(build): split elemwise_multi_type cpp 4 years ago
  Megvii Engine Team 3252016e05 Merge pull request #401 from LosReturn:patch-1 4 years ago
  Megvii Engine Team f7e034b506 feat(lite): add global layout transform python interface for lite 4 years ago
  Megvii Engine Team e70c07a223 feat(lite): add global layout transform c/c++ interface for lite 4 years ago
  Megvii Engine Team 86ee4638bf Merge pull request #402 from AA1HSHH:docstring-reshape 4 years ago
  Megvii Engine Team 3251f50114 fix(mgb/cuda-stub): add libcuda-wrap_11.4.h to fit the CUDA11.4 toolchain 4 years ago
  Megvii Engine Team 2c2df83051 fix(cmake): enable custom op when building develop to avoid the pytest fail 4 years ago
  Megvii Engine Team ee0b95e935 feat(dnn/elemwise/arm_common): support part of arm ternary elemwise multithread 4 years ago
  Megvii Engine Team 7ea104d788 Revert "fix(mge): replace _full_sync by sync" 4 years ago
  Megvii Engine Team cbbca5fb10 feat(mge): add softmax op use cudnn api 4 years ago
  Megvii Engine Team 1d2510b6d7 fix(module): fix module dumped in old version without _short_name attr 4 years ago
  Megvii Engine Team cf5e9488bb fix(traced_module): fix module trace transformation 4 years ago
  Megvii Engine Team 97c90d9137 feat(traced_module): add _exclude_from_trace 4 years ago
  Megvii Engine Team 30e565e5b8 fix(traced_module): fix error message 4 years ago
  Megvii Engine Team de8ffe0c12 refactor(imperative): unify interpreter option setting 4 years ago
  Megvii Engine Team 8b60bdfa10 fix(mge): replace _full_sync by sync 4 years ago
  Megvii Engine Team 20b42a8c3b fix(dnn): add naive lstm kernel 4 years ago
  Megvii Engine Team 2faa6ea5a9 Merge pull request #213 from kxz18:rnn 4 years ago
  Megvii Engine Team f5b8fec4ca fix(imperative): remove big tensor from host side 4 years ago
  Megvii Engine Team 68cde8734e fix(mge/imperative): support broadcast with None 4 years ago
  Megvii Engine Team 0bdd0b1467 refactor(dispatch): switch to new dispatch system 4 years ago
  Megvii Engine Team d3689c3f3c feat(imperative/python): add transformation manager 4 years ago
  Megvii Engine Team 9ce1f0f5d1 refactor(dispatch): implement grad 4 years ago
  Megvii Engine Team c609c031f1 refactor(dispatch): implement symbol 4 years ago
  Megvii Engine Team e32929dfd2 refactor(dispatch): implement scalar 4 years ago
  Megvii Engine Team 59084fa857 refactor(dispatch): implement lazy_eval 4 years ago
  Megvii Engine Team d2b67c2a88 refactor(dispatch): implement trace 4 years ago
  Megvii Engine Team 39ac606b9c refactor(dispatch): implement eval 4 years ago