70 Commits (4d9073fcbaaaa0417be2b6b2fe7cbce9ce8b3ad4)

Author SHA1 Message Date
  Megvii Engine Team ff755451d2 refactor(mgb): move algo's name from info to desc and delete some algo's unnecessary param() method 5 years ago
  Megvii Engine Team 756c1eb7f2 fix(mgb/dnn): add cuda float naive matmul algo 5 years ago
  Megvii Engine Team 68f2e59763 fix(mgb(ci)): fix tx1 ci testcase 5 years ago
  Megvii Engine Team ba2ad46e54 feat(gopt): add deconv nchw4 int8 opt pass, add deconv nchw int8 5 years ago
  Megvii Engine Team 5d350fc843 feat(dnn/cuda): add deconv int8 and fix cutlass conv wrapper base on modify cutlass 2.4 5 years ago
  Megvii Engine Team c82d88751a fix(dnn/cuda): add cuda nchw int8 conv impl with nchw4 to fix cu111 compatibility 5 years ago
  Megvii Engine Team 97beae2fd8 fix(megdnn): fix megdnn benchmark testcase 5 years ago
  Megvii Engine Team 2de2222e46 feat(dnn/cuda): add cutlass batched gemv kernel for matmul operator 5 years ago
  Megvii Engine Team 973d2a0ac2 feat(dnn/cuda): add cutlass matmul using split k parallel 5 years ago
  Megvii Engine Team 03c921f7c4 feat(dnn/cuda): add cutlass matmul impls 5 years ago
  Megvii Engine Team cf27dd642c fix(cuda): use cudnn8.0.4 as cu111 default libs 5 years ago
  Megvii Engine Team 649e4dd750 test(cuda): fix test for cu111 5 years ago
  Megvii Engine Team c69359d00d fix(dnn/cuda): disable cudnn conv_bias kernels for NCHW4_NCHW tensor format 5 years ago
  Megvii Engine Team 0e3a6329ff build(cuda): support cu111 build 5 years ago
  Megvii Engine Team af42ce7e69 fix(megdnn): some fixes of execution policy 5 years ago
  Megvii Engine Team 821656aa4b refactor(megdnn): refactor brute force algo in batched matmul 5 years ago
  Megvii Engine Team 08ff62deb6 refactor(megdnn): refactor batched matmul algo in conv bias 5 years ago
  Megvii Engine Team 8773926ef8 refactor(megdnn): refactor matmul algo in conv bias 5 years ago
  Megvii Engine Team e4b71bdf64 refactor(megdnn): remove unnessary 1x1 algo 5 years ago
  Megvii Engine Team b04ad06f84 refactor(megdnn): refactor matmul algo in conv backward filter 5 years ago
  Megvii Engine Team 25089e520e refactor(megdnn): refactor matmul algo in conv backward data 5 years ago
  Megvii Engine Team 0d720653ac refactor(megdnn): add default algo for convolution forward 5 years ago
  Megvii Engine Team 659217acd2 refactor(megdnn): refactor bfloat16 convbias to recursive inteface 5 years ago
  Megvii Engine Team 4a1d52c9c6 refactor(megdnn): refactor bfloat16 matmul to recursive inteface 5 years ago
  Megvii Engine Team b8febaf91f refactor(megdnn): refactor bfloat16 convolutionbackwardfilter to recursive inteface 5 years ago
  Megvii Engine Team f14e0c17e7 feat(mgb): add recursive for fastrun and megdnn test 5 years ago
  Megvii Engine Team 364afec033 chore(mge): update copyright years 5 years ago
  Megvii Engine Team a85531dd0f feat(mgb/opr): add tqt opr 5 years ago
  Megvii Engine Team c3a4b2225d feat(dnn/cuda): add cutlass impls for fused convolution reformat operation 5 years ago
  Megvii Engine Team 5f44203d7b feat(dnn/cuda): add a cutlass impl for fusing convolution and dimshuffle 5 years ago
  Megvii Engine Team 61f917fb8e feat(dnn/cuda): add impl for fusing warp perspective and dimshuffle 5 years ago
  Megvii Engine Team 3bf73ff16f feat(dnn): add cuda preprocess fusion 5 years ago
  Megvii Engine Team 142f31a875 perf(dnn/cuda): change conv_bias heu, prefer dnn chanwise impl, dislike dnn batch gemm conv1x1 5 years ago
  Megvii Engine Team a1877ee0fa refactor(dnn): refactor algo interface, use algoinfo instead of global algorithm 5 years ago
  Megvii Engine Team 6f5d0febf1 perf(dnn/cuda): enhance performance for pooling forward 5 years ago
  Megvii Engine Team 6856ce9ce2 feat(dnn): support conv bias activation for nchw4 input tensor format and nchw output tensor format 5 years ago
  Megvii Engine Team 89ad33aeb3 feat(dnn/cuda): support weight preprocessing for cutlass algorithms 5 years ago
  Megvii Engine Team c03249c059 feat(dnn/opr): add megdnn fake quant opr 5 years ago
  Megvii Engine Team 739f927c4c feat(dnn/cuda): opt dp4a conv for small channel base on cutlass 5 years ago
  Megvii Engine Team 4aa277a203 refactor(dnn/cuda): misc 5 years ago
  Megvii Engine Team ba66e1d039 feat(dnn): add nchw_fp32 nchw44_qint8 cuda dct 5 years ago
  Megvii Engine Team edb32495c6 feat(dnn/opr): add megdnn adaptive pooling opr 5 years ago
  Megvii Engine Team 310c805f20 fix(dnn/cuda): use kernel parameter instead of user constant memory 5 years ago
  Megvii Engine Team 3a03fa7a50 fix(dnn/cuda): disable pascal sass conv2d 5 years ago
  Megvii Engine Team a5fad7d07c feat(dnn): add compile for riscv64 5 years ago
  Megvii Engine Team 76fa71573b feat(dnn/cuda): add cutlass nchw4 convolution 5 years ago
  Megvii Engine Team 16324e3076 feat(dnn/cuda): add remap backward 5 years ago
  Megvii Engine Team 6e882c1a86 feat(whl/imperative): compat for build python whl imperative and legacy runtime 5 years ago
  Megvii Engine Team aeffcd5897 feat(dnn/cuda): integrate cutlass nchw32 tensorcore convolution 5 years ago
  Megvii Engine Team c7b6ef35c1 feat(dnn/cuda): add warp perspective backward mat idx 5 years ago