74 Commits (e715423f20d810670322474d46a1726f50979d9d)

Author SHA1 Message Date
  Megvii Engine Team e715423f20 feat(src/gopt): add optpass on arm for fusing typecvt and elemwise to elemwise multi type 4 years ago
  Megvii Engine Team 34773ba37b fix(mgb/gopt): tensorcore pass replace BatchConvBias inputs to nchw4 4 years ago
  Megvii Engine Team 369c2ccc5a style(all): reformat c++ code 4 years ago
  Megvii Engine Team 88c1eedbd7 feat(mgb/gopt): enable reduce for nchw44 4 years ago
  Megvii Engine Team c0ccd0ea7e feat(mge/bn): add NHWC support for bn 4 years ago
  Megvii Engine Team d7cc4628f6 perf(gopt): opt concat for OpenCL 4 years ago
  Megvii Engine Team a3cd3fc74f test(mgb/gopt): add testcase for global layout transform 4 years ago
  Megvii Engine Team 3eb0505f9b feat(imperative): add support for quantized conv transpose2d 4 years ago
  Megvii Engine Team 869a03271b perf(mgb): disable FoldingConvBiasDimshufflePass in cuda10 for performance 4 years ago
  Megvii Engine Team 239916a997 fix(mgb/gopt): fix testcase for enable nchw64 pass 4 years ago
  Megvii Engine Team 009c90a2fe feat(mgb/gopt): modify padding policy for 4bit conv bias oprs 4 years ago
  Megvii Engine Team b4687ce8da feat(dnn/cuda): add convolution with i8 input and u4 output 4 years ago
  Megvii Engine Team bba04f02e5 feat(mgb/gopt): add fusion support for conv, astype(s4) and reformat 4 years ago
  Megvii Engine Team 7d3df995cb feat(gopt/inference): allow Float32 output dtype in EnableNCHW4Pass 4 years ago
  Megvii Engine Team 47dcdf3e17 fix(mgb/core): fix dtype and resize modifiers for tensor 4 years ago
  Megvii Engine Team 0fb9cc41e4 fix(gopt): fix nchw64 opt pass 4 years ago
  Megvii Engine Team 86b69cacd0 fix(dnn): fixes for int4 4 years ago
  Megvii Engine Team 8da2f698a3 feat(dnn/cuda): support warp perspective/pooling op when channel not aligned to 64 4 years ago
  Megvii Engine Team ae6ff2c5a6 feat(mgb/gopt): add opt pass for nchw64 layout transform 4 years ago
  Megvii Engine Team 63a9bd30a8 feat(mgb/gopt): add an opt pass for padding channels to enable fast int8/int4 support on GPU 4 years ago
  Megvii Engine Team 36b1ba052f fix(mgb/dnn): fix cudnn8.0.4 convbias with z 4 years ago
  Megvii Engine Team 2d18074a70 fix(mgb): fix spell error 4 years ago
  Megvii Engine Team a437ec8e88 fix(src/gopt): add replace func of typecvt opr for nhwcd4 pass 4 years ago
  Megvii Engine Team 04b1a45af4 fix(dnn): fix cudnn crash when finalize called after cudnn dtor 4 years ago
  Megvii Engine Team 14a089c49d fix(dnn): change ci to cudnn804, reopen testcase 4 years ago
  Megvii Engine Team ba2ad46e54 feat(gopt): add deconv nchw4 int8 opt pass, add deconv nchw int8 4 years ago
  Megvii Engine Team a3ea1f153c feat(mgb/opr): add fast profile and combined Execution strategy 5 years ago
  Megvii Engine Team c82d88751a fix(dnn/cuda): add cuda nchw int8 conv impl with nchw4 to fix cu111 compatibility 4 years ago
  Megvii Engine Team 51868533c8 fix(mgb/gopt): fix opt pass elementwise operation shape issue at tranform to NCHW4 4 years ago
  Megvii Engine Team cf27dd642c fix(cuda): use cudnn8.0.4 as cu111 default libs 5 years ago
  Megvii Engine Team 649e4dd750 test(cuda): fix test for cu111 5 years ago
  Megvii Engine Team 2e4b9a42f7 fix(mgb/gopt): fix folding conv dimshuffle opt pass 5 years ago
  Megvii Engine Team 364afec033 chore(mge): update copyright years 5 years ago
  Megvii Engine Team 1b1ad56a82 fix(mgb/gopt): fix warp fusion opt pass 5 years ago
  Megvii Engine Team 4e9be159f7 feat(mgb/gopt): add opt pass for fusing convolution and reformat 5 years ago
  Megvii Engine Team 61f917fb8e feat(dnn/cuda): add impl for fusing warp perspective and dimshuffle 5 years ago
  Megvii Engine Team 3bf73ff16f feat(dnn): add cuda preprocess fusion 5 years ago
  Megvii Engine Team 5f171298aa feat(mgb/gopt): add AxisAddRemove opr support for cd4 opt pass 5 years ago
  Megvii Engine Team 6f5d0febf1 perf(dnn/cuda): enhance performance for pooling forward 5 years ago
  Megvii Engine Team 7cd71c3102 fix(mgb/gopt): fix cd4 elewise transform 5 years ago
  Megvii Engine Team cab46f2a00 fix(dnn): fix relayout format when group conv group=1 5 years ago
  Megvii Engine Team 4afa4b72c3 fix(mgb/oppass): fix fuse conv bias pass when bias is full bias and bias 5 years ago
  Megvii Engine Team 40e778fbd1 fix(mgb): fix-deconv-io16c32-pass-fail 5 years ago
  Megvii Engine Team ee2e2b3c7b fix(mgb/gopt): fix nchwxx optpass of no handle conv_bias opr which with no bias 5 years ago
  Megvii Engine Team 8f44d6ea60 fix(mgb): fix optpass fail at transform NCHW to NCHW4 when input dtype is float 5 years ago
  Megvii Engine Team fade97d4ef fix(mgb/gopt): fix convert batchnorm to elemwise pass issue 5 years ago
  Megvii Engine Team c20d4cc6dc feat(dnn): fix opt pass nchw44 can not dump resnet 5 years ago
  Megvii Engine Team 56381f808b fix(dnn/arm): use vcvtq_f32_s32 for all arm code 5 years ago
  Megvii Engine Team 1173205726 fix(gopt): nchw_nchwxx useable and opt pass use nchw_nchwxx_valid 5 years ago
  Megvii Engine Team eb18eba87d fix(gopt): fix nchw44 nchw44_dot gopt test 5 years ago