93 Commits (1d64792b41fbd3c094e38cd6bbc6bf38b9bfd43e)

Author SHA1 Message Date
  Megvii Engine Team b87af9f77f feat(dnn/cuda): topk support fp16 5 years ago
  Megvii Engine Team 606540bef4 feat(dnn/cuda): add nhwc 4bit warp perspective 5 years ago
  Megvii Engine Team 1e6019436c feat(dnn/cuda): add nhwc int4 pooling 5 years ago
  Megvii Engine Team 319436dd14 feat(dnn/cuda): add cutlass impls for uint4 x int4 conv bias 5 years ago
  Megvii Engine Team d28eba4ea5 feat(dnn/cuda): add cutlass impls for int4 conv bias 5 years ago
  Megvii Engine Team 2d4e62ef58 feat(dnn/cuda): add cuda uint4 pooling 5 years ago
  Megvii Engine Team 19919384fc feat(dnn/cuda): add cuda uint warp perspective 5 years ago
  Megvii Engine Team 4a802d21ca feat(dnn/cuda): add conv u4xs4 sass kernel 5 years ago
  Megvii Engine Team adf75a291d perf(dnn/cuda): add sass int4 128x128 5 years ago
  Megvii Engine Team 8da2f698a3 feat(dnn/cuda): support warp perspective/pooling op when channel not aligned to 64 5 years ago
  Megvii Engine Team 4fe68ac9ed feat(dnn/cuda): support transforming layout between nchw and nchw64 when channel not aligned to 64 5 years ago
  Megvii Engine Team 56e863b7d4 fix(dnn/cuda): fix int4 epilogue stg bug 5 years ago
  Megvii Engine Team 12a0e61542 feat(dnn/cuda): add cuda elemwise int4 5 years ago
  Megvii Engine Team df1af59b5c feat(dnn): warp perspective support int4 5 years ago
  Megvii Engine Team 2398df079c feat(dnn/cuda): add cuda int4 pooling 5 years ago
  Megvii Engine Team e250afb08f feat(dnn/cuda): support conv_bias for nchw64 and qint4 5 years ago
  Megvii Engine Team 8fef78d06d feat(dnn/cuda): add relayout format when width is an odd number 5 years ago
  Megvii Engine Team 19a554d674 test(dnn/cuda): add testcase for transforming tensor layout between nchw and nchw64 5 years ago
  Megvii Engine Team 23032f50f2 feat(dnn/cuda): support float16 for index_incr_multi_axis_vec 5 years ago
  Megvii Engine Team 938944027d fix(mgb/dnn): fix cudnn8 convbias 5 years ago
  Megvii Engine Team 1997b1a289 feat(dnn/cuda): add correlation kernel 5 years ago
  Megvii Engine Team c3f8cf04fa feat(dnn): add conv_bwd_data and conv_bwd_filter accuracy shake check 5 years ago
  Megvii Engine Team 1e6ef3771f feat(mgb/dnn): add accuracy shake checker 5 years ago
  Megvii Engine Team ff755451d2 refactor(mgb): move algo's name from info to desc and delete some algo's unnecessary param() method 5 years ago
  Megvii Engine Team 756c1eb7f2 fix(mgb/dnn): add cuda float naive matmul algo 5 years ago
  Megvii Engine Team 68f2e59763 fix(mgb(ci)): fix tx1 ci testcase 5 years ago
  Megvii Engine Team ba2ad46e54 feat(gopt): add deconv nchw4 int8 opt pass, add deconv nchw int8 5 years ago
  Megvii Engine Team 5d350fc843 feat(dnn/cuda): add deconv int8 and fix cutlass conv wrapper based on modified cutlass 2.4 5 years ago
  Megvii Engine Team c82d88751a fix(dnn/cuda): add cuda nchw int8 conv impl with nchw4 to fix cu111 compatibility 5 years ago
  Megvii Engine Team 97beae2fd8 fix(megdnn): fix megdnn benchmark testcase 5 years ago
  Megvii Engine Team 2de2222e46 feat(dnn/cuda): add cutlass batched gemv kernel for matmul operator 5 years ago
  Megvii Engine Team 973d2a0ac2 feat(dnn/cuda): add cutlass matmul using split k parallel 5 years ago
  Megvii Engine Team 03c921f7c4 feat(dnn/cuda): add cutlass matmul impls 5 years ago
  Megvii Engine Team cf27dd642c fix(cuda): use cudnn8.0.4 as cu111 default libs 5 years ago
  Megvii Engine Team 649e4dd750 test(cuda): fix test for cu111 5 years ago
  Megvii Engine Team c69359d00d fix(dnn/cuda): disable cudnn conv_bias kernels for NCHW4_NCHW tensor format 5 years ago
  Megvii Engine Team 0e3a6329ff build(cuda): support cu111 build 5 years ago
  Megvii Engine Team af42ce7e69 fix(megdnn): some fixes of execution policy 5 years ago
  Megvii Engine Team 821656aa4b refactor(megdnn): refactor brute force algo in batched matmul 5 years ago
  Megvii Engine Team 08ff62deb6 refactor(megdnn): refactor batched matmul algo in conv bias 5 years ago
  Megvii Engine Team 8773926ef8 refactor(megdnn): refactor matmul algo in conv bias 5 years ago
  Megvii Engine Team e4b71bdf64 refactor(megdnn): remove unnecessary 1x1 algo 5 years ago
  Megvii Engine Team b04ad06f84 refactor(megdnn): refactor matmul algo in conv backward filter 5 years ago
  Megvii Engine Team 25089e520e refactor(megdnn): refactor matmul algo in conv backward data 5 years ago
  Megvii Engine Team 0d720653ac refactor(megdnn): add default algo for convolution forward 5 years ago
  Megvii Engine Team 659217acd2 refactor(megdnn): refactor bfloat16 convbias to recursive interface 5 years ago
  Megvii Engine Team 4a1d52c9c6 refactor(megdnn): refactor bfloat16 matmul to recursive interface 5 years ago
  Megvii Engine Team b8febaf91f refactor(megdnn): refactor bfloat16 convolutionbackwardfilter to recursive interface 5 years ago
  Megvii Engine Team f14e0c17e7 feat(mgb): add recursive for fastrun and megdnn test 5 years ago
  Megvii Engine Team 364afec033 chore(mge): update copyright years 5 years ago