771 Commits (9889e82ea23d358ccc9b75d5595a34eb80448e05)

Author SHA1 Message Date
  Megvii Engine Team fa4883389a feat(dnn,imperative): remove the restriction of tensor shape when using uint8 region mask 3 years ago
  Megvii Engine Team 0ebd4400d5 fix(dnn): fix the modulo of int 3 years ago
  Megvii Engine Team 68d2710810 fix(build): remove build so many warning on windows 3 years ago
  Megvii Engine Team 582dd4ceb8 fix(dnn/sfotmax): call cpu dispatch for softmax opr 3 years ago
  Megvii Engine Team 235d81ddb0 feat(dnn): add fp16 nchw88 im2col algo 3 years ago
  Megvii Engine Team dbd9483993 feat(dnn,src,imperative): add groupnorm op 3 years ago
  Megvii Engine Team 0a52b2587e fix(opencl/test): fix test weight preprocess filter UAF issue 3 years ago
  Megvii Engine Team 8fe8edf4d6 feat(dnn): add fp16 mk8 16x12 matmul algo 3 years ago
  Megvii Engine Team f444d4fe4d feat(dnn,imperative): region restricted conv support groups=1 even if 3 years ago
  Megvii Engine Team fa9d719f7e fix(gopt): fix global layout transform fold conv typecvt 3 years ago
  Megvii Engine Team ece454fd46 fix(third_party): fix cpuinfo related to sve2 3 years ago
  Megvii Engine Team 6db4620e6d feat(dnn): fix wgrad rrconv for compute capability 3 years ago
  Megvii Engine Team 4e9b1c4eee feat(dnn): add rrconv wgrad, support int32 and uint8 region mask 3 years ago
  Megvii Engine Team 977c207171 feat(dnn): add RegionRestrictedConv DGRAD support int32 and uint8 3 years ago
  Megvii Engine Team 543c9b77a8 feat(dnn): add RegionRestrictedConv cuda 3 years ago
  Megvii Engine Team fdec82ece5 feat(dnn): add naive RegionRestrictedConv 3 years ago
  Megvii Engine Team e9cc523741 fix(mgb): format code 3 years ago
  huangxinda a07fbf79f7 Merge pull request #484 from wangxiang9603:add-nchw44-deconv 3 years ago
  Megvii Engine Team ec234135a6 feat(lite): support discrete inputs 3 years ago
  Megvii Engine Team 58b682ca00 feat(dnn/cuda): add naive bmm 3 years ago
  Megvii Engine Team edd3ee67ce fix(mgb): add error infomation for old version load new elemwise mode 3 years ago
  Megvii Engine Team a7e28ebe8c fix(dnn): fix winograd load error and cpuinfo test error 3 years ago
  Megvii Engine Team 41b9db85e2 fix(mgb): make error infomation of advanced indexing out of bound more readable 3 years ago
  Megvii Engine Team f0291883b6 fix(mgb): make error infomation of group conv input channel mismatch more readable 3 years ago
  Megvii Engine Team d977079212 feat(third_party): update cpuinfo 3 years ago
  wangxiang fb2329e9db feat(dnn) add nchw44 deconv 3 years ago
  Megvii Engine Team 217999b1fa feat(arm): add winograd F43 NCHW44 algo and winograd F43 44 algo 3 years ago
  Megvii Engine Team 1529bce525 perf(opencl): add opencl weight transpose kernel 3 years ago
  Megvii Engine Team 5ee0094322 fix(dnn/cuda): fix ptx mma algo compute bugs 3 years ago
  Megvii Engine Team 1404437a90 fix(mgb): fix the compatibility issue of cuda stub with older version drivers 3 years ago
  Megvii Engine Team a6a2646c10 feat(arm): add AlgoFP32Winograd F43, and add filter size into name of winograd-related algorithms 3 years ago
  Megvii Engine Team b8821edb3d perf(dnn/aarch64): optimize aarch64 sigmoid with asm 3 years ago
  Megvii Engine Team 2b99bfec4e feat(arm): supports weight pre-processing for winograd benchmark tests 3 years ago
  Megvii Engine Team 421bcfd3d8 style(mgb/tools): add format for tools, dnn and ci 3 years ago
  Megvii Engine Team 116781ba9c fix(mgb): fix megtee build errors 3 years ago
  Megvii Engine Team 54b5db1729 feat(x86/rvv): add AGENT_NCHW_NCHW44 algo 3 years ago
  Megvii Engine Team eaa180181a feat(x86/rvv): opt gi intrinsic helper 3 years ago
  Megvii Engine Team 399db31aab fix(dnn): fix build 3 years ago
  Megvii Engine Team f31e52d521 feat(mgb): warpperspective support multi src input 3 years ago
  Megvii Engine Team 669816e291 feat(dnn): warpperspective support multi src input 3 years ago
  Megvii Engine Team 1b94380794 fix(dnn): fix reduce sum/mean error when b is large 3 years ago
  Megvii Engine Team c7a9909839 feat(cuda): add int4 ptx 256x64 mma kernel 3 years ago
  Megvii Engine Team cf3ca1e9a2 feat(cuda): add int4 ptx 128x256 mma kernel 3 years ago
  Megvii Engine Team 1f8e930e28 feat(cuda): add int4 ptx 128x128 mma kernel 3 years ago
  Megvii Engine Team 1a2ed8c47b feat(cuda): add convbias ptx algo testcase 3 years ago
  Megvii Engine Team 64551105f9 feat(cuda): add convbias ptx algo 3 years ago
  Megvii Engine Team 8395a459b5 fix(dnn/fallback): fix naive shift multidefination error and optimize GiCvtFromInt32V4ToUint8 3 years ago
  Megvii Engine Team 23a3d13350 fix(dnn/softmax): create redcue and elemwise opr when get workspace size 3 years ago
  Megvii Engine Team b3a7d149a0 feat(dnn/fallback): add some new gi api 3 years ago
  Megvii Engine Team fac67e7c2b feat(gopt): support nchw44 global pooling with fuse_grain 3 years ago