91 Commits (8928c77c56b38dfc81d5b44cabedac886688fdfd)

Author SHA1 Message Date
  Megvii Engine Team 58c8746e30 fix(opr): fix fast-run error in cuda 5 years ago
  Megvii Engine Team 5d350fc843 feat(dnn/cuda): add deconv int8 and fix cutlass conv wrapper base on modify cutlass 2.4 5 years ago
  Megvii Engine Team a3ea1f153c feat(mgb/opr): add fast profile and combined Execution strategy 5 years ago
  Megvii Engine Team c82d88751a fix(dnn/cuda): add cuda nchw int8 conv impl with nchw4 to fix cu111 compatibility 5 years ago
  Megvii Engine Team f2b42bf09e chore(dotprod): add arm dotprod attribute for easy use 5 years ago
  Megvii Engine Team c33a717314 feat(dnn): repalce is_reproducible with algo attribute in opencl, cpu, rocm and cuda 5 years ago
  Megvii Engine Team 9cc732f82d fix(opencl): fix opencl search algo negative stride support 5 years ago
  Megvii Engine Team cd7090acbb fix(opencl): enable image on mali(cl2.1) 5 years ago
  Megvii Engine Team c51a687cef chore(mge): update copyright years 5 years ago
  Megvii Engine Team 7afa422df4 refactor(megdnn): refactor sub opr setter 5 years ago
  Megvii Engine Team f14e0c17e7 feat(mgb): add recursive for fastrun and megdnn test 5 years ago
  Megvii Engine Team 85fa988348 refactor(dnn): add get_algorithm_from_desc interface 5 years ago
  Megvii Engine Team 364afec033 chore(mge): update copyright years 5 years ago
  Megvii Engine Team 8f7f52ae4d feat(jit): add memfwd in jit executor opr 5 years ago
  Megvii Engine Team dfb2b2ce49 fix(dnn): change pooling window size smaller than padding constraint to log_error 5 years ago
  Megvii Engine Team a85531dd0f feat(mgb/opr): add tqt opr 5 years ago
  Megvii Engine Team 61f917fb8e feat(dnn/cuda): add impl for fusing warp perspective and dimshuffle 5 years ago
  Megvii Engine Team eb826422c4 fix(dnn): forbid pooling window size smaller than padding 5 years ago
  Megvii Engine Team fc0fcd2f7f chore(winograd): remove winograd transform code 5 years ago
  Megvii Engine Team d1adc9a22f fix(dnn): fix opencl algo search 5 years ago
  Megvii Engine Team 3bf73ff16f feat(dnn): add cuda preprocess fusion 5 years ago
  Megvii Engine Team 86cf7490ec feat(dnn/aarch64): add quantizeds4 matmul int4x4x16_k8x8x8 5 years ago
  Megvii Engine Team a1877ee0fa refactor(dnn): refactor algo interface, use algoinfo instead of global algorithm 5 years ago
  Megvii Engine Team 6856ce9ce2 feat(dnn): support conv bias activation for nchw4 input tensor format and nchw output tensor format 5 years ago
  Megvii Engine Team c03249c059 feat(dnn/opr): add megdnn fake quant opr 5 years ago
  Megvii Engine Team 1217801133 perf(mge): add opdef for broadcast 5 years ago
  Megvii Engine Team 2a3f4d099a refactor(dnn/arm): refactor CPU heuristic algo selection 5 years ago
  Megvii Engine Team ba66e1d039 feat(dnn): add nchw_fp32 nchw44_qint8 cuda dct 5 years ago
  Megvii Engine Team 215f88f373 fix(dnn/argmxx): fix argmxx on inf 5 years ago
  Megvii Engine Team edb32495c6 feat(dnn/opr): add megdnn adaptive pooling opr 5 years ago
  Megvii Engine Team 95eb6ae380 feat(mgb/opr): let more ops support empty IO 5 years ago
  Megvii Engine Team a5fad7d07c feat(dnn): add compile for riscv64 5 years ago
  Megvii Engine Team 3e11d89415 fix(dnn/dump): add more info for dump CD4 5 years ago
  Megvii Engine Team 16324e3076 feat(dnn/cuda): add remap backward 5 years ago
  Megvii Engine Team 6e882c1a86 feat(whl/imperative): compat for build python whl imperative and legacy runtime 5 years ago
  Megvii Engine Team 7f857bd471 feat(mgb/rocm): add cmake for rocm and fix compile errors and bn 5 years ago
  Megvii Engine Team 199eefbd4c fix(dnn): generate mode files 5 years ago
  Megvii Engine Team 9510136223 fix(mgb/rocm): remove begin-internal of rocm 5 years ago
  Megvii Engine Team 00ef677249 fix(mgb): remove internal for cambricon and atlas 5 years ago
  Megvii Engine Team a1e6720756 feat(dnn): enable bool comparison 5 years ago
  Megvii Engine Team 56381f808b fix(dnn/arm): use vcvtq_f32_s32 for all arm code 5 years ago
  Megvii Engine Team 1173205726 fix(gopt): nchw_nchwxx useable and opt pass use nchw_nchwxx_valid 5 years ago
  Megvii Engine Team 2272abe18d fix(mgb/fallback): disable nchw44 in conv1x1 and im2col in x86 5 years ago
  Megvii Engine Team 230ab45a1e fix(mgb/naive): fix naive convolution no dispatch kernel in handle 5 years ago
  Megvii Engine Team 6e70fa7a11 feat(dnn/arm): add fp32 asm gemm for a53 a55 and i8i8i16 gemm for a72 a53 5 years ago
  Megvii Engine Team c7b6ef35c1 feat(dnn/cuda): add warp perspective backward mat idx 5 years ago
  Megvii Engine Team e258812f12 feat(dnn): add bool dtype 5 years ago
  Megvii Engine Team 6bcc6faec8 feat(mge/imperative/opr): modify batch_norm to support frozen BN 5 years ago
  Megvii Engine Team f6018422fd perf(dnn/arm_common): add nchw44 winograd f73 5 years ago
  Megvii Engine Team 324af87807 feat(dnn/arm): add cpuinfo runtime check for x86 and arm 5 years ago