702 Commits (4f60fbbb2fc02fe658a0ce229c1ee9a45b651606)

Author SHA1 Message Date
  Megvii Engine Team 273c0e8745 fix(autodiff): fix some bugs in relation to 2nd order grad 3 years ago
  Megvii Engine Team d56570d929 fix(megbrain): add rdnn to copybara 3 years ago
  Megvii Engine Team 12a3ef8d01 refactor(fastrun): decouple fastrun from computing graph 3 years ago
  Megvii Engine Team 2b80806f21 perf(imperative/src): improve dot performance 3 years ago
  Megvii Engine Team 1709b3940b perf(mge/functional): speed up Broadcast and Reshape 3 years ago
  Megvii Engine Team 3e206d899b perf(mge/functional): speed up Split 3 years ago
  Megvii Engine Team 8446626193 perf(imperative/src): improve elemwise 3 years ago
  Megvii Engine Team e400b7ffe5 perf(imperative): enable memory forwarding for imperative 4 years ago
  Megvii Engine Team 0cb60d646d feat(imperative): add output_descs for apply_on_physical_tensor 3 years ago
  Megvii Engine Team fea46ea9a4 perf(imperative): add opr cache for apply_on_physical_tensor 4 years ago
  Megvii Engine Team ea4e6ab93a fix(mgb/opr): fix shape cache of NvOF 4 years ago
  Megvii Engine Team 87de704a46 feat(gopt): fuse conv h_swish 3 years ago
  Megvii Engine Team 3726f5cc92 feat(gopt): merger consecutive relayout and dimshuffle to one relayout to optimize CD4 performarce 3 years ago
  Megvii Engine Team 1fead9b6b0 feat(gopt): merge consecutive dimshuffle and relayout to one relayout to optimize CD4 performace 3 years ago
  Megvii Engine Team 26d1e4f7ed feat(gopt): optimize cd4 pass rule for elemwise and typecvt to let cd4 start as soon as possible 3 years ago
  Megvii Engine Team 5f4501e0f3 fix(gopt): fix conv bias fuse 2 noline 3 years ago
  Megvii Engine Team 7d2063e35a perf(cuda): speedup conv backward data with small feature map and large filter size 4 years ago
  Megvii Engine Team 28d48f2f7a fix(mgb/src): fix megbrain cmake unsupport android_nn 4 years ago
  Megvii Engine Team 187c1dc081 fix(jit): copy aux var when shallow copying JITExecutor 4 years ago
  Megvii Engine Team b6ce02a152 fix(subgraph): fallback back to cg if jit unsupported 4 years ago
  Megvii Engine Team c55fda9a7c fix(fastrun): don't kill profiling worker 4 years ago
  Megvii Engine Team aa587446fc feat(subgraph): support shape inference for CompiledOp 4 years ago
  Megvii Engine Team bdb853ee6f fix(mgb): fix extra device malloc when load MultipleDeviceTensorWithFormatHolder 4 years ago
  Megvii Engine Team e2b79ea00e feat(mgb): reduce the number of trtruntimeopr create contexts 4 years ago
  Megvii Engine Team 95ac055538 feat(dnn,mgb,imperative): add diag opr implement 4 years ago
  Megvii Engine Team cbbca5fb10 feat(mge): add softmax op use cudnn api 4 years ago
  Megvii Engine Team 20b42a8c3b fix(dnn): add naive lstm kernel 4 years ago
  Megvii Engine Team 2faa6ea5a9 Merge pull request #213 from kxz18:rnn 4 years ago
  Megvii Engine Team 85ea882cb5 fix(mgb/ops): immutable tensor support empty storage 4 years ago
  Megvii Engine Team 4b0ecb5deb fix(ops/recv): use std::vector to store shape to support scalar 4 years ago
  Megvii Engine Team f4f20046c4 fix(mgb): fix tensorrt runtimeopr get output var shape bug 4 years ago
  Megvii Engine Team 1999307015 feat(mgb/opr): add dropout kernel 4 years ago
  Megvii Engine Team a93741815b feat(mgb/opr): add layernorm forward and backward kernel 4 years ago
  Megvii Engine Team 1657b8e881 fix(fastrun): fix persistent_cache in redis 4 years ago
  Megvii Engine Team a404cd7d06 fix(mgb/src): add tensorRT version check 4 years ago
  Megvii Engine Team c53cad2049 feat(cmake): format all cmake file 4 years ago
  Megvii Engine Team 6011f51001 style(all): fix clang-format for MGB_DEFINE inside another macro 4 years ago
  Megvii Engine Team 7231257efc fix(imperative/fastrun): fix worksapce limit for cpu compnode 4 years ago
  Megvii Engine Team a72e0cb568 feat(imperative,src): add jit builder for custom op 4 years ago
  Megvii Engine Team 93310c0e4b fix(mgb/gopt): fix cpu global layout transform fastrun error 4 years ago
  Megvii Engine Team 8624ec224b fix(mgb): fix param merge bug that caused the weight statistics error 4 years ago
  Megvii Engine Team 46d4bd8a59 feat(windows): let sdk do not care about more macro on win 4 years ago
  Megvii Engine Team 202b407149 fix(core): fix output var replaced by optpass 4 years ago
  Megvii Engine Team e715423f20 feat(src/gopt): add optpass on arm for fusing typecvt and elemwise to elemwise multi type 4 years ago
  Megvii Engine Team d9a46ea47b fix(dnn): correct behaviour of floor div for int tensor 4 years ago
  Megvii Engine Team cf1db2616e fix(fastrun): replace py_redis with cpp_redis to avoid deadlock 4 years ago
  Megvii Engine Team 390d2bb545 feat(mgb): tensorrt runtime opr support mutiple profiles 4 years ago
  Megvii Engine Team 1708ab2ec6 feat(mgb): add tensorrt runtime dynamic batch testcase 4 years ago
  Megvii Engine Team 87c845fd61 feat(mgb): tensorrt runtime opr support dynamic batch trt model 4 years ago
  Megvii Engine Team ce119ef5a5 fix(lite): fix lite error when record level is 2 4 years ago