Megvii Engine Team
273c0e8745
fix(autodiff): fix some bugs in relation to 2nd order grad
1. implement double backward for batchnorm
2. fix grad attach in nested grad manager
3. pad empty tensor for unsatisfied output_has_grad
4. support double backward for jit subgraph
5. support double backward for autodiff.Function
6. re-add debug flag MGE_LOG_OP_DISPATCH
GitOrigin-RevId: cd31ddc620
3 years ago
Megvii Engine Team
d56570d929
fix(megbrain): add rdnn to copybara
GitOrigin-RevId: 7d8bf77053
3 years ago
Megvii Engine Team
12a3ef8d01
refactor(fastrun): decouple fastrun from computing graph
GitOrigin-RevId: 27abd22295
3 years ago
Megvii Engine Team
2b80806f21
perf(imperative/src): improve dot performance
GitOrigin-RevId: 35b5bd164f
3 years ago
Megvii Engine Team
1709b3940b
perf(mge/functional): speed up Broadcast and Reshape
GitOrigin-RevId: a72f5460b6
3 years ago
Megvii Engine Team
3e206d899b
perf(mge/functional): speed up Split
GitOrigin-RevId: 43550a0706
3 years ago
Megvii Engine Team
8446626193
perf(imperative/src): improve elemwise
GitOrigin-RevId: 78aa487277
3 years ago
Megvii Engine Team
e400b7ffe5
perf(imperative): enable memory forwarding for imperative
GitOrigin-RevId: 7c1993979c
4 years ago
Megvii Engine Team
0cb60d646d
feat(imperative): add output_descs for apply_on_physical_tensor
GitOrigin-RevId: 5b036c2c5a
3 years ago
Megvii Engine Team
fea46ea9a4
perf(imperative): add opr cache for apply_on_physical_tensor
GitOrigin-RevId: fc5d5fb34d
4 years ago
Megvii Engine Team
ea4e6ab93a
fix(mgb/opr): fix shape cache of NvOF
GitOrigin-RevId: 456ba478e9
4 years ago
Megvii Engine Team
87de704a46
feat(gopt): fuse conv h_swish
GitOrigin-RevId: a3d12991fb
3 years ago
Megvii Engine Team
3726f5cc92
feat(gopt): merge consecutive relayout and dimshuffle into one relayout to optimize CD4 performance
GitOrigin-RevId: a058776be3
3 years ago
Megvii Engine Team
1fead9b6b0
feat(gopt): merge consecutive dimshuffle and relayout into one relayout to optimize CD4 performance
GitOrigin-RevId: 16f22baa80
3 years ago
Megvii Engine Team
26d1e4f7ed
feat(gopt): optimize cd4 pass rule for elemwise and typecvt to let cd4 start as soon as possible
GitOrigin-RevId: 6580dedca7
3 years ago
Megvii Engine Team
5f4501e0f3
fix(gopt): fix conv bias fuse 2 noline
GitOrigin-RevId: a6ab9f4e5e
3 years ago
Megvii Engine Team
7d2063e35a
perf(cuda): speedup conv backward data with small feature map and large filter size
GitOrigin-RevId: 85592bca6b
4 years ago
Megvii Engine Team
28d48f2f7a
fix(mgb/src): fix megbrain cmake not supporting android_nn
GitOrigin-RevId: 037c197912
4 years ago
Megvii Engine Team
187c1dc081
fix(jit): copy aux var when shallow copying JITExecutor
GitOrigin-RevId: 3b331e1c17
4 years ago
Megvii Engine Team
b6ce02a152
fix(subgraph): fall back to cg if jit is unsupported
GitOrigin-RevId: 853a00a402
4 years ago
Megvii Engine Team
c55fda9a7c
fix(fastrun): don't kill profiling worker
GitOrigin-RevId: 99a0f11a5a
4 years ago
Megvii Engine Team
aa587446fc
feat(subgraph): support shape inference for CompiledOp
GitOrigin-RevId: a96b8f3446
4 years ago
Megvii Engine Team
bdb853ee6f
fix(mgb): fix extra device malloc when loading MultipleDeviceTensorWithFormatHolder
GitOrigin-RevId: adf4a7f77a
4 years ago
Megvii Engine Team
e2b79ea00e
feat(mgb): reduce the number of trtruntimeopr create contexts
GitOrigin-RevId: 14e5d1769e
4 years ago
Megvii Engine Team
95ac055538
feat(dnn,mgb,imperative): add diag opr implementation
GitOrigin-RevId: 43016ffa2b
4 years ago
Megvii Engine Team
cbbca5fb10
feat(mge): add softmax op using cudnn api
GitOrigin-RevId: 7734ebf8c4
4 years ago
Megvii Engine Team
20b42a8c3b
fix(dnn): add naive lstm kernel
GitOrigin-RevId: f08ef810cf
4 years ago
Megvii Engine Team
2faa6ea5a9
Merge pull request #213 from kxz18:rnn
GitOrigin-RevId: 9e9215c115
4 years ago
Megvii Engine Team
85ea882cb5
fix(mgb/ops): immutable tensor support empty storage
GitOrigin-RevId: 2851498fce
4 years ago
Megvii Engine Team
4b0ecb5deb
fix(ops/recv): use std::vector to store shape to support scalar
GitOrigin-RevId: e1dac3c919
4 years ago
Megvii Engine Team
f4f20046c4
fix(mgb): fix tensorrt runtimeopr get output var shape bug
GitOrigin-RevId: b830706a89
4 years ago
Megvii Engine Team
1999307015
feat(mgb/opr): add dropout kernel
GitOrigin-RevId: d248bd2005
4 years ago
Megvii Engine Team
a93741815b
feat(mgb/opr): add layernorm forward and backward kernel
GitOrigin-RevId: 0cd484e753
4 years ago
Megvii Engine Team
1657b8e881
fix(fastrun): fix persistent_cache in redis
GitOrigin-RevId: ada5862b05
4 years ago
Megvii Engine Team
a404cd7d06
fix(mgb/src): add tensorRT version check
GitOrigin-RevId: 7abfd30cab
4 years ago
Megvii Engine Team
c53cad2049
feat(cmake): format all cmake files
GitOrigin-RevId: 0a4ecab99b
4 years ago
Megvii Engine Team
6011f51001
style(all): fix clang-format for MGB_DEFINE inside another macro
GitOrigin-RevId: 8c2b6a2aed
4 years ago
Megvii Engine Team
7231257efc
fix(imperative/fastrun): fix workspace limit for cpu compnode
GitOrigin-RevId: 4583ce6d4b
4 years ago
Megvii Engine Team
a72e0cb568
feat(imperative,src): add jit builder for custom op
GitOrigin-RevId: 3bb0b46311
4 years ago
Megvii Engine Team
93310c0e4b
fix(mgb/gopt): fix cpu global layout transform fastrun error
GitOrigin-RevId: ea254297e5
4 years ago
Megvii Engine Team
8624ec224b
fix(mgb): fix param merge bug that caused the weight statistics error
GitOrigin-RevId: f76a096832
4 years ago
Megvii Engine Team
46d4bd8a59
feat(windows): let sdk not care about more macros on win
GitOrigin-RevId: c522c2fd63
4 years ago
Megvii Engine Team
202b407149
fix(core): fix output var replaced by optpass
GitOrigin-RevId: aea62de345
4 years ago
Megvii Engine Team
e715423f20
feat(src/gopt): add optpass on arm for fusing typecvt and elemwise to elemwise multi type
GitOrigin-RevId: e6bcbbf91b
4 years ago
Megvii Engine Team
d9a46ea47b
fix(dnn): correct behaviour of floor div for int tensor
GitOrigin-RevId: 1444f69cce
4 years ago
Megvii Engine Team
cf1db2616e
fix(fastrun): replace py_redis with cpp_redis to avoid deadlock
GitOrigin-RevId: 9af7fa5c97
4 years ago
Megvii Engine Team
390d2bb545
feat(mgb): tensorrt runtime opr support multiple profiles
GitOrigin-RevId: 1157d34e4d
4 years ago
Megvii Engine Team
1708ab2ec6
feat(mgb): add tensorrt runtime dynamic batch testcase
GitOrigin-RevId: 36372437ff
4 years ago
Megvii Engine Team
87c845fd61
feat(mgb): tensorrt runtime opr support dynamic batch trt model
GitOrigin-RevId: 7461de704e
4 years ago
Megvii Engine Team
ce119ef5a5
fix(lite): fix lite error when record level is 2
GitOrigin-RevId: 7dabfd8876
4 years ago