Megvii Engine Team
cc21855074
feat(lite): load_and_run support optimize for inference
GitOrigin-RevId: d9abb8de9e
3 years ago
Megvii Engine Team
626222c698
fix(test): fix test for brainpp docker env
GitOrigin-RevId: c4c2cc73d2
3 years ago
Megvii Engine Team
fac67e7c2b
feat(gopt): support nchw44 global pooling with fuse_grain
GitOrigin-RevId: 4c43a149f8
3 years ago
Megvii Engine Team
32b31fd578
fix(mgb): change the check method of cuda sm code
GitOrigin-RevId: 23dbc9b574
3 years ago
Megvii Engine Team
d2a1905ad5
Revert "feat(mgb): add cumprod opr"
This reverts commit 3436c3bdaa .
GitOrigin-RevId: 95ab3d1aa7
3 years ago
Megvii Engine Team
49e14f87b5
feat(mgb): add cumprod opr
GitOrigin-RevId: 3436c3bdaa
4 years ago
Megvii Engine Team
3fbceb3a38
fix(mgb/version): fix nvinfer.h not found
GitOrigin-RevId: 981dd9a7d2
4 years ago
Megvii Engine Team
d60d028a40
feat(mge/device): enable to get cuda/cudnn/tensorrt version
GitOrigin-RevId: 5864c61d10
4 years ago
Megvii Engine Team
6dfd5a4cd4
fix(win7): workaround for hang when progress exit on win7+32bit
GitOrigin-RevId: b49b0b230e
4 years ago
Megvii Engine Team
cfed86f9af
feat(persistentcache): change file persistent cache with append model
GitOrigin-RevId: 7a427bdab4
4 years ago
Megvii Engine Team
35cf0422f0
fix(ci): relax async timeout
GitOrigin-RevId: 551de481f0
4 years ago
Megvii Engine Team
a32b727720
fix(build): upgrade bazel riscv toolchains
GitOrigin-RevId: 8ac61cc4b6
4 years ago
Megvii Engine Team
24c5c19bf0
fix(imperative): make functional ops support negative axis
GitOrigin-RevId: f61e01270b
4 years ago
Megvii Engine Team
64a8aaaf42
fix(build): remove ununsed functions when cuda disabled
GitOrigin-RevId: 03f50683f6
4 years ago
Megvii Engine Team
e694301721
fix(ops/jit): skip lookup include path when nvcc executable not found
GitOrigin-RevId: f5e7dce1c5
4 years ago
Megvii Engine Team
c2deef1a97
feat(mge): aad atlas710 support
GitOrigin-RevId: 6458c5c23c
4 years ago
Megvii Engine Team
b36b5bd8cb
refactor(mgb): check input when profiling
GitOrigin-RevId: 1d722dd741
4 years ago
Megvii Engine Team
6c9b3a58e3
refactor(dnn): remove algorithm cache queries
GitOrigin-RevId: b7a1dc62d8
4 years ago
Megvii Engine Team
8563f51404
fix(imperative): fix buildin reduce keepdim
GitOrigin-RevId: 38d90ab38a
4 years ago
Megvii Engine Team
50faabf614
feat(serialization): support the registry for new serialization format
GitOrigin-RevId: 8eacd5e77c
4 years ago
Megvii Engine Team
98b5ee78c1
feat(mge/dnn): add lamb optimizer
GitOrigin-RevId: 5a27157456
4 years ago
Megvii Engine Team
02bfb8f8b9
feat(lite): add and fix some feature for load and run fitting mode
GitOrigin-RevId: bbddc9bb79
4 years ago
Megvii Engine Team
80e1f38bea
fix(gtest): fix ci error report stack-use-after-scope
how to reproduce the problem:
1: build with asan(revert this MR)
2: then taskset process to one cpu:
taskset 01 ./megbrain_test --gtest_filter=TestAsyncQueue.SynchronizerWaiterStarving
GitOrigin-RevId: eb6f7aa4d8
4 years ago
Megvii Engine Team
c2e9860feb
chore(license): remove all license in file header
GitOrigin-RevId: a0e31247a6
4 years ago
Megvii Engine Team
bde2efa3b5
feat(lite/load_and_run): support put and get model redis cache
GitOrigin-RevId: 55c82e28c1
4 years ago
Megvii Engine Team
48526abb79
fix(mgb): fix concat cd4 tensor check size invalid
GitOrigin-RevId: 065e0b4be0
4 years ago
Megvii Engine Team
c87d998e59
feat(mgb): add interface to support opencl IO zero copy when inference
GitOrigin-RevId: a1d7021892
4 years ago
Megvii Engine Team
a0e531180d
fix(src/comp_node): fix calling cuda driver api
GitOrigin-RevId: cc33af2ac4
4 years ago
Megvii Engine Team
e2f5156b69
refactor(megbrain): save fastrun result to algorithm cache
GitOrigin-RevId: 45301ebb4d
4 years ago
Megvii Engine Team
7dc347697a
feat(dnn/cuda): add typecvt uint16
GitOrigin-RevId: d1368c414e
4 years ago
Megvii Engine Team
b92866d2c2
fix(build): fix build depends dirty file issue
GitOrigin-RevId: 435d8b5c50
4 years ago
Megvii Engine Team
27d4c4b36c
refactor(stats): use static inline variable declaration
GitOrigin-RevId: 7d86e5f257
4 years ago
Megvii Engine Team
787a22a9d6
perf(tensor): implement __new__ in cpp
GitOrigin-RevId: 4defd249c3
4 years ago
Megvii Engine Team
99df4a7996
fix(dtype): dtype scalar set_retain_dtype supports bool
GitOrigin-RevId: aafd378e1b
4 years ago
Megvii Engine Team
7bf5b0ee1e
test(imperative): check env values after each pytest
GitOrigin-RevId: 826788113a
4 years ago
Megvii Engine Team
409c988163
fix(imperative): add matmul apply_on_varnode
GitOrigin-RevId: 2cf6bf237c
4 years ago
Megvii Engine Team
b9cbc10120
feat(lite): add pack model
GitOrigin-RevId: 1a150f2af3
4 years ago
Megvii Engine Team
7927e98fd6
perf(mge): speed up PixelShuffle
GitOrigin-RevId: 942e755745
4 years ago
Megvii Engine Team
1c2a323e78
feat(mge): add warning message when mismatched cuda sm is detected
GitOrigin-RevId: f78c79eb06
4 years ago
Megvii Engine Team
877bda4180
perf(mge): improve cross stream memory borrowing
GitOrigin-RevId: c68977c5dc
4 years ago
Megvii Engine Team
c2435d1561
perf(imperative): specialize adaptive pooling
GitOrigin-RevId: 01e1418458
4 years ago
Megvii Engine Team
c0b267fff6
refactor(cuda-stub): opt cuda-stub log
GitOrigin-RevId: 87dda08e1b
4 years ago
Megvii Engine Team
3949d425fb
feat(core): always show MegEngine version and git commit id
GitOrigin-RevId: 4daa5be6d6
4 years ago
Megvii Engine Team
2a900a69cb
perf(imperative): improve reduce op performance
GitOrigin-RevId: 26d982a7b8
4 years ago
Megvii Engine Team
273c0e8745
fix(autodiff): fix some bugs in relation to 2nd order grad
1. implement double backward for batchnorm
2. fix grad attach in nested grad manager
3. pad empty tensor for unsatisfied output_has_grad
4. support double backward for jit subgraph
5. support double backward for autodiff.Function
6. readd debug flag MGE_LOG_OP_DISPATCH
GitOrigin-RevId: cd31ddc620
4 years ago
Megvii Engine Team
12a3ef8d01
refactor(fastrun): decouple fastrun from computing graph
GitOrigin-RevId: 27abd22295
4 years ago
Megvii Engine Team
1709b3940b
perf(mge/functional): speed up Broadcast and Reshape
GitOrigin-RevId: a72f5460b6
4 years ago
Megvii Engine Team
3e206d899b
perf(mge/functional): speed up Split
GitOrigin-RevId: 43550a0706
4 years ago
Megvii Engine Team
e400b7ffe5
perf(imperative): enable memory forwarding for imperative
GitOrigin-RevId: 7c1993979c
4 years ago
Megvii Engine Team
0cb60d646d
feat(imperative): add output_descs for apply_on_physical_tensor
GitOrigin-RevId: 5b036c2c5a
4 years ago