498f7b2e
fix(opr): fix compile on GCC6 by
2020-09-07 19:31:41 +0800
b9f3077d
feat(opr): remove old weight preprocess adapter in fastrun by
2020-08-31 15:16:10 +0800
75eebb7c
feat(opr): use weight preprocess feature of MegDNN by
2020-07-20 22:37:14 +0800
66509a54
test(mgb): enable gmock by
2020-08-29 01:26:19 +0800
fe3ee3cd
refactor(opr): refactor OprArityTrait by
2020-07-20 20:34:24 +0800
91efd67d
refactor(mge/jit): change dump options, add test by
2020-09-14 13:05:54 +0800
099ffeac
chore(mge): typo and reword by
2020-09-14 13:54:30 +0800
539aac44
fix(dnn/cuda): fix sass loader by
2020-08-28 15:38:46 +0800
39d43283
fix(mge/functional): functional api fixes by
2020-09-09 18:21:42 +0800
023d60ae
fix(mgb/atlas): use aclmdlGetOutputSizeByIndex to create DataBuffer by
2020-09-10 15:14:50 +0800
116eee52
build(third_party): update megray by
2020-09-11 11:13:13 +0800
e507228e
feat(mge/examples): add distributed training examples using launcher by
2020-09-10 16:30:59 +0800
23437864
fix(mgb/jit): mlir doesn't support broadcast by
2020-09-10 19:30:32 +0800
f87bba68
feat(mgb/jit): add scalar support for mlir by
2020-09-07 11:21:38 +0800
11b121a7
fix(mgb/jit): link libdevice.bc when generate nvvm ir by
2020-09-05 23:23:48 +0800
aa7f28b8
fix(mgb/jit): fix gpu kernel args order by
2020-08-27 00:31:17 +0800
9767ca8f
feat(mgb/jit): refactor code and add more elemwise mode by
2020-08-27 00:31:17 +0800
f9109bed
feat(mge): add mlir cmake by
2020-08-14 13:11:03 +0800
a51d5b4c
feat(mgb/jit): add mlir backend for cpu and cuda by
2020-07-22 15:38:04 +0800
523ce65e
fix(mge/imperative): fix cgtools related tests by
2020-09-12 01:46:30 +0800
dd39265e
fix(mgb/dtype): enable TypeCvt for bool when trace(symbolic=True) by
2020-09-07 00:25:39 +0800
5c37a64e
feat(sdk/load_and_run): add dump with testcase for mge support imperative by
2020-09-09 11:01:45 +0800
ac11c38a
feat(mge/imperative): add graph load and cgtools for imperative by
2020-09-06 17:58:34 +0800
76f36796
fix(mge/trace): fix op order in symbolic by
2020-09-07 19:07:23 +0800
10e942d9
refactor(mge): polish api by
2020-09-10 12:44:06 +0800
9389a805
fix(mge/oprmm): fix grad for collective comm by
2020-09-09 14:39:15 +0800
6f581906
refactor(mge/profiler): refactor profiler by
2020-09-04 17:44:07 +0800
cc952b2b
fix(rocm): fix rocm megdnntest sleep and a cut code by
2020-09-04 14:25:04 +0800
3a03fa7a
fix(dnn/cuda): disable pascal sass conv2d by
2020-09-07 16:19:40 +0800
5474b000
fix(mge/functional): fix convert_inputs before apply by
2020-09-09 20:09:25 +0800
a5fad7d0
feat(dnn): add compile for riscv64 by
2020-09-10 13:31:57 +0800
478f2c51
feat(mge/jit): add trace/dump options by
2020-09-06 13:08:47 +0800
44d0b5da
feat(imperative): enable to() to copy to device by
2020-09-08 15:05:13 +0800
3e11d894
fix(dnn/dump): add more info for dump CD4 by
2020-09-09 16:14:31 +0800
d49a5c1b
fix(mge/imperative): remove codes newly introduced in numpy 1.19 by
2020-09-08 16:45:47 +0800
69a7c55f
test(mge/function): fix test for new optimizer api by
2020-09-10 12:25:09 +0800
a66d4b8b
fix(mge/parampacksplit): fix param pack split mem forward by
2020-09-09 22:08:33 +0800
bd3b9cb6
fix(mge/oprmm): fix grad for collective comm by
2020-09-09 14:39:15 +0800
8d02d104
refactor(mge/distributed): change bcast_params_ to bcast_list_ by
2020-09-09 18:27:52 +0800
c7acba41
refactor(mge/optimizer): refine gradmanager api, record = __enter__ by
2020-09-09 13:03:45 +0800
8c482b67
fix(mge/grad): make register_after_backward_callback private by
2020-09-08 18:18:42 +0800
66b6daf7
test(mge/optimizer): fix test for new optimizer api by
2020-09-08 17:21:02 +0800
e9104ef1
fix(mge/parampack): fix copy stream, import cycle by
2020-09-08 15:22:40 +0800
e283663a
fix(mge/imperative): update tests to new optimizer api by
2020-09-07 19:59:53 +0800
b5016b9d
feat(mge/parampack): add parampack in allreduce callback by
2020-09-07 19:41:46 +0800
5ae89c79
refactor(mgb/grad): place grad at param.grad by
2020-09-04 23:46:51 +0800
9faa32fc
fix(mge/imperative): fix grad callback by
2020-09-04 20:30:46 +0800
6d4fd938
fix(mge/optimizer): remove distributed optimizer by
2020-09-04 18:39:09 +0800
e50fa074
fix(mge/imperative): remove backward from optimizer by
2020-09-04 16:29:37 +0800
60702667
refactor(mge/grad_manager): refactor gradmanager, add allreduce callback by
2020-09-04 16:32:50 +0800
3f2eac2f
fix(mge/imperative): move functional/distributed.py to distributed/functional.py by
2020-09-04 17:10:00 +0800
b3889938
feat(mge/examples): add trace & dump example of cifar10 quantization by
2020-09-06 17:18:14 +0800
67859f04
fix(imperative): add __array__ and __array_wrap__ for tensorwrapper by
2020-09-07 14:08:37 +0800
b1ab3646
feat(imperative): add tensor sanity check by
2020-08-31 17:10:12 +0800
5aa19f3d
test(mge/imperative): add more testcases for function by
2020-08-26 15:11:01 +0800
2a300415
fix(mge): remove a duplicated __init__ call in jit.trace by
2020-09-08 19:14:44 +0800
517c1ec0
fix(mge): disable dedup for io callback oprs by
2020-09-08 18:58:22 +0800
939bf3b2
fix(mge/imperative): impl hashable for nms and cond_take by
2020-09-04 19:49:29 +0800
8dc23e0f
fix(mge/functional): fix indexing_one_hot and remote_recv by
2020-09-04 16:34:49 +0800
3bbfef30
fix(mge/imperative): impl hashable for coll-comm by
2020-09-04 16:29:48 +0800
76fa7157
feat(dnn/cuda): add cutlass nchw4 convolution by
2020-08-24 10:54:36 +0800
1f3f4abc
fix(dnn): fix compile warnings by
2020-09-07 18:48:24 +0800
a558d4a2
fix(mgb/atlas): remove unnessary setdevice by
2020-09-07 18:37:19 +0800
14ceda0d
ci(mge): fix Windows CI by
2020-09-08 13:08:53 +0800
1fe8a212
fix(mge): fix sublinear memory in jit.trace by
2020-09-07 16:54:39 +0800
2df1ab96
refactor(mge/jit): skip seed when checking equal rng op by
2020-09-09 11:34:32 +0800
5b6ebeb5
fix(mgb): append json file for dump and ready for midout open source by
2020-09-01 17:03:11 +0800
a81abc1d
fix(mgb): append json file for dump and ready for midout open source by
2020-09-01 17:03:11 +0800
2beb65b1
fix(mge/module): tensor shape will not work when constructing numpy array by
2020-09-07 18:11:43 +0800
651920c7
fix(dnn): fix nchw88 winograd weight preprocess by
2020-09-07 12:22:21 +0800
adf5620e
fix(dnn/x86): uniformity commit for dnnl by
2020-08-31 12:25:40 +0800
5a378324
fix(whl/misc): easy whl local test, and fix export ALL_PYTHON at docker env when value include more than two python version by
2020-09-07 11:14:29 +0800
fc3eca84
feat(mge/imperative/jit): add dump input shape and xornet example by
2020-09-06 16:53:41 +0800
1569cab0
feat(mge/imperative): add shufflenet example by
2020-09-03 18:44:27 +0800
087ceb52
feat(mge/imperative): add more optimizer trace tests by
2020-09-06 13:52:03 +0800
38a5c1c9
fix(mge): fix refcnt in TensorShape caster by
2020-09-06 15:23:14 +0800
2cc85487
fix(imperative): fix hardcode of default device by
2020-09-04 11:05:54 +0800
403a1e7b
feat(mge/imperative): add graph profiler by
2020-09-05 00:57:25 +0800
d06f248d
fix(whl/mgb/imperative): fix symbols conflict runtime crash by
2020-09-05 07:00:23 +0800
1b2194cd
feat(whl/internal): put imperative rt to megbrain by
2020-09-05 06:55:00 +0800
0bb49696
fix(imperative): fix dangling pointer in dispatcher by
2020-09-04 19:24:45 +0800
799d09ef
fix(imperative): fix compile error when vectorcall is enabled by
2020-09-04 18:10:19 +0800
a3b2232b
feat(mge): add trace.dump by
2020-09-03 12:50:17 +0800
76dbaa27
feat(mge/imperative): add name, make_h2d, dump_graph to graph runtime by
2020-09-03 12:45:35 +0800
7336b306
fix(mge/imperative): fix OutputCallback hang by
2020-09-03 12:43:50 +0800
0e82b959
feat(mge/imperative): add sublinear options by
2020-09-01 18:15:17 +0800
e027dcbf
chore(mge): improve symbolic tracing value/shape inference by
2020-09-01 17:18:38 +0800
e6e29748
chore(mge/imperative): fix Graph.make_const by
2020-09-01 17:17:48 +0800
ac3408bf
chore(mge): add VarNode.value by
2020-09-01 15:01:54 +0800
0537cb74
chore(mge/imperative): fix BackwardGraph for jit.trace by
2020-08-31 21:37:27 +0800
4101d5bc
refactor(mge/imperative): add BackwardGraph.interpret by
2020-08-31 19:56:23 +0800
afddefb6
feat(mge/imperative): add more trace test by
2020-08-31 22:26:07 +0800
a085b71c
fix(mge/imperative): fix compilation for python3.6 by
2020-08-31 18:53:54 +0800
989f3e62
chore(mge/imperative): workaround a gcc-7 bug by
2020-08-31 15:15:52 +0800
d4bad711
feat(mge): add jit.trace by
2020-08-29 17:39:10 +0800
0b88ec3c
refactor(mge/imperative): use apply.super in distributed by
2020-08-24 17:48:16 +0800
33295900
chore(mge): add multipledispatch license by
2020-09-14 19:12:26 +0800
5474538f
refactor(mge/imperative): fork multipledispatch by
2020-08-17 21:07:43 +0800
ad9ac521
refactor(mge/imperative): remove abandoned code by
2020-08-18 20:39:43 +0800
03320a05
fix(functional): change to user-friendly nms api and fix concat by
2020-09-03 14:11:20 +0800