Megvii Engine Team
e258812f12
feat(dnn): add bool dtype
GitOrigin-RevId: 98c8a092b4
5 years ago
Megvii Engine Team
734c498d27
perf(mgb/core): improve DevMemAlloc when it has single stream
GitOrigin-RevId: 61874faa6d
5 years ago
Megvii Engine Team
39bd66fc63
fix(mgb): fix TensorRT missing cudaSetDevice
GitOrigin-RevId: 40eb119e48
5 years ago
Megvii Engine Team
ab9dfbcefc
test(mgb): fix tensorrt tests missing cudaSetDevice
GitOrigin-RevId: faeb6ae070
5 years ago
Megvii Engine Team
b43fb1a97c
perf(mgb): add CUDA host memory allocator
test(mgb): add SimpleCachingAlloc test
GitOrigin-RevId: 17f381e4ac
5 years ago
Megvii Engine Team
2afceb4187
fix(mgb/atlas): use dyn output alloc if dynamic batchsize is enabled
GitOrigin-RevId: 45a6c6ad51
5 years ago
Megvii Engine Team
6bcc6faec8
feat(mge/imperative/opr): modify batch_norm to support frozen BN
fix(mge/imperative): cmake uses MGE_BUILD_IMPERATIVE_RT flag
GitOrigin-RevId: 8ea21af9da
5 years ago
Megvii Engine Team
54d18115b6
fix(imperative): fix grad of BatchNorm
GitOrigin-RevId: 1e8d8afaf2
5 years ago
Megvii Engine Team
80c4705317
perf(mgb): use midout in megbrain to reduce binary size
GitOrigin-RevId: ddc8af79af
5 years ago
Megvii Engine Team
4348960c40
fix(mge/gopt): fix fp16 compute mode
GitOrigin-RevId: 350625d1aa
5 years ago
Megvii Engine Team
9f4060b050
fix(mgb/gopt): add ShuffleShuffleRemovePass assert
GitOrigin-RevId: 69e07c3197
5 years ago
Megvii Engine Team
9b0978597c
chore(core): remove index_sequence polyfill for pre-C++14
GitOrigin-RevId: 8010000bc2
5 years ago
Megvii Engine Team
074645d82e
feat(core): add array_skip
GitOrigin-RevId: ae0d12fbb8
5 years ago
Megvii Engine Team
2faab90bd0
feat(core): add a naive if_constexpr for C++14
GitOrigin-RevId: bd5811ebde
5 years ago
Megvii Engine Team
8f42fe0d92
feat(core): import a std::apply polyfill
GitOrigin-RevId: da95c85b57
5 years ago
Megvii Engine Team
be5dde9da8
fix(mgb/opr-mm): fix group manager sync error
GitOrigin-RevId: 344aae534c
5 years ago
Megvii Engine Team
2dc9190055
fix(fastrun/persistent_cache): fix fastrun crash
GitOrigin-RevId: b3f7bdf7dc
5 years ago
Megvii Engine Team
c1fb3c8c1a
fix(mgb/gopt): fix ConvertF32ToF16Pass endpoints
GitOrigin-RevId: 850eaa0906
5 years ago
Megvii Engine Team
618faf6497
feat(mgb/profiler): dump static infer info
GitOrigin-RevId: bb9150eb83
5 years ago
Megvii Engine Team
5c224c71f0
refactor(mge/imperative): drop ComputingGraphImpl in proxy graph
GitOrigin-RevId: a6e6f2e76b
5 years ago
Megvii Engine Team
5e7d2a91c2
refactor(mgb): add TensorND::proxy_to_default_cpu
GitOrigin-RevId: 3ab8525f1c
5 years ago
Megvii Engine Team
9e9e8ca0fe
fix(mgb): fix some compile errors
GitOrigin-RevId: ec38f8e15a
5 years ago
Megvii Engine Team
d782edf80f
refactor(mgb): decouple node insertion from ComputingGraphImpl
GitOrigin-RevId: 59b45fcb17
5 years ago
Megvii Engine Team
d42cf4cd65
refactor(mgb): replace static_cast<ComputingGraphImpl*> with a checked version
GitOrigin-RevId: d05b114668
5 years ago
Megvii Engine Team
55844d3e24
fix(core): fix CPU compnode get_ram_status_byte free_memory error
GitOrigin-RevId: a6d4351840
5 years ago
Megvii Engine Team
672d4ad0e0
fix(jit): more testcases on the grad of JITExecutor
GitOrigin-RevId: c3bb405979
5 years ago
Megvii Engine Team
bc95e873ef
fix(jit): fix jit grad
a) fix shape mismatch when taking the grad of a JITExecutor that includes Dimshuffle
b) avoid redundant computation in the grad of JITExecutor
c) do not pass unused vars as inputs to the grad of JITExecutor, to save device memory
d) traverse the internal graph only once in the JITExecutor ctor instead of traversing
the whole graph in each call of setup_args()
e) expand the gradient graph into the original graph if all inputs are const
GitOrigin-RevId: ba6a2b29e9
5 years ago
Megvii Engine Team
4cf193ab97
fix(mgb/opr-mm): fix device id in TestOprIORemote.IdentityWithGopt
GitOrigin-RevId: 0184036b8b
5 years ago
Megvii Engine Team
c76789a5c9
feat(mge/imperative): add io remote wrapper
GitOrigin-RevId: 6dae8287d3
5 years ago
Megvii Engine Team
9006d76bea
fix(serialization): do not gate enum entries by macro
GitOrigin-RevId: aa27ecbf21
5 years ago
Megvii Engine Team
ad4928c995
fix(serialization): remove #include flatbuffers/flatbuffers.h in public header
Removes the need for Flatbuffers for (most) internal users.
GitOrigin-RevId: 6585e9c848
5 years ago
Megvii Engine Team
177c268898
feat(osx/python_whl): fix python pytest failure on osx
GitOrigin-RevId: 97d2b496bb
5 years ago
Megvii Engine Team
4d541b72ce
feat(osx/python_whl): make macos HAPPY when building the megbrain python package
GitOrigin-RevId: 7ffd9b9541
5 years ago
Megvii Engine Team
ab3efab066
fix(mgb): add compute capability check
GitOrigin-RevId: fba819afb3
5 years ago
Megvii Engine Team
8070f40aa1
fix(mgb/gopt): fix gopt nchwxx convert elemwise and reshape
GitOrigin-RevId: 982dee36e1
5 years ago
Megvii Engine Team
b38e8225be
refactor(mgb/opr-mm): update megray communicator init interface and fix ci
GitOrigin-RevId: 55c59879f2
5 years ago
Megvii Engine Team
5e912eddbd
fix(mgb/opr-mm): fix grad func of reduce and gather
GitOrigin-RevId: 4687faef99
5 years ago
Megvii Engine Team
e3e981ccf0
test(mge/collective_comm): fix collective_comm test and add data parallel test
GitOrigin-RevId: 9209e77973
5 years ago
Megvii Engine Team
d3d9018f8d
feat(imperative/opr-mm): add broadcast
GitOrigin-RevId: 83640255c7
5 years ago
Megvii Engine Team
3bd8ef3589
feat(mgb/compnode): add atlas compnode
GitOrigin-RevId: 19f3c33003
5 years ago
Megvii Engine Team
7c2782366b
feat(core): redact exception messages if logging is disabled
GitOrigin-RevId: 1d6bda2380
5 years ago
Megvii Engine Team
6137adc05a
feat(core): move more exceptions to base exception
mgb::cg::GraphError
mgb::NumRangeChecker::Error
mgb::VarSanityCheck::Error
GitOrigin-RevId: 4d79e3629c
5 years ago
Megvii Engine Team
49f02e71fc
chore(format): format megbrain/exception.h
GitOrigin-RevId: 744ed23b2d
5 years ago
Megvii Engine Team
9667d93cfe
feat(core): internal changes to exception types
See comments.
GitOrigin-RevId: ecd3c22faa
5 years ago
Megvii Engine Team
cf3a55ce17
fix(mgb/opr-mm): remove PeerDesc from RemoteSend and RemoteRecv
GitOrigin-RevId: b7a7bbd0da
5 years ago
Megvii Engine Team
d53dab2fb2
fix(build): export target megengine
GitOrigin-RevId: 6daf6b2648
5 years ago
Megvii Engine Team
a808f807c6
feat(externcopr/nvidaopticalflow): fix output and fix cu context issue on
a different thread, share nvof loader outptr with mgb ptr, upgrade to latest nvof sdk
GitOrigin-RevId: 6e003c6428
5 years ago
Megvii Engine Team
486cbdea8b
fix(mgb/opt): nchw to nchw4 pass supports ic less than 4
GitOrigin-RevId: a3c205f38f
5 years ago
Megvii Engine Team
cb5c5fd5dc
feat(mgb/opt): add nchw->nchw4 for tensorrt replace pass
GitOrigin-RevId: db114549be
5 years ago
Megvii Engine Team
2e70cf1dd0
feat(mgb/opt): add nchw->nchw4 in tensorcore pass
GitOrigin-RevId: 755f8dfefe
5 years ago