79 Commits (fc0fcd2f7f34431770c4fb8197536783112d79f9)

Author SHA1 Message Date
  Megvii Engine Team fc0fcd2f7f chore(winograd): remove winograd transform code 5 years ago
  Megvii Engine Team 368c18607f fix(mgb/jit): find cuda include path correctly 5 years ago
  Megvii Engine Team cf53d9e0f8 fix(mgb/tensor): do tensor overlap check only when d2d and h2h 5 years ago
  Megvii Engine Team 0398a7867f fix(build/windows/cuda/llvm): fix windows bazel build with cuda 5 years ago
  Megvii Engine Team dbb64b46d5 feat(debug/android): opt android backtrace 5 years ago
  Megvii Engine Team 3e00e3f697 feat(debug/linux): opt linux backtrace 5 years ago
  Megvii Engine Team 783a612643 feat(debug/macos/windows): imp macos/windows backtrace, fix mem issue 5 years ago
  Megvii Engine Team 4f9948d00c chore(mbg/core): add MGB_WORKER_SHORT_SPIN env variable to set short spin 5 years ago
  Megvii Engine Team 4f527af4ae chore(mbg/core): fix the repeat code 5 years ago
  Megvii Engine Team 005ead5aea docs(cpu/comp_node): note cpu_default() 5 years ago
  Megvii Engine Team 9415ba5890 feat(src/core): free weight preprocessed weight 5 years ago
  Megvii Engine Team 5cfe73657b fix(mgb/core): ensure all VarNodes would be handled in init_ready_event 5 years ago
  Megvii Engine Team 45032220a4 feat(opr): add dct opr 5 years ago
  Megvii Engine Team 3635af6274 style(atlas): add comment for async d2d 5 years ago
  Megvii Engine Team d68d4d1d99 perf(atlas): use async d2d 5 years ago
  Megvii Engine Team 50d64bd333 fix(pytest/windows/impertive): open impertive pytest on windows 5 years ago
  Megvii Engine Team 9dc5538692 fix(mgb): update fastrun log level when not set timeout 5 years ago
  Megvii Engine Team cb8e53635a fix(mgb/core): disable thread_local in ios due to x-code bug 5 years ago
  Megvii Engine Team ab69883418 fix(mgb/core): use thread local for OpenCL recorder 5 years ago
  Megvii Engine Team 8857456593 fix(mgb/core): use thread local fix multi thread use same compnode with recorder enabled 5 years ago
  Megvii Engine Team 10106341fe fix(mgb/opr): fix take CpuDispatchableBase::EventImpl as CpuEventImpl 5 years ago
  Megvii Engine Team b8ddca4c38 fix(atlas): add MGB_USE_ATLAS_ASYNC_API to enable async api 5 years ago
  Megvii Engine Team 94f1f63496 refactor(mgb/build): remove MGB_ENABLE_IMPERATIVE macro 5 years ago
  Megvii Engine Team 4a5e317063 fix(mgb/opr-mm): remove stream -4 for CollectiveComm 5 years ago
  Megvii Engine Team a599725cd0 fix(mgb/core): fix operator input waiting spec 5 years ago
  Megvii Engine Team a7b9ece4a7 feat(mgb/comp_node): add set_prealloc_config 5 years ago
  Megvii Engine Team 066da0bff1 fix(mgb/comp_node): add lock_guard to sd.mtx & update warning message & add test 5 years ago
  Megvii Engine Team a51d5b4c31 feat(mgb/jit): add mlir backend for cpu and cuda 5 years ago
  Megvii Engine Team dd39265e95 fix(mgb/dtype): enable TypeCvt for bool when trace(symbolic=True) 5 years ago
  Megvii Engine Team 6f581906a1 refactor(mge/profiler): refactor profiler 5 years ago
  Megvii Engine Team cc952b2b92 fix(rocm): fix rocm megdnntest sleep and a cut code 5 years ago
  Megvii Engine Team 1fe8a21299 fix(mge): fix sublinear memory in jit.trace 5 years ago
  Megvii Engine Team 0e82b959a1 feat(mge/imperative): add sublinear options 5 years ago
  Megvii Engine Team 6e882c1a86 feat(whl/imperative): compat for build python whl imperative and legacy runtime 5 years ago
  Megvii Engine Team 9510136223 fix(mgb/rocm): remove begin-internal of rocm 5 years ago
  Megvii Engine Team 00ef677249 fix(mgb): remove internal for cambricon and atlas 5 years ago
  Megvii Engine Team 4a178a8dba feat(windows/cuda/cmake): support cmake cuda build on windows 5 years ago
  Megvii Engine Team 14e71b551b feat(imperative): add helper for dnn opr caller 5 years ago
  Megvii Engine Team e8571cca51 fix(mgb/cuda): fix cuda host alloc set device 5 years ago
  Megvii Engine Team 09b5f3d434 fix(mgb/core): fix multi thread pool deactive and multi thread conflict 5 years ago
  Megvii Engine Team ef239f835f feat(windows/python_whl): make windows HAPPY for build megbrain python package 5 years ago
  Megvii Engine Team 734c498d27 perf(mgb/core): improve DevMemAlloc when it has single stream 5 years ago
  Megvii Engine Team b43fb1a97c perf(mgb): add CUDA host memory allocator 5 years ago
  Megvii Engine Team 2afceb4187 fix(mgb/atlas): use dyn output alloc if enable dynamic batchsize 5 years ago
  Megvii Engine Team 2dc9190055 fix(fastrun/persistent_cache): fix fastrun crash 5 years ago
  Megvii Engine Team 618faf6497 feat(mgb/profiler): dump static infer info 5 years ago
  Megvii Engine Team 9e9e8ca0fe fix(mgb): fix some compile errors 5 years ago
  Megvii Engine Team d782edf80f refactor(mgb): decouple node insertion from ComputingGraphImpl 5 years ago
  Megvii Engine Team d42cf4cd65 refactor(mgb): replace static_cast<ComputingGraphImpl*> with a checked version 5 years ago
  Megvii Engine Team 55844d3e24 fix(core): fix CPU compnode get_ram_status_byte free_memory error 5 years ago