204 Commits (b6cf16d8aae6fef361ba419fc1942d553cc2432a)

Author SHA1 Message Date
  mindspore-ci-bot da93717c5d !14682 GPU set tensor limitation to 2G 5 years ago
  mindspore-ci-bot 83b25e10e9 !13009 [debugger] offline debug feature 5 years ago
  VectorSL 7b9b84d651 addtensor size limitation to 2G 5 years ago
  John Tzanakakis da3b13a0e1 Offline debugger 5 years ago
  hwjiaorui dac67cbabb clean code 5 years ago
  lizhenyu cf2244f1ef [bugfix]Not set device id in asynchronous compile and run graph 5 years ago
  mindspore-ci-bot 825ce95756 !13832 fix a bug that cuda error show wrong name in log 5 years ago
  mindspore-ci-bot e9d57490c5 !14155 [MS][RDR] fix fps degradation of gpu trainning in master branch 5 years ago
  wilfChen e1d443efe3 tensor-rt library dynamic loadg 5 years ago
  louei5 f23ce6c7d9 optimize record gpu memory information 5 years ago
  mindspore-ci-bot a48785cdcc !14052 add op atomic clean to clear input addr in launch allreduce 5 years ago
  lizhenyu 3f9d9c5b2e add error log when set device id failed 5 years ago
  lvchangquan 0a7df321fe add op atomic clean to clear input addr in launch allreduce 5 years ago
  dayschan 11ee3b1624 add context graph_kernel_flags 5 years ago
  hanhuifeng2020 9629977242 fix a bug that cuda error show wrong name in log 5 years ago
  mindspore-ci-bot 5fd3d140b6 !13344 add DeviceContext module 5 years ago
  mindspore-ci-bot 8e8f3043f9 !12115 IR operators of GPU and CPU are unified as batchnorm 5 years ago
  lizhenyu 95565aa7b8 add hardware abstract layer 5 years ago
  dingpeifei 87e41aaeee IR operators of GPU and CPU are unified as batchnorm 5 years ago
  mindspore-ci-bot defcc51641 !13304 refactor RDR to support single name 5 years ago
  luopengting c8ba7694c5 refactor RDR to support single name 5 years ago
  mindspore-ci-bot eb1c0310a9 !13307 GPU fix shared_ptr in GpuKernel 5 years ago
  mindspore-ci-bot 77cda67b3f !13012 add mul fusion based on allreduce fusion 5 years ago
  lvchangquan 31f9e6a42c add op_mul fusion based on allreduce fusion in pynative mode 5 years ago
  VectorSL 36e11ae17c fix GPUKernelMod about the using of shard_ptr 5 years ago
  limingqi107 a046a5eb43 optimize GPU format transform 5 years ago
  mindspore-ci-bot a21c8e13b5 !13010 Add device id log 5 years ago
  tanghuikang dac64f30ee Support ms_function + heterogenous 5 years ago
  ZPaC f2edee750a Add device id log 5 years ago
  wenfangpei d6b3a07b4a parallel build gpu ops about graph kernel 5 years ago
  mindspore-ci-bot 2f312dac66 !12091 Performance optimization for PyNative AllReduce 5 years ago
  mindspore-ci-bot 4365c332e6 !12813 unify AvgPoolGrad's MindIR 5 years ago
  yuchaojie d2cb3aa1c2 unify AvgPoolGrad 5 years ago
  louei5 99203038a5 support recording gpu memory information and graph execute order 5 years ago
  caifubi 171b468bb3 PyNative AllReduce Bucket 5 years ago
  mindspore-ci-bot 50542793c8 !12077 optimize gpu backend logger 5 years ago
  wilfChen 58196f1faf modify gpu backend logger 5 years ago
  zuochuanyong 3fa26683ac nlp perf(Pynative): change memory sync mode from synchronous to asynchronous in SyncHostToDevice 5 years ago
  He Wei 7d9a783993 [auto-monad] Support side-effects by auto-monad 5 years ago
  lizhenyu 6649153c49 add input data type check for ps cache mode 5 years ago
  limingqi107 366f3e668d optimize the memory alloc error info 5 years ago
  mindspore-ci-bot 4364abc7ee !11798 Support RunOpsInGraph on CPU&GPU in pynative mode 5 years ago
  lizhenyu f17534af08 ps cache support sparse 5 years ago
  tanghuikang 6f2cd92aba Support RunOpsInGraph on CPU&GPU in pynative mode 5 years ago
  mindspore-ci-bot 03f88c6f44 !11271 Fix the bug of step_trace cannot get the step_trace_point name in callback scene 5 years ago
  mindspore-ci-bot d8323b5d51 !11342 Support device memeory profiling 5 years ago
  yuchaojie 1932d87a26 update some op's attr name 5 years ago
  yanghaitao1 8d147deb07 profiler memory 5 years ago
  gzhcv ce66e8cf4e Fix the bug of step_trace cannot get the step_trace_point name in callback scene 5 years ago
  chujinjin ade9a82c2b fix device memory leak 5 years ago