216 Commits (bae1dd3bb0b19331a700d681ec00be1b09080ea4)

Author SHA1 Message Date
  tacyi139 7413775490 optimize error about BroadcastOpGrad 4 years ago
  wilfChen cf63527a15 multinomial profiling 4 years ago
  i-robot bfd190482f !26842 Speed up random normal sampling 4 years ago
  zhujingxuan 30c6fa7f9b bind stream with handle 4 years ago
  Zichun Ye 8398c07d68 update random normal op impl to speed up sampling 4 years ago
  wenbean 31053edbe4 Use Allocator and workspace pre allocat mem in GPU 4 years ago
  wenbean 26d4bf6350 Fix meme leak bug, add result expect 4 years ago
  wenbean 13409f519f Unify GPU/CPU ops input/output(col/rolmajor), modify related testcases, add linalg function and testcases 4 years ago
  i-robot b5c02a4ee0 !26426 gpu environment kernel 4 years ago
  wilfChen 68260a6a94 gpu environment kernel implement 4 years ago
  wenbean 9b305b231d Add GPU eigenvalues/eigenvector for symmetric mtrix(real and complex) 4 years ago
  z00512249 36032e7ee2 add cholesky, cho_factor primitive and backend gpu implements 4 years ago
  hezhenhao1 d61b089f6b Support int32 as input type for Abs of GPU op, float64 as input type for IsFinite GPU op. 4 years ago
  zhujingxuan fb1805de30 add GPU trsm 2d matrix support 4 years ago
  i-robot ba0e1a810e !25457 add broadcast GPU float64 registration 4 years ago
  zhujingxuan 7441353d9e add unit_diagonal option for solve triangular 4 years ago
  zhujingxuan 8f45ddf39d add float64 registration 4 years ago
  zhujingxuan 28987d787d add trsm 4 years ago
  i-robot 0495ed8630 !25119 add cholesky factorization for gpu backend 4 years ago
  z00512249 a125654fbc add cholesky && lu factorization for gpu backend 4 years ago
  zhunaipan 8ce4e62725 optimize the comment and log description 4 years ago
  lizhenyu 29982ecdd7 some bugfix for parameter server cache 4 years ago
  zhangyihui a94b3dbcfe clean up the static alarms of the second batch of operator groups 4 years ago
  i-robot 4c8854ac02 !24403 Clean up the static alarms of the first batch of operator groups 4 years ago
  zhangyihui 27a80a75c0 Clean up the first batch of static alarms of operator group 4 years ago
  wangshuide2020 7a1862a6e6 add vector size check, input shape check and divide by zero check for gpu operators. 4 years ago
  wangshuide2020 a35a1fe67d add vector size check, nullptr check and clean code for gpu operators. 4 years ago
  i-robot bb9597e570 !23811 add validation of vector size and non-zero validation of denominator for nn gpu operators. 4 years ago
  wangshuide2020 e06beb2ed4 add validation of vector size and non-zero validation of denominator for nn gpu operators. 4 years ago
  zhouyaqiang f76cb53cfe Add complex ops and bprop of real、conj、imag ops 4 years ago
  zhouyaqiang dad375abb9 add gpu complex ops 4 years ago
  Peilin Wang ecb3e6332e initial commit: fixed python class 4 years ago
  ms_yan 36a8886ca2 Revert "[feat] [assistant] [I3T96T] add new Dataset operator CMUARCTICDataset" 4 years ago
  djc 4e6f7dc97d [feat] [assistant] [I3T96X] add new Dataset operator LibriSpeechDataset 4 years ago
  zong_shuai f1eb2fe6bf expend broadcast_gpu_kernel with truncate_div and truncate_mod 4 years ago
  zong_shuai ebe1a2d7f5 expend broadcast_gpu_kernel with truncate_div and truncate_mod 4 years ago
  zong_shuai 336fabe0e6 expend broadcast_gpu_kernel with truncate_div and truncate_mod 4 years ago
  zong_shuai 1c6dd3543f implement truncatediv and truncatemod 4 years ago
  zong_shuai ef72e70cb0 implement truncatediv and truncatemod 4 years ago
  zong_shuai 4f7a27319b implement truncatediv and truncatemod 4 years ago
  zong_shuai ce116f7887 implement truncatediv and truncatemod 4 years ago
  zong_shuai ee03495ff5 implement truncatediv and truncatemod 4 years ago
  zong_shuai 9ea9ba917b implement truncatediv and truncatemod 4 years ago
  i-robot 22e9299c17 !20885 add dtypes & fft kernels for SPONGE 4 years ago
  huangmengxi e32297dc6b add dtypes for sponge 4 years ago
  zhou_lili 9838029fb9 code clean of gpu-math 4 years ago
  Peilin Wang 594571fd4c initial commit: fix 11 dts tickets 4 years ago
  Peilin Wang 922bcf603c bugfix, needs a device sync 4 years ago
  i-robot b9c178e6b7 !16616 GPU index_add op remove cuda device sync 4 years ago
  wilfChen d68069a617 parameter-without-user 4 years ago