VectorSL cb3d25c8f0 add cpu tensor array 4 years ago
..
arrays optimizes the kernel error description of Split, Meshgrid, Select, etc. 4 years ago
control code check for ops 4 years ago
cuda_impl !26518 tag environment implement 4 years ago
custom adapt custom op to pyfunc kernel 4 years ago
data support start profiler in the middle of training. 4 years ago
debug add validation of vector size and non-zero validation of denominator for nn gpu operators. 4 years ago
math !26842 Speed up random normal sampling 4 years ago
nccl !26292 Add GPU operator NeighborExchange 4 years ago
nn optimizes the kernel error description of LSTM, Pad, ReLU, etc. 4 years ago
other change test level 4 years ago
quant clean up the static alarms of the second batch of operator groups 4 years ago
random clean up the static alarms of the second batch of operator groups 4 years ago
rl add cpu tensor array 4 years ago
sponge fix ops 4 years ago
trt add exception level to python error report 4 years ago
gpu_kernel.cc fix issue#I3ARG6 5 years ago
gpu_kernel.h add vector size check, nullptr check and clean code for gpu operators. 4 years ago
gpu_kernel_factory.cc Move TypeId2String from kernel_compiler/ to ir/dtype_extends.cc 4 years ago
gpu_kernel_factory.h clean up the static alarms of the second batch of operator groups 4 years ago
kernel_constants.h add cholesky, cho_factor primitive and backend gpu implements 4 years ago