nihui
|
f0b4933eac
|
massive SIMD optimization in compute shaders (#772)
* init vec4 shader
* more vec4 shader ...
* convolutiondepthwise is depthwise
* pooling pack4, fix global pooling
* dropout pack4, relu pack4
* softmax pack4
* more shader vec4 ..
* fix staging remap, remove layer pipeline member, add destroy_pipeline interface, add pack4 glue code
* eltwise pack4 glue code
* add binary pack4, unary pack4
* add binaryop unaryop pack4 glue code
|
7 years ago |