JackieWu
ce2251db05
Improve ROIAlign (accelerate ROIAlign, support sampling ratio and aligned ROIAlign) ( #1820 )
* update roialign
* update tool for roialign
* fix ceil for roialign
* fix ios build
* fix
* fix
* make it compatible with the old version
* fix ios build
* trigger CI
* fix test
* order
* build
* remove code
* merge roialign
* accelerate ROIAlign
* update note
* rename func
* roialign version
* trigger CI
* fix roialign
* use ref for pre-calc in roialign
* retrigger CI
* pre-alloc
6 years ago
nihui
8fec0038ba
fix ci test
6 years ago
nihuini
4a624c636b
skip image tests on unsupported platforms
6 years ago
zhiliu6
d23cef320c
Add Swish layer ( #1799 )
* add swish layer
* add swish test
6 years ago
nihui
15a4b2c878
test pad same mode
6 years ago
nihuini
0efcf63f51
mat pixel rotate test
6 years ago
nihuini
ebabfa60c1
disable image storage test on macos and ios
6 years ago
zhiliu6
3bfabf1d6a
Add fused convolution and mish layer support. ( #1761 )
6 years ago
nihui
9c0e46b00a
priorbox test fix
6 years ago
nihui
52c2782922
priorbox test
6 years ago
nihuini
f350c96112
memorydata vulkan
6 years ago
nihui
f9332e04e4
enable image storage test ( #1744 )
* enable image storage test
* update swiftshader, use subzero backend, opt default
6 years ago
nihuini
b9e9b99e56
reuse device packing and unpacking, noop test, fix packing test
6 years ago
nihui
9a9a618229
image storage is mandatory, less options makes life easier
6 years ago
nihui
e8688b042f
fuse packing cast storage, binaryop image shader, dummy buffer and image, device-wide utility packing converter operators, fix multi-blob layer test
6 years ago
zhiliu6
bd55ddcf0d
Add mish layer ( #1733 )
* add mish layer
* add mish test
6 years ago
nihui
62da1228e1
adreno image shader + fp16 + fp16a ( #1714 )
* wip
* wip
* fix
* image and imageview can not be destroyed until command execution ends
* fast copy path for tightly packed data
* wip
* texture load works
* 1d 3d image
* record clone image, multiple commands share one image reference
* upload download image
* layer forward accept vkimagemat
* vkimagemat graph works
* staging vkimagemat for passing dynamic parameters, macro for fp32+image shader, padding image shader
* vkimagemat elemsize
* convolution test pass
* conv1x1s1 image shader
* fast staging image allocator from host memory, pooling image shader
* convolutiondepthwise image shader
* innerproduct image shader
* packing image shader
* crop deconvolution image shader
* resolve spirv binding types
* image fp16 and fp16a, cast image shader
* eltwise image shader
* wip
* absval image shader
* deconvolutiondepthwise image shader
* concat image shader, squeezenet works
* noop split image shader
* uniform precision hint
* layer support_image_storage
* wip
* vulkan device utility operator
* command is storage and packing option aware
* fallback to cpu on image allocation failed, mobilenetssd works
* flatten image shader, enable more test
* ci test
* check imgfp32 imgfp16 imgfp16a features
* fix ci test
* fix ci test
* upgrade swiftshader
* wip
* opt aggressive
* imgfp16p
* opt none
* convolution winograd image shader
* fix flush range, fast copy path for continuous buffer
* minor fix
* fix innerproduct
* wip ...
* wip
* cast fix
* packing test
* wip
* image fp16p is fp16p
* wip
* silence
* more line info
* code clean
* softmax image shader
6 years ago
nihuini
b2d9325c0d
test activation fusion
6 years ago
nihuini
956ab49d02
fix conv1x1s1 pack4to1 bf16s
6 years ago
nihuini
36f6942fa0
testing time is too long ...
6 years ago
nihui
18328f63e6
fix arm bf16 test conditions, fix unused warning in crop arm
6 years ago
nihui
979dd5fd11
test does not need to provide data type options
6 years ago
nihui
7365bb80a2
vkmat and command api breaks ( #1689 )
* vkmat and command api breaks
* always use compute queue for compute buffer transfer
* no barrier for readonly weight buffer
* record clone, drop queue_owner
* bring back layer forward
* fix validation errors
* lifecycle inside command makes life easier
* update doc
* record_import_android_hardware_buffer
6 years ago
nihuini
0ae11b6e4a
deepcopy layer
6 years ago
nihuini
a05b97a430
dropout prelu scale gpu test
6 years ago
nihuini
9f3af60b3a
dropout prelu scale test
6 years ago
nihuini
85d5e5d3e4
fix innerproduct vulkan pack8 and arm neon, disable packing_layout for int8 test
6 years ago
nihuini
32a9a489bc
fix flatten vulkan fp16p image pack1to4 and pack1to8
6 years ago
nihuini
6077066b02
binaryop broadcasting special type 3 4 for lhs
6 years ago
nihui
3cd7a30172
shufflechannel bf16s
6 years ago
nihuini
867ff7ae97
binaryop bf16s
6 years ago
nihui
ec40b4dbd7
test bf16s ( #1644 )
* wip
* wip
* wip
* fix avx2 test
6 years ago
nihui
d023137426
test fp16 packed and shader pack8 option ( #1636 )
* wip
* fix slice pack8 test
* fix flatten pack8 test
* fix binaryop pack8 test
* fix interp pack8 test
* rewrite cast test for different blob type and packing
6 years ago
xieydd
c8264946ce
test_hardswish.cpp: Add unittest of HardSwish ( #1598 )
* test_hardswish.cpp: Add unittest of HardSwish
6 years ago
nihui
d045918080
ncnn: test: Add support for ROIPooling test ( #1596 )
* ncnn: test: Add support for ROIPooling test
* test: Fix bottom_blob of ROIPooling test
6 years ago
xieydd
3bab5f7233
test_hardsigmoid.cpp: Add unittest of HardSigmoid ( #1597 )
* test_hardsigmoid.cpp: Add unittest of HardSigmoid
6 years ago
Monkeyking
8763badbe0
test: Fix bottom_blob of ROIPooling test
6 years ago
xieydd
aafd1ab529
test_clip.cpp: Add unittest of clip ( #1593 )
* fix requant relu6 bug
* fix
* delete pipeline change in forward/forward_inplace avoid race in multithreading
* test_clip.cpp: Add unittest of clip
* fix
* fix
6 years ago
monkeyking
7e9d5d0439
ncnn: test: Add support for ROIPooling test
6 years ago
nihui
8d4c249066
ncnn: test: Add support for ELU test ( #1589 )
* ncnn: test: Add support for ELU test
* For now alpha was generated by RandomFloat(0.001f, 1000.f)
* test: ELU_test: Remove duplicated cases
6 years ago
nihui
2fa22dc2be
if layer does not support vulkan, pass the test
6 years ago
songqun
f5d6aba720
test_tanh.cpp: Add unittest of tanh ( #1578 )
6 years ago
nihuini
648ef3fdee
reuse vkallocator in test
6 years ago
nihui
839c4c4e34
smaller test size
6 years ago
nihui
9929d52885
less duplicated code for crop layer, slice axes starts from 0
6 years ago
nihui
f4b1760a38
pixelshuffle vulkan, pixelshuffle test
6 years ago
nihui
4fead31bea
some test case changes
6 years ago
nihui
0f7e7bca02
shader shape specialization constant and basic local group size partition ( #1523 )
* use Mat class for Shape description
* shape specialization constant in compute shader
* wip
* wip
* test forward_inplace, add binaryop unaryop sigmoid test
* fix arm unaryop test
* fix arm binaryop test
* make shape hint optional, cast int8 to fp32, add cast test
* wip
* follow the good and old local size setting for conv1x1
* the optimal local size rewrite
* fix build on msvc
* add permute shader for all packing layout, add permute test
* concat and slice partial shape constant, slice test
* fix slice test
* interp test
* add lrn test, test packing layout implicitly
* add eltwise test
* add normalize test
* add instancenorm test
* reorg shape constant
* simple local group size partition
* add shape constant param
6 years ago
nihui
a718129d76
shader pack8 option works
6 years ago
nihui
680df01f27
reorg test
6 years ago