* promote vfpv4 for auto fp16 storage conversion * always report neon and vfpv4 for arm64
* create layer decoupled * no more virtual public * allow build test with shared library * decouple cpu vulkan * drop old scripts