Fix double precision nrm2 kernels returning NaN when the input vectors contain Inf/-Inf.
The implementation is a hybird of the ARMV8 one with some of the improved TX2 rountines along with specifying -march=v8.2-a