...VFP does not have an instruction queue like NEON, so every VFP instruction needs to wait for the NEON queue to be empty. So in most cases it is not a good idea to use both together....
-O3 -mcpu=cortex-a9 -mfpu=neon -ftree-vectorize -mfloat-abi=softfp
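For illustration, here is a minimal sketch of the kind of loop those flags are meant to help with. The function name `add_i32` and the use of `restrict` are my own assumptions and are not taken from the original post. It uses integers on purpose: as far as I know, GCC only auto-vectorizes floating-point loops for NEON when `-funsafe-math-optimizations` (or `-ffast-math`) is also given, because NEON arithmetic is not fully IEEE 754 compliant.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical example: element-wise sum of two int32 arrays.
 * restrict promises the compiler that the arrays do not overlap,
 * which makes the loop a simpler candidate for auto-vectorization. */
void add_i32(int32_t *restrict dst, const int32_t *restrict a,
             const int32_t *restrict b, size_t n)
{
    assert(dst != NULL && a != NULL && b != NULL);

    /* Plain counted loop with no cross-iteration dependency. */
    for (size_t i = 0; i < n; ++i)
        dst[i] = a[i] + b[i];
}
```

Compiling this with the flags above plus `-S` on an ARM GCC toolchain and looking for NEON instructions such as `vadd.i32` in the generated assembly is one way to check whether the loop was actually vectorized.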