Sorry, was thinking about 32-bit code. Yes, standard v8 does require Neon.
If it is vectorized, it's very poorly done. A7 with the Accelerate library can push over 8 GFlops/s DGEMM per core, while geekbench reports less than 2. Obviously these libraries aren't available for Android, but the...