• Neon instruction timing/latency
    Note: This was originally posted on 7th July 2010 at http://forums.arm.com Hello! I am having trouble deciphering the tables in the Cortex-A8 technical reference manual that contain the NEON advanced...
  • NEON pipeline stages in instruction timing
    Note: This was originally posted on 3rd April 2012 at http://forums.arm.com I'm trying to understand more detail about the instruction timing in Cortex-A8/A9. In TRM of A8, the timing is described as...
  • Any equivalent NEON instruction to SMULWy?
    Note: This was originally posted on 6th July 2013 at http://forums.arm.com Hi everybody, I'm currently working on a 7x7 Gaussian blur filter for NEON. And since everything bigger than 3x3 is hard to handle...
  • NEON instructions for fixed-point arithmetic
    Hi, I'm somewhat familiar with NEON, but it's the first time I'm trying to get a serious advantage from it. I have fixed-point code which gets the very best from the ARMv7 ISA. In particular it uses instructions...
  • How to shuffle bits and Check high bit value using Neon Intrinsics?
    Note: This was originally posted on 1st November 2011 at http://forums.arm.com Hi, I am trying to convert code written in SSE3 intrinsics to NEON SIMD and am stuck because of a shuffle function. I have...