• ARM Neon vs Intel SSE
    Hello experts. It's my first question, and it concerns ARM Neon engine performance compared to Intel SSEx. Introduction. I took a C function which performs addition on 16-bit data in an array and wrote...
  • Any equivalent NEON instruction to SMULWy?
    Note: This was originally posted on 6th July 2013 at http://forums.arm.com Hi everybody, I'm currently working on a 7x7 Gaussian blur filter for NEON. And since everything bigger than 3x3 is hard to handle...
  • x86 _mm_sign_epi8(__m128i a, __m128i b) intrinsic NEON equivalent
    I have the x86 intrinsic (__m128i _mm_sign_epi8(__m128i a, __m128i b)); it performs the following task: for (i = 0; i < 16; i++) { if (b[i] < 0) { r[i] = -a[i]; } else if (b[i] == 0) { r[i] = 0; } else { r...
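
For reference, the per-lane behaviour of _mm_sign_epi8 quoted in the last question can be written out as a plain scalar C routine. This is only a minimal sketch and not a NEON implementation: the name sign_epi8_ref is hypothetical, and the final branch (keep a[i] when b[i] is positive), which the excerpt cuts off, is taken from Intel's documented semantics for the intrinsic rather than from the post itself.

```c
#include <stddef.h>
#include <stdint.h>

/* Scalar reference for the per-lane behaviour of _mm_sign_epi8 (PSIGNB).
 * sign_epi8_ref is a hypothetical helper name, not part of any library. */
void sign_epi8_ref(const int8_t a[16], const int8_t b[16], int8_t r[16])
{
    for (size_t i = 0; i < 16; i++) {
        if (b[i] < 0) {
            /* Negate through unsigned arithmetic so that -128 wraps back to
             * -128 instead of saturating. The final conversion to int8_t is
             * implementation-defined in C; GCC and Clang wrap modulo 256. */
            r[i] = (int8_t)(uint8_t)(0u - (uint8_t)a[i]);
        } else if (b[i] == 0) {
            r[i] = 0;       /* a zero in b clears the lane */
        } else {
            r[i] = a[i];    /* a positive b leaves the lane unchanged */
        }
    }
}
```

A routine like this can serve as a lane-by-lane check when validating a candidate NEON port against the x86 behaviour, including the -128 edge case.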