  • NEON-Advanced SIMD vs. SIMD
    Hello, I’m new to ARM architecture and was looking to get a better understanding of how it works. Most notably, the Cortex-A series and its DSP functionality. When reading through ARM’s webpage, it often...
  • 3x3 convolution optimized speed using (NEON SIMD) or (NEON SIMD and OpenMP) on S7/Note7
    We want to implement a 3x3 convolution of a 4032x3024 image on the S7/Note7, which use chipsets such as the Exynos 8890 (S7, S.LSI) or the Qualcomm MSM8996 Snapdragon 820. To implement this, we used the Android...
  • AArch64 - Armv8-A bitwise cyclic shift operation
    Hi all, I am looking for a ROT64 ASM function that can do an ultra-fast 64-bit bitwise cyclic shift (rotate), possibly using SIMD or NEON instructions. Any tips or feedback from experience? Thanks, best regards.
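
    For the 64-bit rotate asked about in the last thread, here is a minimal sketch in portable C rather than hand-written assembly. The helper names rotr64 and rotl64 are hypothetical, chosen for illustration. The assumption is that an optimizing compiler targeting AArch64 (GCC and Clang typically do this) recognizes the shift-and-OR idiom and emits a single scalar ROR/RORV instruction, so no NEON code should be needed for one 64-bit value. Inspect the generated assembly on your own toolchain to confirm.

```c
#include <stdint.h>

/* Rotate x right by n bits (n is reduced modulo 64).
 * Masking both shift counts avoids the undefined behaviour of
 * shifting a 64-bit value by 64 when n == 0. */
static inline uint64_t rotr64(uint64_t x, unsigned n)
{
    n &= 63u;
    return (x >> n) | (x << ((0u - n) & 63u));
}

/* Rotate x left by n bits (n is reduced modulo 64).
 * A left rotate by n is the same as a right rotate by 64 - n,
 * which is how it maps onto a rotate-right instruction. */
static inline uint64_t rotl64(uint64_t x, unsigned n)
{
    n &= 63u;
    return (x << n) | (x >> ((0u - n) & 63u));
}
```

    For example, rotl64(0x0123456789ABCDEF, 16) moves the top 16 bits around to the bottom, and rotr64 with the same count undoes it.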