• SIMD-NEON Optimization on CortexA7or Cortex A57
    Hi, we are experiencing poor performance on Small functions translated to SIMD NEON because of likely latency.I found a guide on http://infocenter.arm.com/help/topic/com.arm.doc.uan0015b/Cortex_A57_Software_Optimization_Guide_external...
  • Vectors optimization
    Hi, I have some dataset used by other algorithms. So, the layout of it cannot be modified. That is my problem. So what is left: scattered data in memory but in a contiguous way per group and all...
  • NEON-Advanced SIMD vs. SIMD
    Hello, I’m new to ARM architecture and was looking to get a better understanding of how it works. Most notably, the Cortex-A series and its DSP functionality. When reading through ARM’s webpage, it often...
  • aarch64 instruction
    I have a build problem when use aarch64 instruction. I use ds5 for simulation, target CPU: Generic ARMv8-A AArch64 target FPU ARMv8(NEON & Crypto) others use default UMOV w5, v3.h[#2] index...
  • Divide and Conquer
    Division on ARM Cores “At the end of the day, we must go forward with hope and not backward by fear and division.” – Jesse Jackson. It often surprises me how many people believe that “ARM doesn...