We are running a survey to help us improve the experience for all of our members. If you see the survey appear, please take the time to tell us about your experience if you can.
There is ARM software optimization guide (e.g., https://developer.arm.com/documentation/swog309707/latest for neoverse n1) to look up the instruction latency and throughput for many instructons.
This guide doesn't seem to contain the latency and throughput for Neon or SVE. Is there a separate guide for NEON or SVE (e.g., the instruction latency and throughput for INSR (SIMD&FP scalar) instruction)?
INSR (SIMD&FP scalar)
A pointer would be very helpful!