Hi, I am looking at the Arm CMSIS code for the float32 biquad implementation. According to the documentation, it is written for Cortex-M. How much effort would it take to port this code to Cortex-A53? The ported code should be fast and optimized using intrinsics, not assembly. Thanks.
biquad
I recommend checking the Armv8-A Neon documentation. Some information is here:
developer.arm.com/.../learn-the-architecture
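To give a rough idea of what an intrinsics-based port can look like, here is a minimal sketch, not the CMSIS implementation. It assumes four independent channels interleaved frame by frame, so that each Neon lane runs its own filter (the feedback in a single-channel biquad does not vectorize across time this simply). The names `biquad4_t` and `biquad4_process` are made up for this example. The coefficient sign convention follows the CMSIS biquad documentation, where the feedback terms are added: y[n] = b0*x[n] + b1*x[n-1] + b2*x[n-2] + a1*y[n-1] + a2*y[n-2]. The structure is transposed Direct Form II, and a scalar fallback is kept so the same file also builds on non-AArch64 hosts.

```c
#include <assert.h>
#include <stddef.h>

#if defined(__aarch64__) && defined(__ARM_NEON)
#include <arm_neon.h>
#define BIQUAD4_USE_NEON 1
#else
#define BIQUAD4_USE_NEON 0
#endif

/* Hypothetical state for one biquad stage applied to 4 channels at once.
 * Coefficients are shared by all channels; d1/d2 hold per-channel state. */
typedef struct {
    float b0, b1, b2, a1, a2; /* CMSIS sign convention: a1, a2 are added */
    float d1[4], d2[4];       /* transposed Direct Form II delay elements */
} biquad4_t;

/* in/out hold `frames` frames of 4 interleaved samples (c0 c1 c2 c3 ...). */
static void biquad4_process(biquad4_t *s, const float *in, float *out,
                            size_t frames)
{
    assert(s != NULL && in != NULL && out != NULL);
#if BIQUAD4_USE_NEON
    /* Broadcast each coefficient to all four lanes. */
    const float32x4_t b0 = vdupq_n_f32(s->b0);
    const float32x4_t b1 = vdupq_n_f32(s->b1);
    const float32x4_t b2 = vdupq_n_f32(s->b2);
    const float32x4_t a1 = vdupq_n_f32(s->a1);
    const float32x4_t a2 = vdupq_n_f32(s->a2);
    float32x4_t d1 = vld1q_f32(s->d1);
    float32x4_t d2 = vld1q_f32(s->d2);

    for (size_t n = 0; n < frames; ++n) {
        float32x4_t x = vld1q_f32(in + 4 * n);
        /* vfmaq_f32(acc, p, q) computes acc + p * q per lane. */
        float32x4_t y = vfmaq_f32(d1, b0, x);         /* y  = b0*x + d1        */
        d1 = vfmaq_f32(vfmaq_f32(d2, b1, x), a1, y);  /* d1 = b1*x + a1*y + d2 */
        d2 = vfmaq_f32(vmulq_f32(b2, x), a2, y);      /* d2 = b2*x + a2*y      */
        vst1q_f32(out + 4 * n, y);
    }
    vst1q_f32(s->d1, d1);
    vst1q_f32(s->d2, d2);
#else
    /* Portable fallback: same recurrence, one channel at a time. */
    for (size_t n = 0; n < frames; ++n) {
        for (size_t c = 0; c < 4; ++c) {
            float x = in[4 * n + c];
            float y = s->b0 * x + s->d1[c];
            s->d1[c] = s->b1 * x + s->a1 * y + s->d2[c];
            s->d2[c] = s->b2 * x + s->a2 * y;
            out[4 * n + c] = y;
        }
    }
#endif
}
```

If you only have a single channel, the same intrinsics still apply, but you would need a different arrangement (for example, running several cascaded stages in parallel lanes), so treat the layout above as one option rather than the way to do it. Cascading stages with this sketch just means calling `biquad4_process` once per stage with its own `biquad4_t`.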