Hi, I have an issue with an in-place vext.32 instruction. I posted it on the Cortex-A forum. Does anyone have a tip or workaround, as it is too slow on A7, A8, A9, etc.? Thanks.
If you're talking about the latency of the instruction itself, then there isn't really an alternative to it.
You could in principle do it with two shifts (left and right) and an OR, but that will undoubtedly be more expensive.
Depending on the actual operation you're doing, you may be able to use a different sequence, but if you're only talking about vext itself then I don't believe there is one.
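To make the shift-and-OR alternative concrete, here is a minimal scalar sketch in portable C. It models one 64-bit D register as a uint64_t with lane 0 in the low 32 bits, and assumes an extract offset of #1, since the thread does not give the actual operands. The function names are hypothetical and only for illustration.

```c
#include <stdint.h>

/* Assumed operation: VEXT.32 Dd, Dn, Dm, #1 on 64-bit D registers.
 * Result lane 0 is lane 1 of Dn, result lane 1 is lane 0 of Dm. */
uint64_t vext32_by1_model(uint64_t dn, uint64_t dm)
{
    uint32_t n_hi = (uint32_t)(dn >> 32); /* lane 1 of Dn */
    uint32_t m_lo = (uint32_t)dm;         /* lane 0 of Dm */
    return ((uint64_t)m_lo << 32) | n_hi;
}

/* The same result built from a right shift, a left shift and an OR.
 * On NEON this would map to roughly VSHR.U64, VSHL.I64 and VORR,
 * i.e. three instructions in place of one VEXT. */
uint64_t vext32_by1_shift_or(uint64_t dn, uint64_t dm)
{
    uint64_t right = dn >> 32; /* move lane 1 of Dn down to lane 0 */
    uint64_t left  = dm << 32; /* move lane 0 of Dm up to lane 1 */
    return right | left;
}
```

Both functions return the same value for any inputs, which is why the shift sequence is only a functional equivalent and not a speed-up: it trades one instruction for three dependent ones.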
Thanks to all for the answers: Paul, Peterson and Tamar. Your answers helped sort this out, as per Tamar's comment. I understand the latency of the instruction in the in-place case; unfortunately, I also do not see another way to do this operation. Using a different sequence also leads to the same point in my use case (all roads lead to Rome). Closing the topic then. Cheers.