AI and ML forum mm_shuffle_epi8 equivalent on ARM machines

State Suggested Answer
Locked Locked
Replies 1 reply
Answers 1 answer
Subscribers 14 subscribers
Views 1209 views
Users 0 members are here

Options

Related

How was your experience today?

This discussion has been locked.

You can no longer post new replies to this discussion. If you have a question you can start a new discussion

mm_shuffle_epi8 equivalent on ARM machines

FrankAlexander over 2 years ago

In a project which is focussed on accelerating the performance on ARM, I am using the mm_shuffle_epi8 implementation from the below page https://github.com/f4exb/cm256cc/blob/master/sse2neon.h#L981.

But above implementation is sub optimal and leading to performance costs.

Is there a right equivalent for _mm_shuffle_epi8 for ARM ?

Top replies

Ben Clark over 2 years ago +2 suggested

There isn't an exact equivalent, but vtbl is likely a useful command for doing _mm_shuffle_epi8 in Neon. As there isn't a direct equivalent, a completely generic version won't be as efficient, but if...