Arm Community
Site
Search
User
Site
Search
User
Groups
Arm Research
DesignStart
Education Hub
Graphics and Gaming
High Performance Computing
Innovation
Multimedia
Open Source Software and Platforms
Physical
Processors
Security
System
Software Tools
TrustZone for Armv8-M
中文社区
Blog
Announcements
Artificial Intelligence
Automotive
Healthcare
HPC
Infrastructure
Innovation
Internet of Things
Machine Learning
Mobile
Smart Homes
Wearables
Forums
All developer forums
IP Product forums
Tool & Software forums
Support
Open a support case
Documentation
Downloads
Training
Arm Approved program
Arm Design Reviews
Community Help
More
Cancel
Developer Community
Tools and Software
Software Tools
Jump...
Cancel
Software Tools
Arm Development Studio forum
NEON vs Intel SSE - equivalence of certain operations
Tools, Software and IDEs blog
Forums
Videos & Files
Help
Jump...
Cancel
New
Replies
3 replies
Subscribers
126 subscribers
Views
3700 views
Users
0 members are here
Related
NEON vs Intel SSE - equivalence of certain operations
Offline
Marius Cetateanu
over 7 years ago
Note: This was originally posted on 26th August 2011 at http://forums.arm.com
Hi,
I'm having some trouble figuring out the NEON equivalence of a couple of Intel SSE operations.
It seems that NEON is not capable to handle an entire Q register at once(128 bit value data type; haven't found anything in
the arm_neon header or in the reference
http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc.dui0348c/Badcdfad.html
).
What I want to do is the following:
// Intel SSE
// shift the entire 128 bit value with 2 bytes to the right; this is done without sign extension
// by shifting in zeros
__m128i val = _mm_srli_si128(d, 2);
// insert the least significant 16 bits of "some_16_bit_val"(the whole thing in this case)
// into the selected 16 bit integer of vector "val"(the 16 bit element with index 7 in this case)
val = _mm_insert_epi16(val, some_16_bit_val, 7);
I've looked at the shifting operations provided by NEON but could not find an equivalent way of
doing the above(I don't have much experience with NEON). Is it possible to do the above(I guess it is I just don't know how)?
Any pointers greatly appreciated.
Thanks
More questions in this forum
By title
By date
By reply count
By view count
By most asked
By votes
By quality
Descending
Ascending
All recent questions
Unread questions
Questions you've participated in
Questions you've asked
Unanswered questions
Answered questions
Questions with suggested answers
Questions with no replies
Answered
Extended asm alternative for Arm Compiler 5 (memory barriers)
+1
Memory Management Unit (MMU)
Arm Assembly Language (ASM)
Arm Compiler 5
2200
views
1
reply
Latest
1 month ago
by
Ronan Synnott
Answered
Use Arm DS5 streamline performance analyzer on TX2
+1
2989
views
9
replies
Latest
1 month ago
by
ShirB
Answered
Problem with arm_cmplx_mag_f32()
+1
2514
views
2
replies
Latest
2 months ago
by
Vishal_Patel
Answered
Can anyone please help me on how evalution development studio 2020.1 work s and which compiler is needed and how it can be setup?
+1
2246
views
3
replies
Latest
2 months ago
by
Ronan Synnott
Answered
Can anyone tell me the difference between DSTREAM and DSTREAM-ST?
+1
2375
views
2
replies
Latest
2 months ago
by
Xiang
<
>
View all questions in Arm Development Studio forum