Arm Community
Site
Search
User
Site
Search
User
Groups
Arm Research
DesignStart
Education Hub
Graphics and Gaming
High Performance Computing
Innovation
Multimedia
Open Source Software and Platforms
Physical
Processors
Security
System
Software Tools
TrustZone for Armv8-M
中文社区
Blog
Announcements
Artificial Intelligence
Automotive
Healthcare
HPC
Infrastructure
Innovation
Internet of Things
Machine Learning
Mobile
Smart Homes
Wearables
Forums
All developer forums
IP Product forums
Tool & Software forums
Pelion IoT Platform
Support
Open a support case
Documentation
Downloads
Training
Arm Approved program
Arm Design Reviews
Community Help
More
Cancel
Developer Community
Tools and Software
Software Tools
Jump...
Cancel
Software Tools
Arm Development Studio forum
NEON pipeline stages in instruction timing
Tools, Software and IDEs blog
Forums
Videos & Files
Help
Jump...
Cancel
New
State
Accepted Answer
Replies
9 replies
Subscribers
127 subscribers
Views
6798 views
Users
0 members are here
Related
NEON pipeline stages in instruction timing
Offline
Kun Feng
over 7 years ago
Note: This was originally posted on 3rd April 2012 at http://forums.arm.com
I'm trying to understand more detail about the instruction timing in Cortex-A8/A9.
In TRM of A8, the timing is described as E1 or N2, which means pipeline stage "Execution 1" in ARM pipeline and "Execution 2" in NEON pipeline, is that right?
I think before executing there must be cycles for fetching and decoding. What is the value of cycles that fetching and decoding take? Are they the same for ARM and NEON?
I got such a figure after googling.
Is that a right description for A8 pipeline?
Assuming it's right, the decoding of NEON instruction is after the ARM pipeline. Does it mean that NEON instructions have to pass through the entire ARM pipeline first then get decoded? And when does dual issue happen, after decoding before pipeline? Why NEON instructions need to be decoded twice? Isn't it a waste of time and die size?
The summing up question: how to calculate the number of cycles that a NEON instruction takes in total, from fetch to write back and taking dual issue into consideration?
Thank you so much.
More questions in this forum
By title
By date
By reply count
By view count
By most asked
By votes
By quality
Descending
Ascending
All recent questions
Unread questions
Questions you've participated in
Questions you've asked
Unanswered questions
Answered questions
Questions with suggested answers
Questions with no replies
Not Answered
Getting errors after including arm_math.h
0
stm32 h7
Keil
Digital Signal Processor (DSP)
STM32
25410
views
9
replies
Latest
3 months ago
by
roger-liu
Not Answered
freeRTOS demo DS-5 ERROR(CMD360) when trying to debug
+1
10594
views
12
replies
Latest
3 months ago
by
tolc
Answered
ubuntu - How to uninstall Arm Development studio and all its requirements
0
Arm Development Studio
6710
views
1
reply
Latest
3 months ago
by
Jonathan Simmonds
Answered
DSTREAM Probe damage - Spare parts?
0
DSTREAM
8942
views
2
replies
Latest
3 months ago
by
SamGKN
Suggested Answer
Optimized ARM version of memcmp
0
7468
views
3
replies
Latest
3 months ago
by
Ronan Synnott
<
>
View all questions in Arm Development Studio forum