This discussion has been locked.

You can no longer post new replies to this discussion. If you have a question you can start a new discussion

Cortex-M7 "zero overhead loop"

Hi.

In the page 22 of the document below informs that the cortex-m7 has "zero overhead loops" capability. I would like to know how it is done? Is there a special instruction for it?

http://community.arm.com/servlet/JiveServlet/downloadBody/9595-102-4-18606/ARM_Cortex_M7_MCU_Johnson.pdf

Ari.

Parents

0 Joseph Yiu over 10 years ago in reply to Jens Bauer

Hi,
There is no new/special instruction for loops in Cortex-M7.
The design help reducing loop overhead in a number of ways:
- BTAC enable good accuracy in branch predction, so in most cases, there is no branch penalty (of course you still got branch penalty if the prediction is wrong)
- a branch instruction can execute at the same cycle with another data processing instruction
- moving the condition generation instruction eariler helps in some cases too (but in general the design of Cortex-M7 enable high performance without too much of compiler optimization).
So in strictly computer "geek" language, I won't call it zero-over-head loops . But the result is essentially same as some zero-overhead-loop designs, so in "PR" language this description is "correct".
And Ari is right that there is no BTAC in Cortex-M4.
regards,
Joseph
(Disclaimer : this message is written before my 2nd cup of coffee this morning....may not be suitable for human consumption).
Cancel
Vote up 0 Vote down

Cancel

Reply

0 Joseph Yiu over 10 years ago in reply to Jens Bauer

Hi,
There is no new/special instruction for loops in Cortex-M7.
The design help reducing loop overhead in a number of ways:
- BTAC enable good accuracy in branch predction, so in most cases, there is no branch penalty (of course you still got branch penalty if the prediction is wrong)
- a branch instruction can execute at the same cycle with another data processing instruction
- moving the condition generation instruction eariler helps in some cases too (but in general the design of Cortex-M7 enable high performance without too much of compiler optimization).
So in strictly computer "geek" language, I won't call it zero-over-head loops . But the result is essentially same as some zero-overhead-loop designs, so in "PR" language this description is "correct".
And Ari is right that there is no BTAC in Cortex-M4.
regards,
Joseph
(Disclaimer : this message is written before my 2nd cup of coffee this morning....may not be suitable for human consumption).
Cancel
Vote up 0 Vote down

Cancel

Children

No data