What does "single cycle Load-Use" in the ALU mean? (In Cortex-A7)

What does "single cycle Load-Use" in the ALU mean?

It appears in the following picture:

[Image: CA7_detailpipeline.jpg]
  • Hello,

    http://www.arm.com/files/downloads/Enabling_Embedded_Innovation_with_the_Cortex-A7_Processor.pdf states: "The address generation unit is shifted one stage back in the pipeline to enable a single cycle load-use penalty".

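    I think this means that, thanks to pipelining, a load can be issued every cycle, and an instruction that uses the loaded value immediately after the load has to wait only one extra cycle for it.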

    The load pipe has two stages, which would be address generation and L1 cache access.

    The same presentation material from ARM TechCon 2011 also states: "Single cycle load-use penalty in DPU".

    Therefore, successive loads overlap in the load execution pipeline as follows (each column is one cycle).

            Addr1  Cache1
                   Addr2  Cache2
                          Addr3  Cache3
                                 Addr4  Cache4
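The staggered diagram above can be reproduced with a small toy model. This is only a sketch under stated assumptions, not a description of the real Cortex-A7 hardware: it assumes a two-stage load pipe (address generation, then L1 cache access), one load entering the pipe per cycle, and no stalls or cache misses. The names `STAGES`, `load_schedule`, and `render` are hypothetical and chosen for illustration.

```python
# Toy model of a two-stage load pipe (address generation, then L1 cache access).
# Assumption: one load enters the pipe per cycle and nothing ever stalls.

STAGES = ("Addr", "Cache")  # stage names taken from the diagram above


def load_schedule(num_loads):
    """Return {load_number: {stage_name: cycle}}, with cycles starting at 1."""
    schedule = {}
    for load in range(1, num_loads + 1):
        # Load k enters the first stage in cycle k and advances one stage per cycle.
        schedule[load] = {stage: load + offset for offset, stage in enumerate(STAGES)}
    return schedule


def render(schedule):
    """Draw one row per load and one fixed-width column per cycle."""
    last_cycle = max(max(stages.values()) for stages in schedule.values())
    width = 8
    rows = []
    for load, stages in schedule.items():
        cells = [""] * last_cycle
        for stage, cycle in stages.items():
            cells[cycle - 1] = f"{stage}{load}"
        rows.append("".join(cell.ljust(width) for cell in cells).rstrip())
    return "\n".join(rows)


schedule = load_schedule(4)
print(render(schedule))
```

In this model each load takes two cycles from start to finish, but because the stages overlap, four loads finish their cache access by cycle 5 rather than cycle 8.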

    Best regards,

    Yasuhiko Koumoto.