Please note: We are aware of an issue affecting replies on the Arm Community forums, which may not be loading as expected.
We apologize for any inconvenience and appreciate your patience while we investigate and work to resolve the issue.
Thank you for your understanding.
What's the single cycle Load-Use in ALU mean?
This is in the follow picture:
Hello,
According to http://www.arm.com/files/downloads/Enabling_Embedded_Innovation_with_the_Cortex-A7_Processor.pdf, it is described "The address generation unit is shifted one stage back in the pipeline to enable a single cycle load-use penalty".
I think this means the load can be performed in single cycle by pipelining.
The load pipe includes 2 stages and they would be the address generation and the L1 cache access.
In the same presentation material of ARM TechCon 2011, it is described "Single cycle load-use penalty in DPU".
Therefore, the load execution pipeline acts as the following.
Addr1 Cache1
Addr2 Cache2
Addr3 Cache3
Addr4 Cache4
Best regards,
Yasuhiko Koumoto.