add r5, r5, #1 mov r6, r5
mul r4, r5, r4 mla r0, r6, r7, r4
Destination available is always given with respect to the last cycle in a data processing multi-cycle instruction. This rule does not apply to load/store multiple instructions.
Sorry but what you mean when say word shortcut)))))and how do you know that mul then mla, takes 4 cycles?? )I mean how you test it??
r4 is only needed at the beginning of the stage 4 of the second cycle of the MLA