n.18-1 1c __ vmull.u8 q12,d9 ,d17n.19-0 1c __ vmlal.u8 q12,d11,d19n.20-0 1c n0 vmlal.u8 q12,d12,d20n.21-0 1c n0 vmlal.u8 q12,d14,d22n.22-0 1c n0 vmlal.u8 q12,d2 ,d4 n.23-0 1c n0 vmlsl.u8 q12,d8 ,d16n.24-0 1c n0 vmlsl.u8 q12,d10,d18n.25-0 1c n0 vmlsl.u8 q12,d13,d21n.26-0 1c n0 vmlsl.u8 q12,d15,d23
n.35-0 2c n0 _____ vmul.s16 q12,q4 ,d0[0]n.39-0 2c n0 q12l:5 vmla.s16 q12,q5 ,d0[1]n.43-0 2c n0 q12l:5 vmla.s16 q12,q6 ,d0[2]n.47-0 2c n0 q12l:5 vmla.s16 q12,q7 ,d0[3]n.51-0 2c n0 q12l:5 vmla.s16 q12,q8 ,d1[0]n.55-0 2c n0 q12l:5 vmla.s16 q12,q9 ,d1[1]n.59-0 2c n0 q12l:5 vmla.s16 q12,q10,d1[2]n.63-0 2c n0 q12l:5 vmla.s16 q12,q11,d1[3]n.67-0 2c n0 q12l:5 vmla.s16 q12,q1 ,q2
If a multiply-accumulate follows a multiply or anothermultiply-accumulate, and depends on the result of that first instruction, thenif the dependency between both instructions are of the same type and size,the processor uses a special multiplier accumulator forwarding. This specialforwarding means the multiply instructions can issue back-to-back becausethe result of the first instruction in cycle 5 is forwarded to the accumulatorof the second instruction in cycle 4. If the size and type of the instructionsdo not match, then Dd or Qd is required in cycle 3. This applies tocombinations of the multiply-accumulate instructions VMLA, VMLS, VQDMLA,and VQDMLS, and the multiply instructions VMUL andVQDMUL.