Exec Latency
Execution Throughput
Utilized Pipelines
ASIMD FP multiply accumulate, Q-form
VMLA,VMLS,VFMA,
9(4)
ASIMD multiply-accumulate pipelines support late-forwarding of accumulate operands from similar μops, allowing a typical sequence of floating-point multiply-accumulate μops to issue one every four cycles
(accumulate latency shown in parentheses).
(1)、in above description, what is the meaning of "late-forwarding"?
(2)、whan is the meaning of "allowing a typical sequence of floating-point multiply-accumulate μops to issue one every four cycles"?