This discussion has been locked.
You can no longer post new replies to this discussion. If you have a question you can start a new discussion

Cortex-A72 Maximum Theoretical Linpack Performance R_peak

Hi

As part of my MSc Scientific Computing at UCL, I'm benchmarking a small Raspberry Pi 4 Model B cluster.

I would like to reference the theoretical maximum performance of the BCM2711 (4 x ARM Cortex-A72) in Linpack terminology, R_peak.

I believe R_peak to be: 1.5 GHz x 3-way dispatch x 4 cores = 18 Gflops. This seems to be the "standard" Linpack methodology.

It would be very helpful if someone more knowledgable than me can confirm that this seems reasonable. Or even better, if there is some official ARM benchmarking material which I can reference in my dissertation?

Best wishes

John

Parents
  • Hi Chris

    Thank you. That is very helpful!

    If you don't mind a follow up question...

    On Page 6 of the Cortex-A72 Software Optimisation Guide, 2.1 Pipeline Overview, the block diagram, and subsequent instruction details, refer to two floating point pipelines, FP/ASIMD 0 and FP/ASIMD 1. I'm not sure how these relate to "a single 128-bit vector pipeline"?

    Kind regards

    John

Reply
  • Hi Chris

    Thank you. That is very helpful!

    If you don't mind a follow up question...

    On Page 6 of the Cortex-A72 Software Optimisation Guide, 2.1 Pipeline Overview, the block diagram, and subsequent instruction details, refer to two floating point pipelines, FP/ASIMD 0 and FP/ASIMD 1. I'm not sure how these relate to "a single 128-bit vector pipeline"?

    Kind regards

    John

Children