Hi
As part of my MSc Scientific Computing at UCL, I'm benchmarking a small Raspberry Pi 4 Model B cluster.
I would like to reference the theoretical maximum performance of the BCM2711 (4 x ARM Cortex-A72) in Linpack terminology, R_peak.
I believe R_peak to be: 1.5 GHz x 3-way dispatch x 4 cores = 18 Gflops. This seems to be the "standard" Linpack methodology.
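For clarity, that arithmetic can be written as a minimal sketch. The function name `r_peak_gflops` and its parameter names are illustrative only, and the factor of 3 operations per cycle is the assumption being asked about, not a confirmed figure for the Cortex-A72:

```python
def r_peak_gflops(clock_ghz: float, flops_per_cycle: float, cores: int) -> float:
    """Theoretical peak in GFLOPS: clock rate x FLOPs per cycle per core x cores."""
    return clock_ghz * flops_per_cycle * cores


# Values from the estimate above; flops_per_cycle=3 is the unverified
# "3-way dispatch" assumption, not a documented throughput figure.
estimate = r_peak_gflops(clock_ghz=1.5, flops_per_cycle=3, cores=4)
print(f"R_peak estimate: {estimate} GFLOPS")
```

If the per-cycle figure turns out to be different, only `flops_per_cycle` needs to change.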
It would be very helpful if someone more knowledgeable than me could confirm that this seems reasonable. Even better, is there any official ARM benchmarking material that I could reference in my dissertation?
Best wishes
John
Hi.
I think the easiest way to imagine this is that the core can either use two 64-bit pipelines separately or combine them into a single 128-bit vector. If you look at the throughput numbers in the document you reference, hopefully that explains them a bit more.
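As a rough sketch of that mental model only (the helper `elements_per_cycle` is hypothetical, and this is a simplification rather than a description of the actual hardware), the number of elements processed per cycle comes out the same either way:

```python
def elements_per_cycle(pipelines: int, width_bits: int, element_bits: int) -> int:
    """Elements processed per cycle for a given pipeline count and width."""
    return pipelines * (width_bits // element_bits)


# Two separate 64-bit pipelines versus one combined 128-bit vector,
# for 64-bit (double) and 32-bit (single) elements.
for element_bits in (64, 32):
    split = elements_per_cycle(pipelines=2, width_bits=64, element_bits=element_bits)
    combined = elements_per_cycle(pipelines=1, width_bits=128, element_bits=element_bits)
    print(element_bits, split, combined)
```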
Chris
Thank you again Chris.