Hi guys,
I am doing some basic bottleneck analysis on Kirin960 social with Unity URP demo(SSAA on, HDR Pipeline...).I run the demo on mate9, mate9 pro, honor9, nova 2s and all of these devices use Kirin 960 SOC.
But the fps is varies widely. mate 9 and mate 9 pro(Android 9) run the demo on 20 FPS and honor9(Android 8) and nova 2s(Android 9) are only 10FPS.
I use the Streamline to check the bottleneck, it shows a huge read stall rate on honor9 and all the devices have a huge write stall rate.
The following picture show the difference between mate 9(left, 20 fps) and honor 9(right, 10 fps).
GPU active shows that the frequencies of the two gpu are almost the same, with no throttling.
Mali Core L2 Memory Reads(load/store bytes) on Honor9 is 4 times the number on Mate 9.
It seems to be caused by L2 cache size or frequency? Is this guess reasonable? And How can I get the L2 cache size or frequency on Mali GPU?
Thanks for the reply, very helpful!