While profiling my shaders with Streamline, I noticed something odd about how GPU cycles are used (see the picture below).
The GPU I am testing on is a 12-core Mali-T880, supposedly running at 650 MHz. What surprises me is that within 1 s only 442 Mcycles are spent in the shaders, even though 641 Mcycles were available on the GPU. Is there a good explanation for that? (The CPU is not maxed out.)
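To put a number on the gap, here is a quick back-of-the-envelope calculation. It uses only the figures quoted above; the variable names are my own, and it assumes both counters cover the same 1 s window and can be compared directly.

```python
# Figures read off the Streamline capture described above (one 1 s window).
GPU_CYCLES_AVAILABLE = 641e6  # GPU cycles counted in the window
SHADER_CYCLES_USED = 442e6    # cycles attributed to shader work
NOMINAL_CLOCK_HZ = 650e6      # clock the GPU is supposedly running at


def utilization(used, available):
    """Return the fraction of available cycles that were actually used."""
    return used / available


shader_util = utilization(SHADER_CYCLES_USED, GPU_CYCLES_AVAILABLE)
unused_cycles = GPU_CYCLES_AVAILABLE - SHADER_CYCLES_USED
clock_ratio = GPU_CYCLES_AVAILABLE / NOMINAL_CLOCK_HZ

print(f"shader utilization: {shader_util:.1%}")                  # 69.0%
print(f"cycles not in shaders: {unused_cycles / 1e6:.0f} Mcycles")  # 199 Mcycles
print(f"measured vs nominal clock: {clock_ratio:.1%}")           # 98.6%
```

So the measured clock is close to the nominal 650 MHz, but roughly 31% of the GPU cycles (about 199 Mcycles) are not accounted for by shader work, and that is the part I would like to understand.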