I'm trying to work out whether our application is bandwidth-bound (I don't believe it is), but I don't have the read/write beat counters suggested in the tutorial. That said, I believe the counters I do have already "resolve" the beats to actual byte values ($MaliL2CacheExtReadsExternalReadBytes and $MaliL2CacheExtWritesExternalWriteBytes).
We seem to be using ~740MB/s of bandwidth, but I'm not sure which number to benchmark this against. Presumably the 5GB/s mentioned in the above tutorial (which seems to apply to this GPU), but I was wondering if you could shed some light on this.
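For what it's worth, the comparison I have in mind can be sketched as below. This is only a rough sanity check, assuming the 5GB/s from the tutorial really is the right peak for this device and that the two byte counters sum to about 740MB over a 1s sample. The function and variable names are made up for illustration and are not part of any tool.

```python
def bandwidth_utilization_percent(total_bytes, sample_seconds, peak_bytes_per_second):
    """Return observed external bandwidth as a percentage of an assumed peak."""
    if sample_seconds <= 0 or peak_bytes_per_second <= 0:
        raise ValueError("sample_seconds and peak_bytes_per_second must be positive")
    observed_bytes_per_second = total_bytes / sample_seconds
    return observed_bytes_per_second / peak_bytes_per_second * 100.0


# Assumption: external read bytes + external write bytes over the sample.
total_external_bytes = 740e6
# Assumption: counters were sampled over a 1 second window.
sample_seconds = 1.0
# Assumption: peak figure taken from the tutorial; may not match this device.
assumed_peak_bytes_per_second = 5e9

utilization = bandwidth_utilization_percent(
    total_external_bytes, sample_seconds, assumed_peak_bytes_per_second
)
print(f"{utilization:.1f}% of assumed peak")
```

If the 5GB/s figure is right, that would put us at roughly 15% of peak.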
Cheers Pete! Sadly, the GPU Active Cycles counter has no data, i.e. it is always 0, on this device (Samsung J330F). I do get some stall-related counters (image below), but I'm not sure if I can use them against anything else. This was measured over 1s, btw.
A proxy stall rate, in the absence of GPU Active, might be (e.g.):
($MaliL2CacheExtReadsExternalBusStallsAR / max($MaliJobManagerCyclesJS0Cycles, $MaliJobManagerCyclesJS1Cycles)) * 100
2.6M stalls over 1 second doesn't look like a problem though - assuming the GPU is 500MHz, that's < 1% stall rate - so it looks happy.
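The back-of-envelope arithmetic above, as a minimal sketch. The 500MHz clock is an assumption rather than a measured value, and the function name is just illustrative. If an active-cycle or job-slot cycle counter is available, pass that as the reference instead of clock times sample time.

```python
def stall_rate_percent(stall_cycles, reference_cycles):
    """Return stall cycles as a percentage of a reference cycle count."""
    if reference_cycles <= 0:
        raise ValueError("reference_cycles must be positive")
    return stall_cycles / reference_cycles * 100.0


# Value read from the external bus stall counter over the sample.
stall_cycles = 2.6e6
# Assumption: GPU clock of 500MHz; the real frequency may differ or vary.
assumed_clock_hz = 500e6
# The counters were sampled over 1 second.
sample_seconds = 1.0

# Without an active-cycle counter, use clock * time as the reference,
# which treats the GPU as busy for the whole sample.
reference_cycles = assumed_clock_hz * sample_seconds
rate = stall_rate_percent(stall_cycles, reference_cycles)
print(f"{rate:.2f}% stall rate")
```

Note that if the GPU was only active for part of the second, the real stall rate relative to active cycles would be higher than this figure.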
Cheers, Pete
Cheers Pete! Sadly, all the JobManager counters are also missing for this particular device :/ but it sounds like it's unlikely to be a problem.