Hi,
If I remember correctly, the work group processing order used to be random. Is that still the case on the Mali G715?
Or is there a way to force the GPU to work like a CPU, I mean processing the work groups in row order?
Example: for a 2*2 group and a 10*10 buffer, the GPU would process data (0,1,10,11), then (2,3,12,13), and so on until (8,9,18,19), then (20,21,30,31), etc.
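To make the order I mean concrete, here is a small CPU-side Python sketch. It is not GPU code, and the name `group_order` is made up for illustration. It only lists which linear buffer indices each group would cover if the groups were processed strictly in row order:

```python
def group_order(width, height, gw, gh):
    """Yield, for each group in row-major order, the linear indices it covers."""
    for gy in range(0, height, gh):        # group rows, top to bottom
        for gx in range(0, width, gw):     # groups within a row, left to right
            yield tuple((gy + y) * width + (gx + x)
                        for y in range(gh)
                        for x in range(gw))

groups = list(group_order(10, 10, 2, 2))
print(groups[0])  # (0, 1, 10, 11)
print(groups[1])  # (2, 3, 12, 13)
print(groups[4])  # (8, 9, 18, 19)
print(groups[5])  # (20, 21, 30, 31)
```

So the question is whether the GPU can be made to dispatch its work groups in the same sequence as this loop.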
That would be great ;))