random processing of working group in OpenCL

hi,

If i remenber the work group processing  use to be ramdom. It is still the case using Mali G715.

Or is there a way to force the GPU to work like a CPU, i mean processing the group in row order.

example : for a 2*2 group and a buffer of 10*10. the GPU would process data(0,1,10,11) then (2,3,12,13) until (8,9,18,19) then (20,21,30,31) exct ...

that would be great ;)) ;))