We are running a survey to help us improve the experience for all of our members. If you see the survey appear, please take the time to tell us about your experience if you can.
hi,
If i remenber the work group processing use to be ramdom. It is still the case using Mali G715.
Or is there a way to force the GPU to work like a CPU, i mean processing the group in row order.
example : for a 2*2 group and a buffer of 10*10. the GPU would process data(0,1,10,11) then (2,3,12,13) until (8,9,18,19) then (20,21,30,31) exct ...
that would be great ;)) ;))
Most work could be done on the GPU, the question is whether it can be done efficiently. Running a GPU significantly under-threaded because of a need to enforce some fine grained work ordering will seriously reduce performance.
so it would be better to run many kernel if i anderstoud.
Within reason - GPUs will have some implementation-defined limit on how many things can run in parallel in their queue design.
I have never tried to run kernel in parallel. I even does no how to do it. But i'd like to know. ;))