Hi,
I would like to ask whether a work group with 192 work items can run on multiple G76 cores?
I thought similar as other GPUs, one work group can only run on one shader core. However, it seems not the case.
I got similar latency between a work group with 192 work items and a work-group with 24 work items. But one core should only can run 24 (3x8) work items parallelly.
Therefore, I guess the 192 work items actually run on multiple cores?
Thank you!
Hi Kevin,
Thanks for your quick response! I understand now. It is really helpful.
So the batching makes 24 and 192 no latency difference. That makes sense.
Regards,
Ting