Please note: We are aware of an issue affecting replies on the Arm Community forums, which may not be loading as expected.
We apologize for any inconvenience and appreciate your patience while we investigate and work to resolve the issue.
Thank you for your understanding.
Hello community,
In GPU datasheet, the fp32 operations per cycle is 256 for immortalis-g715. Is this for all 16 cores or 1 core only?
Thanks,
Venkatesh.
Like all GPUs, dependency hiding (and memory latency hiding) is handled by having a _lot_ of threads per core. If one thread is blocked, pick another one. Immortalis-G715 has up to 2048 threads per core ...