Hello community,
In the GPU datasheet, the FP32 operations per cycle figure for Immortalis-G715 is 256. Is this for all 16 cores, or for one core only?
Thanks,
Venkatesh.
As on all GPUs, dependency hiding (and memory latency hiding) is handled by having a _lot_ of threads per core. If one thread is blocked, the core picks another one to run. Immortalis-G715 has up to 2048 threads per core ...
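To make the "pick another thread" idea concrete, here is a rough back-of-the-envelope sketch in Python. It is a toy model, not a description of the real Immortalis-G715 scheduler: it assumes a single issue slot, and that every thread alternates between `work_cycles` of issue and `stall_cycles` of waiting. The function names and the 1-cycle / 399-cycle figures are made up for illustration; only the 2048 threads per core comes from the post above.

```python
def issue_utilization(resident_threads: int, work_cycles: int, stall_cycles: int) -> float:
    """Fraction of cycles a single issue slot stays busy in a toy model.

    Assumption (hypothetical, not from any datasheet): each thread is ready
    to issue for `work_cycles` out of every `work_cycles + stall_cycles`.
    """
    if resident_threads <= 0 or work_cycles <= 0 or stall_cycles < 0:
        raise ValueError("expected positive thread/work counts and non-negative stall")
    period = work_cycles + stall_cycles
    # More resident threads means more chances that one of them is ready.
    return min(1.0, resident_threads * work_cycles / period)


def threads_to_hide(work_cycles: int, stall_cycles: int) -> int:
    """Smallest thread count that keeps the slot fully busy in this toy model."""
    if work_cycles <= 0 or stall_cycles < 0:
        raise ValueError("expected positive work and non-negative stall")
    period = work_cycles + stall_cycles
    return -(-period // work_cycles)  # ceiling division


if __name__ == "__main__":
    # Illustrative numbers only: 1 cycle of work, then a 399-cycle stall.
    for n in (1, 200, 2048):
        print(n, issue_utilization(n, work_cycles=1, stall_cycles=399))
    print("threads needed:", threads_to_hide(1, 399))
```

In this model a single thread keeps the slot busy only a tiny fraction of the time, while a few hundred resident threads are enough to cover the stall completely. That is the reason for having so many threads per core. Real hardware schedules groups of threads across several execution units, so treat the numbers as intuition only.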