Hello community,
In GPU datasheet, the fp32 operations per cycle is 256 for immortalis-g715. Is this for all 16 cores or 1 core only?
Thanks,
Venkatesh.
Thanks for the response, Peter.
So, FMA throughput is 128 fp32 operations per cycle.
Could you suggest what is the throughput capability of CVT wrt integer ops /cycle?
For Mali-G715 it's 64 int32 ops per cycle
Hi Peter, Thanks for CVT throughput. As per Mali sources I understood that CVT unit executes branches, bitwise and integer computations. However I see that in some cases only one of CVT or FMA unit will be utilized, for example FMA unit will be idle when CVT executes branch instructions as control flow path is unknown. so I just want to know what benefit CVT unit really provides for improving performance?
I can't really add much on a public forum, sorry - that level of microarchitecture explanation isn't publicly disclosed.
Cheers, Pete
NP, thank you for the repsonses.