In GPU datasheet, the fp32 operations per cycle is 256 for immortalis-g715. Is this for all 16 cores or 1 core only?
For Mali-G715 it's 64 int32 ops per cycle
Hi Peter, Thanks for CVT throughput. As per Mali sources I understood that CVT unit executes branches, bitwise and integer computations. However I see that in some cases only one of CVT or FMA unit will be utilized, for example FMA unit will be idle when CVT executes branch instructions as control flow path is unknown. so I just want to know what benefit CVT unit really provides for improving performance?
I can't really add much on a public forum, sorry - that level of microarchitecture explanation isn't publicly disclosed.
NP, thank you for the repsonses.