This discussion has been locked.
You can no longer post new replies to this discussion. If you have a question you can start a new discussion

confusion of fp32 operations per cycle

Hello community,

In GPU datasheet, the fp32 operations per cycle is 256 for immortalis-g715. Is this for all 16 cores or 1 core only?

Thanks,

Venkatesh.

Parents
  • Thanks Peter for the link, it is really helpful. 

    After watching series of videos, I've following assumptions. could you confirm that whether my understanding is correct?

    1. The 256 fp32 operations per cycle is for both CVT and FMA units i.e. CVT has 128 operations per cycle and FMA has 128 operations per cycle. 

    2. Each PU has 64 ALUs and it has one 16-wide FMA and one 16-wide CVT pipelines. The number of ALU's (64) and number of instructions issued to pipelines (32 = 16 FMA + 16 CVT) are not matching. Am I missing something here? 

    Thanks,

    Venkatesh.

Reply
  • Thanks Peter for the link, it is really helpful. 

    After watching series of videos, I've following assumptions. could you confirm that whether my understanding is correct?

    1. The 256 fp32 operations per cycle is for both CVT and FMA units i.e. CVT has 128 operations per cycle and FMA has 128 operations per cycle. 

    2. Each PU has 64 ALUs and it has one 16-wide FMA and one 16-wide CVT pipelines. The number of ALU's (64) and number of instructions issued to pipelines (32 = 16 FMA + 16 CVT) are not matching. Am I missing something here? 

    Thanks,

    Venkatesh.

Children