Graphics, Gaming, and VR forum Int8 operation in G72

State Accepted Answer
+1 person also asked this people also asked this
Locked Locked
Replies 3 replies
Subscribers 137 subscribers
Views 10300 views
Users 0 members are here

Options

Related

How was your experience today?

This discussion has been locked.

You can no longer post new replies to this discussion. If you have a question you can start a new discussion

Int8 operation in G72

Unarmed guy over 6 years ago

Does Mali suuport 8bit int vector operation to workaround overflow issue like scalar operation?

Such as..

I tested with G72.

In scalar operation,

--------------------------------

uchar a = 255;

uchar b = 255;

Int c = a + b;

--------------------------------

It results 510 in c.

But in case of vector,

--------------------------------

uchar4 a={255,255,255,255}

uchar4 b={255,255,255,255}

int4 c = a + b;

--------------------------------

It prints wrong answer..

So my question is

1. Scalar operation uses general purpose register and it is 32bit register. That's why scalar operation results correctly. Am i right?

2. Why does Vector operation not support auto cast like scalar operation ? Does it not support general purpose register like in scalar operation?

3. I heard G52 and it supports int8 operation. Does it mean G52 supports 8bit vector register which resolve second case above?

Top replies

Peter Harris over 6 years ago +1 verified

Unarmed guy said: Scalar operation uses general purpose register and it is 32bit register. That's why scalar operation results correctly. Am i right? How the hardware works is irrelevant really; this...

Parents

0 Peter Harris over 6 years ago

To answer your third question about Mali-G52, then it adds a dedicated vector instruction for 8-bit integer dot product which effectively provides a cross-lane FMA for machine learning kernels. The instruction behaves as if all of the multiplication intermediates are 32-bits wide, so there is no clipping of the result.

See the following OpenCL extension for usage information in OpenCL kernels:

https://www.khronos.org/registry/OpenCL/extensions/arm/cl_arm_integer_dot_product.txt

Cheers,
Pete
Cancel
Up 0 Down

Cancel

Reply

0 Peter Harris over 6 years ago

To answer your third question about Mali-G52, then it adds a dedicated vector instruction for 8-bit integer dot product which effectively provides a cross-lane FMA for machine learning kernels. The instruction behaves as if all of the multiplication intermediates are 32-bits wide, so there is no clipping of the result.

See the following OpenCL extension for usage information in OpenCL kernels:

https://www.khronos.org/registry/OpenCL/extensions/arm/cl_arm_integer_dot_product.txt

Cheers,
Pete
Cancel
Up 0 Down

Cancel

Children

No data