What is int_width in arm_nn_activations_direct_q7 about

Hi all,

Recently, I have been trying to implement neural networks on one of my Cortex-M devices. However, I don't understand what "int_width" in arm_nn_activations_direct_q7 means. The documentation defines it as "bit-width of the integer part, assume to be smaller than 3", but that is still unclear to me. Could you give me an example? I am trying to port neural networks from other DL frameworks such as TensorFlow and PyTorch. Many thanks!!

+1 a.surati over 2 years ago in reply to dddyyylll

They define q7_t as a signed 8-bit integer to represent a "fractional data type in 1.7 format". Assuming that Arm is using the AMD variant of the definition of a Q number, a q7_t is a fraction in the range [-1.0, 1), i.e. 1 sign bit, no integer bit, and 7 fractional bits. They seem to have repurposed the same data type q7_t to represent fractions with 1, 2 or 3 integer bits at the cost of reducing the number of fractional bits.

The tables that they have prepared take an input in the range [-8.0, 8.0). By selecting the int_width value, you are informing the library of the range of your input.

For example, the byte 0x80 is -1.0 as a q1.7 fraction, but -8.0 as a q4.4 fraction. When your input assumes an int_width of 0, so that 0x80 represents -1.0, the library needs to locate -1.0 in its range [-8.0, 8.0). It cannot use 0x80 directly as -1.0, since the table assumes that its input is in q4.4 format, and 0x80 in q4.4 is -8.0, not -1.0. The right-shift converts from q1.7 (or q2.6, q3.5, q4.4, depending on int_width) to q4.4; here, an arithmetic right-shift of 0x80 by 3 gives 0xF0, which is -1.0 in q4.4.
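
To make the shift concrete, here is a minimal sketch of the conversion described above. The helper names are hypothetical and are not part of CMSIS-NN. The shift amount of 3 - int_width follows from the explanation in this thread (int_width of 3 means the input is already in table format), and the cast to an unsigned byte for indexing is an assumption about how a 256-entry table might be addressed, not something confirmed here.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper, not a CMSIS-NN function: convert a q7_t-style byte
 * that has `int_width` integer bits into the q4.4 (AMD notation) format
 * that the lookup tables are said to expect. Assumes 0 <= int_width <= 3. */
static int8_t to_table_format(int8_t in, unsigned int_width)
{
    unsigned shift = 3u - int_width;   /* int_width == 3 needs no shift */

    /* Right-shifting a negative signed value is implementation-defined in C.
     * This sketch assumes an arithmetic (sign-extending) shift, which is
     * what GCC documents for signed operands. */
    return (int8_t)(in >> shift);
}

/* Assumption: the shifted byte, reinterpreted as unsigned, is used as a
 * 0..255 index into a 256-entry table. */
static uint8_t table_index(int8_t in, unsigned int_width)
{
    return (uint8_t)to_table_format(in, int_width);
}
```

With these definitions, the raw byte 0x80 (-128) with int_width 0 becomes -16, i.e. 0xF0, which reads as -1.0 in q4.4. With int_width 3 the byte is left unchanged.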

0 dddyyylll over 2 years ago in reply to a.surati

Thanks for the explanation! Now I understand how the Q format works.

Based on the note in "arm_nntables.c", which says "input is 3.x format, i.e, range of [-8, 8)", I guess we need to right-shift the input to q3.4 format instead of q4.4? (ARM uses TI's Q format.)

0 a.surati over 2 years ago in reply to dddyyylll

TI's q3.4 is the same as AMD's q4.4. For int_width == 3, no shifting is necessary. For the example given in the previous post, the conversion is from q7 (in TI notation) to q3.4 (in TI notation). So yes, the input must be converted to q3.4 (in TI notation) before it can be used to index into the table.

0 dddyyylll over 2 years ago in reply to a.surati

So from a quantization point of view, int_width is a scale factor that remaps the float values into [-8, 8), I suppose?

0 a.surati over 2 years ago in reply to dddyyylll

As I understand it, q7_t is a single data type being used as one of q7, q1.6, q2.5, or q3.4 (all in TI notation). By default, q7_t means q7 (in TI notation). To distinguish between the different uses of q7_t, they introduced another variable, int_width, so q7_t with int_width==3 is q3.4 (TI), q7_t with int_width==0 is q7 (TI), etc. They could have created different types q7_t, q1_6_t, q2_5_t, q3_4_t, etc. to make the Q format explicit, i.e. the API could have been designed a bit better.
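
As an illustration of that point, the same byte decodes to a different real value depending on int_width. The function below is a hypothetical helper written for this example, not a library call. It assumes that the value is the raw integer divided by 2^(7 - int_width), which matches the q7, q1.6, q2.5 and q3.4 (TI notation) readings above.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: real value represented by `raw` when the byte is
 * read with `int_width` integer bits (0..3) and 7 - int_width fractional
 * bits, plus the sign bit. */
static float q7_to_float(int8_t raw, unsigned int_width)
{
    unsigned frac_bits = 7u - int_width;
    return (float)raw / (float)(1u << frac_bits);
}
```

For instance, the byte 64 reads as 0.5 with int_width 0, as 1.0 with int_width 1, and as 4.0 with int_width 3, while -128 reads as -1.0 and -8.0 at the two extremes.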

0 dddyyylll over 2 years ago in reply to a.surati

So how can we know whether the output of the previous layer is q1.6, q2.5 or q3.4, according to the library?

0 a.surati over 2 years ago in reply to dddyyylll

I am unable to answer that question. The caller of arm_nn_activations_direct_q7 must know the format/range of the input and so must appropriately select the int_width value.
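
One possible way for a caller to make that choice, sketched under assumptions: if the largest absolute float value feeding the activation is known (for example from profiling the original model), pick the smallest int_width whose range covers it, then quantize with the matching number of fractional bits. Both functions below are hypothetical helpers, not CMSIS-NN APIs, and the fallback to 3 for larger inputs (which then saturate) is a design choice of this sketch only.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: smallest int_width in 0..3 whose range
 * [-2^w, 2^w) contains max_abs. Larger inputs fall back to 3 and will
 * saturate when quantized. */
static int choose_int_width(float max_abs)
{
    for (int w = 0; w < 3; ++w) {
        if (max_abs < (float)(1 << w)) {
            return w;
        }
    }
    return 3;
}

/* Hypothetical helper: quantize x to a q7_t-style byte with `int_width`
 * integer bits, rounding half away from zero and saturating to
 * [-128, 127]. */
static int8_t quantize_q7(float x, int int_width)
{
    float scaled = x * (float)(1 << (7 - int_width));
    scaled += (scaled >= 0.0f) ? 0.5f : -0.5f;  /* round before truncating */
    if (scaled > 127.0f) {
        scaled = 127.0f;
    }
    if (scaled < -128.0f) {
        scaled = -128.0f;
    }
    return (int8_t)scaled;  /* the cast truncates toward zero */
}
```

The int_width returned by the first helper would then be the value passed to arm_nn_activations_direct_q7 for data quantized with the second.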