This discussion has been locked.
You can no longer post new replies to this discussion. If you have a question you can start a new discussion

What is int_width about in arm_nn_activations_direct_q7 about

Hi all,

recently, I am trying to implement neural networks on one of my Cortex-M devices. However, I wasn't know what "int_width" in arm_nn_activations_direct_q7 is about. The definition in the document is: "bit-width of the integer part, assume to be smaller than 3" but it is still unclear to me. Please give me some example about it as I am trying to transfer neural networks from other DL framework like tensorflow and pytorch. Many thanks!!

  • Hi, thanks for your reply. Since the data is q7_t is 2^3, the maximum integer width is 3. Is this correct? But the other problem is still remained: why we need the int_width as a factor to shift input values? Why don't we just keep the int_width as 3? I am not clearly understand this in the perspective of neural network quantization.

  • They define q7_t as a signed 8-bit integer to represent a "fractional data type in 1.7 format". Assuming that Arm is using the AMD variant of the definition of a Q number, a q7_t is a fraction in the range [-1.0, 1), i.e. 1 sign bit, no integer bit, and 7 fractional bits. They seem to have repurposed the same data type q7_t to represent fractions with 1, 2 or 3 integer bits at the cost of reducing the number of fractional bits.

    The tables that they have prepared take an input in the range [-8.0, 8.0). By selecting the int_width value, you are informing the library of the range of your input.

    For e.g., the number 0x80 is -1.0 in q1.7 fraction, but it is -8.0 in q4.4 fraction. When your input assumes an int_width of 0, so that 0x80 represents -1.0, the library needs to locate -1.0 in its range [-8.0, 8.0). It cannot directly use 0x80 as -1.0, since the table assumes that its input is in q4.4 format, and 0x80 in q4.4 is -8.0, not -1.0. The right-shift converts from q1.7 (or q2.6, q3.5, q4.4, based on int_width) to q4.4.

  • Thanks for the explanation! Now, I understand how the Q-format actually is.

    Based on the note written in the "arm_nntables.c", it says "input is 3.x format, i.e, range of [-8, 8)". So I guess we need to right shift to the input to q3.4 format instead of q4.3? (ARM uses the TI's Q format)

  • TI's q3.4 is the same as AMD's q4.4. For int_width == 3, no shifting is necessary. For the example given in the previous post, the conversion is from q7 (in TI notation) to q3.4 (in TI notation). So yes, the input must be converted to q3.4 (in TI notation) before it can be used to index into the table.

  • So in the quantization point of view, the int_width is a scale factor that remaps the float values into [-8, 8), I suppose?

  • As I undertand it, q7_t is a single data-type being used as one of q7, q1.6, q2.5, or q3.4 (all in TI notation). By default, q7_t means q7 (in TI notation). To distinguish between different uses of q7_t, they introduced another variable int_width, so q7_t with int_width==3 is q3.4 (TI), q7_t with int_width==0 is q7 (TI), etc. They could have created different types q7_t, q1_6_t, q2_5_t, q3_4_t, etc to make the Q format explicit, i.e. the API could have been designed a bit better.

  • So how could we know the output of the previous layer is q1.6, q2.5 or q3.4 according to the library?

  • I am unable answer that question. The caller of arm_nn_activations_direct_q7 must know the format/range of the input and so must appopriately select the int_width value.