This discussion has been locked.
You can no longer post new replies to this discussion. If you have a question you can start a new discussion

Implementation in NEON of non uniform address jumps

Parents
  • Note: This was originally posted on 17th July 2012 at http://forums.arm.com

    Thank You !!

    The clipping now looks like-


    "cmp    %10,#1                            \n\t"//if(isLast)     
          "bne    3f                                \n\t"         
          "vmin.s32   d4,d4,d13                        \n\t"
          "vmax.s32   d4,d4,d12                        \n\t"
          "3:                                          \n\t"   

        //d13 contains maxVal(255)
        //d12 contains 0


    Time consumed by this portion of the code has dropped from 223ms to 18ms
Reply
  • Note: This was originally posted on 17th July 2012 at http://forums.arm.com

    Thank You !!

    The clipping now looks like-


    "cmp    %10,#1                            \n\t"//if(isLast)     
          "bne    3f                                \n\t"         
          "vmin.s32   d4,d4,d13                        \n\t"
          "vmax.s32   d4,d4,d12                        \n\t"
          "3:                                          \n\t"   

        //d13 contains maxVal(255)
        //d12 contains 0


    Time consumed by this portion of the code has dropped from 223ms to 18ms
Children
No data