Looking for alternates for this instruction.
For an assembly language subroutine (which only takes 14 clock cycles), see A fairly quick Count Leading Zeroes for Cortex-M0.