This discussion has been locked.

You can no longer post new replies to this discussion. If you have a question you can start a new discussion

Using CMSIS DSP Function: arm_fir_f32 - Correct Use?

Gary over 12 years ago

Hi All;

I have some questions about correct use of the CMSIS DSP library call arm_fir_32. First, I'll provide some background about what I am doing and what the setup is.

I have a STM32F4 Discovery board, using IAR EWARM for programming. Just for testing purposes, I'm generating a low frequency test signal at 40Hz and feeding it into one of the ADC inputs. The signal is biased to swing from 0 to about 2.5Vpp. The signal has a low to moderate amount of broadband noise - but at this point I am not purposely mixing or introducing any other signals with it. There is a timer interrupt set to sample frequency of 2KHz, with a sampling buffer of 2048 samples.

I have already tested and am using the FFT function arm_cfft_f32, and can accurately determine (track) the frequency of the incoming signal when I change it at the source. This seems to be working well.

Now, I would like to use the arm_fir_32 filter. To do this, I started out reading the documentation from CMSIS on the function. To implement a FIR low pass, to generate the tap coefficients, I am using this website's only tool to do so.

I generated a 4th order filter, and set the sampling rate the same as my software, with a cutoff of 60Hz. I forced generation to 24 taps to be even. So the BLOCK_SIZE is 32, and the number of blocks is 1024/32 = 32.

Following the example from CMSIS on this function, I believe I've set up correctly. So the chain looks like this:

ADC --> FIR --> FFT

However, I'm not getting the result I would expect. The values returned from the FFT's output buffer are exponentially large (not this way if I comment out /circumvent the FIR calls). This leads me to believe I am missing a step. Do I need to normalize the values? I thought that because I input the rate into the FIR function setup, this wouldn't be required - but maybe this is incorrect.

Can someone please provide some insight or assistance as to what I am missing or doing incorrectly to apply the FIR processing?

Thank you,

Gary

Top replies

Gary over 12 years ago +1 verified

No response = close thread. Thanks for those who were polite enough to help and respond.

Parents

0 Gary over 12 years ago in reply to Dr. Paul Beckmann

Hi Dr.Beckmann,
I am processing low frequency signals (for now, just a sine wave) under 50 Hz and using a low pass filter with Fc at 100Hz. Running samples through a simple Parks-McClellan FIR with 351 taps.
the filter was created for Fs = 1000Hz.
Input signal = 27Hz.
Fs = 1000Hz
Buffer size = 4096
N (fft length) = 2048
According to your post, the resolution should be: 1000/2048 = 0.49Hz
The largest bin should be the fundamental, which is at 24 in this case. After returning from the CMSIS call to arm_cmplx_mag_f32, here are the bins around it:
22: cfftBuffer = 42.4
23: cfftBuffer = 96.9
24: cfftBuffer = 470.8
25: cfftBuffer = 218.8
26: cfftBuffer = 88.3
If: f = bin# x (Fs / N) , then: f = 24 * (1000/2048) = 11.71Hz (should be close to 27Hz)

That doesn't work out anywhere near what I am sending into the ADC, or the calculation. Where am I going astray?
Thanks again for looking at my questions...
Cancel
Vote up 0 Vote down

Cancel

Reply

0 Gary over 12 years ago in reply to Dr. Paul Beckmann

Hi Dr.Beckmann,
I am processing low frequency signals (for now, just a sine wave) under 50 Hz and using a low pass filter with Fc at 100Hz. Running samples through a simple Parks-McClellan FIR with 351 taps.
the filter was created for Fs = 1000Hz.
Input signal = 27Hz.
Fs = 1000Hz
Buffer size = 4096
N (fft length) = 2048
According to your post, the resolution should be: 1000/2048 = 0.49Hz
The largest bin should be the fundamental, which is at 24 in this case. After returning from the CMSIS call to arm_cmplx_mag_f32, here are the bins around it:
22: cfftBuffer = 42.4
23: cfftBuffer = 96.9
24: cfftBuffer = 470.8
25: cfftBuffer = 218.8
26: cfftBuffer = 88.3
If: f = bin# x (Fs / N) , then: f = 24 * (1000/2048) = 11.71Hz (should be close to 27Hz)

That doesn't work out anywhere near what I am sending into the ADC, or the calculation. Where am I going astray?
Thanks again for looking at my questions...
Cancel
Vote up 0 Vote down

Cancel

Children

0 Dr. Paul Beckmann over 12 years ago in reply to Gary

Hi Gary,
Your analysis looks correct. The frequency resolution is 0.49 Hz per FFT bin. The input sine wave at 27 Hz should appear at 27/0.49 = about bin 55. Something else must be going on. Could you post your code so that I can take a quick look at it?
-Paul
Cancel
Vote up 0 Vote down

Cancel
0 Gary over 12 years ago in reply to Dr. Paul Beckmann

Hi Dr. Beckmann,
The code is posted below. It runs in the Timer interrupt, defined by:
TIM_TimeBaseStructure.TIM_Period = ( (RCC_Clocks.PCLK2_Frequency / SAMPLE_FREQ) - 1);
Here's where the issue seems to be. It works perfectly with frequency precision to 0.5Hz, or better, if I use these defines:
#define SAMPLE_FREQ (2000)
#define INPUT_SAMPLES (4096)
Here are the bins at/around maximum when running with the above defines:
cfftBuffer[53] = 49.8
cfftBuffer[54] = 167.3
cfftBuffer[55] = 1024.9
cfftBuffer[56] = 503.9
cfftBuffer[57] = 226.6
cfftBuffer[58] = 142.8
cfftBuffer[59] = 96.9
Frequency calc: 55 * (2000/4096) = 26.9 (input generator at 27Hz)
If I use the defines as below in the sample code, it does not compute the correct frequency; the maximum bins do not seem to be in the correct place. Here are the bins around the maximum:
(sorry about the previous post, I pasted the incorrect bins from another run)
cfftBuffer[9] = 27.7
cfftBuffer[10] = 13.5
cfftBuffer[11] = 28.4
cfftBuffer[12] = 563.1
cfftBuffer[13] = 156.8
cfftBuffer[14] = 91.2
cfftBuffer[15] = 67.1
12 * (1000/2048) = 5.9 (input generator at 27Hz)
BTW - I did try the decimation method as you suggested in your post (just used a 4 point moving average filter) but got similar bad results.
Thank you again for assisting me with this, very appreciated. (just hope I'm not missing something obvious)
Gary
/***** CODE BELOW ***/
#define SAMPLE_FREQ                     (uint32_t)1000
#define INPUT_SAMPLES                   (uint32_t)2048
#define FFT_SIZE                            (uint32_t)(INPUT_SAMPLES/2)
#define MAX_FREQ                        (float32_t)50.0
/* max energy bin index. Divide sample rate by sample size,
then multiply by max frequency and cast to int */
#define MAX_ENERGY_INDEX                (uint32_t)((INPUT_SAMPLES/SAMPLE_FREQ)*MAX_FREQ)
float32_t ADC_Values[INPUT_SAMPLES];
float32_t outputBuffer[INPUT_SAMPLES];
float32_t cfftBuffer[INPUT_SAMPLES/2];
/* Timer 6 Interrupt Handler */
void TIM6_DAC_IRQHandler(void)
{
if( TIM_GetITStatus(TIM6, TIM_IT_Update) )
{
    volatile uint16_t uhADCxConvertedValue;
    float32_t *inputBufPtr;
    float32_t *outputBufPtr;
    float32_t smplRate = 0;
    uint32_t index;

    uhADCxConvertedValue = ADC_get();
    //subtract small offset from sigGen
    ADC_Values[sampleCount] = ( ((uhADCxConvertedValue * VREF) / ADC_COUNTS) - 0.58);

    if(sampleCount++ == INPUT_SAMPLES)
    {
        inputBufPtr = &ADC_Values[0];
        outputBufPtr = &outputBuffer[0];
        arm_fir_instance_f32 S;
        arm_fir_init_f32(&S, FILTER_TAP_NUM, (float32_t *)&filter_taps[0], &firStateF32[0], BLOCK_SIZE);

        for(uint16_t i = 0; i < NUM_BLOCKS; i++)
        {
            arm_fir_f32(&S, inputBufPtr + (i * BLOCK_SIZE), outputBufPtr + (i * BLOCK_SIZE), BLOCK_SIZE);
        }

        if( FFT_SIZE == 1024 )
            arm_cfft_f32(&arm_cfft_sR_f32_len1024, outputBuffer, ifftFlag, doBitReverse);
        else
            arm_cfft_f32(&arm_cfft_sR_f32_len2048, outputBuffer, ifftFlag, doBitReverse);

        /* Calculate the magnitude at each bin */
        arm_cmplx_mag_f32(outputBuffer, cfftBuffer, FFT_SIZE);

        cfftBuffer[0] = 0;
        maxValue = 0;
       //find maximum bin (CMSIS call doesn't appear to work correctly!)
        for(uint16_t i = 0; i < MAX_ENERGY_INDEX; i++)
        {
          if( cfftBuffer[i] > maxValue )
          {
            maxValue = cfftBuffer[i];
            index = i;
          }
        }

         //compute frequency at max bin
         smplRate = (INPUT_SAMPLES / 1.0);
         float32_t frequency = ((index * SAMPLE_FREQ) / smplRate);
         printf("Frequency = %.1f, bin energy[%i] = %.1f\n", frequency, index, maxValue );
        sampleCount = 0;
    }

    /* Clear TIM6 Capture compare interrupt pending bit */
    TIM_ClearITPendingBit(TIM6, TIM_IT_Update);
}
}
/* end interrupt handler */
Cancel
Vote up 0 Vote down

Cancel
0 Jens Bauer over 12 years ago in reply to Gary

(Just a minor comment, before I go on vacation tomorrow, I hope you don't mind me chipping in).
Though I do not know much about FFTs, I will recommend clearning the pending bit in the beginning of your interrupt.
It's always good to clear it as early as possible, so you reduce the risk of missing out an interrupt.
It could affect your readings, depending on how fast your MCU is running.
Also, there is an unwritten rule, a no-no; do-not-do... Never call library functions (such as printf for instance) from an interrupt service routine.
Cancel
Vote up 0 Vote down

Cancel
0 Gary over 12 years ago in reply to Jens Bauer

Do not mind a all! I appreciate your comments. 100% agree on clearing the interrupt, have already changed that. The printf is totally for debug through SWO,
and yes it does impact the performance of the IH to a large degree, it will be removed after debugging.
Happy USA day, jensbauer.
gary
Cancel
Vote up 0 Vote down

Cancel
0 Gary over 12 years ago in reply to Dr. Paul Beckmann

Hi Dr. Beckmann,
Have you had the chance to look at the code I posted to see where things are going astray?
Right now I'm convinced that something weird is going on when I use a 1KHz sampling rate and an FFT of length 1024 (2048 samples). The bin values are not correct, and it does not appear to be a scaling issue. There is no uniformly divisible number that works out in the bins of interest.
Thanks...
Cancel
Vote up 0 Vote down

Cancel
0 Daniel White over 12 years ago in reply to Gary

Hi Grog,

I work for Paul at DSP Concepts on the CMSIS library. Would you mind emailing me a small demo program that illustrates your issue? I looked at the code you posted, but there seems to be some code that's probably not related to the problem, and also some important details missing. I would like to see the actual input data that causes the issue reported. Also, it might not be related at all, but would you please let me know what processor you are using (Cortex-M0,1,3,4,4 with fpu), and little or big endian? All of this would greatly speed up my attempt to find and fix the problem.

Thanks,

Dan
Cancel
Vote up 0 Vote down

Cancel
0 Gary over 12 years ago in reply to Daniel White

Hi Dan:
Thank you for taking the time to answer my questions. Being relatively new to working with DSP, I am probably misunderstanding a fundamental here with CMSIS functionality, or perhaps just DSP.
I seem to get it working at one frequency, but not the one I want (500Hz), Bizarre.
A simple objective, with slow data, no fancy buffering or DMA of data needed. I have an eternity to process between interrupts. I am sampling using a timer interrupt set by the define SAMPLING FREQUENCY. When the interrupt reaches the BUFFER_SIZE, I will then process the FFT. For the sake of simplicity, let's say I want a sampling rate of 500Hz - 10x the Nyquist rate should be more than adequate.
So, can I take a step back, ask you to forget the code I posted and instead put this question: Given the conditions in the last paragraph, can you please tell me what I should set these variables to, given the 500Hz sampling rate, to get a resolution of 0.5Hz:
BUFFER_SIZE
FFT_SIZE
BLOCK_SIZE
When using the call: arm_fir_f32
For reference:
The P-M filter coefficients were generated by ScopeFIR at Fs = 500Hz
Using IAR EWARM 7.1
Your specific questions:
Using an STM32F407VG processor (ST's Discovery PCB MB997C)
Using this library: arm_cortexM4lf_math.lib
Assuming endian is little since I downloaded the library already built and I believe that's the default.
Please let me know if there's any other info you need.
Regards,
Gary
Cancel
Vote up 0 Vote down

Cancel
0 Daniel White over 11 years ago in reply to Gary

My apologies for delayed response to this. I must not have this forum set up to email me when I get a response. I assumed you would email me and didn't come back to this forum until today.
Block size should be irrelevant. It should just affect the performance of the function (generally larger block sizes are more efficient).
fft size seems like 1024 should be sufficiently close to 0.5Hz (0.48828125Hz)
buffer size just set it to fft size * 2 (for complex numbers)
I'll respond to your other thread in just a minute
Cancel
Vote up 0 Vote down

Cancel
0 Gary over 11 years ago in reply to Daniel White

Hi Dan:
Thank you for responding, please accept my apologies for the last post.
I think all issues are now fixed. It appears that the interrupt rate was incorrect for anything other than a 2000Hz rate. To get more accuracy I used a prescaler value and then divided down by that, and can now sample at the required rate.
Now just one other question to ask of you. In the presence of noise, how can one be certain that the signal of interest is not noise? In other words, how could the signal be validated?
Regards,
Gary
Cancel
Vote up 0 Vote down

Cancel
0 Daniel White over 11 years ago in reply to Gary

Hi Gary,
I'm not an expert in solving DSP questions (more of a firmware coder), but I'll toss in my 2 cents. You could look at the SNR of the signal in question, and if it's large enough that should indicate it's not a part of the background noise. You could track the signal over multiple interrupts to see if it is still there and still at the same frequency. You could check that only 1 or 2 bins have very large values, as opposed to a plateau of large values.
That's about all I can think of. There's not much you can do if your noise produces a strong sinusoid at the frequency of interest aside from find a way to filter it out before you measure it.
Dan
Cancel
Vote up 0 Vote down

Cancel
0 Gary over 11 years ago in reply to Daniel White

Thanks Dan.
After looking at the CMSIS documentation, there's an example there that uses the SNR function of the CMSIS DSP library:
/* ----------------------------------------------------------------------
** Compare the generated output against the reference output computed
** in MATLAB.
** ------------------------------------------------------------------- */
snr = arm_snr_f32(&refOutput[0], &testOutput[0], TEST_LENGTH_SAMPLES);
if (snr < SNR_THRESHOLD_F32)
{
status = ARM_MATH_TEST_FAILURE;
}
else
{
status = ARM_MATH_SUCCESS;
}
Dan, since you indicated that your area isn't DSP, is it possible that Dr.Beckmann could answer this question:
In the CMSIS documentation example for SNR, the input signal AND the test signal is generated by Matlab. In the real world, how is a reference signal generated when receiving signals that have an amplitude = unknown? I am thinking that the answer isn't likely in one sentence. This is a whole side of DSP processing I have not done before, so if you can point me to some reading (website, document, book) that would provide some insight as to how to correctly use the CMSIS function I would be grateful.
Thank you,
Gary
Cancel
Vote up 0 Vote down

Cancel
0 Dr. Paul Beckmann over 11 years ago in reply to Gary

Hi Gary,
To distinguish between signal and noise you need a good model of each. This is very common in other applications like communications in which you want to distinguish between 0's and 1's. It's usually not 100% reliable since noise can sometimes appear as signal and vice versa.
-Paul
Cancel
Vote up 0 Vote down

Cancel
0 Gary over 11 years ago in reply to Daniel White

Dan,
Still something bizarre going on with this - sorry to open this question again.
I've got the timer interrupt generating at these intervals:
TIM_TimeBaseStructure.TIM_Prescaler = 50;
TIM_TimeBaseStructure.TIM_Period = 1605; //timer interrupts at 1ms = 1Khz
use above, or OR use:
TIM_TimeBaseStructure.TIM_Period = 800; //timer interrupts at close to 500us = 2KHz
The main loop runs until loopCtr == BUFFER_SIZE then runs the FFT.
Below, using these defines gives me bins with 0.5Hz resolution. (A frequency of 25 Hz processed has high bin at #50)
#define SAMPLE_FREQ 1000
#define BUFFER_SIZE 2048
#define FFT_SIZE 1024
So why do these defines give me just 1 Hz resolution? (A frequency of 25 Hz processed has high bin at #25)
#define SAMPLE_FREQ 2000
#define BUFFER_SIZE 2048
#define FFT_SIZE 1024
The only thing that changes between those defines is the SAMPLE_FREQ. Also, I am at a loss to understand why the first set provides 0.5Hz resolution at all.
Because according to DSP theory, Fs / FFTsize = bin resolution.
What am I missing here?
Thanks,
Gary
Cancel
Vote up 0 Vote down

Cancel
0 Gary over 11 years ago in reply to Daniel White

No, I don't expect you to debug the program. I'm trying to figure out where the issue is, nothing more.
I am sending you the source files for the project directory in an RAR file. I'm confident that when you look at the code, you'll see there is no other code,
it's as simple as I have described, and hopefully you can tell me why the CMSIS call using those defines in question is not returning expected values.
Thanks,
Gary
Cancel
Vote up 0 Vote down

Cancel
0 Daniel White over 11 years ago in reply to Gary

I agree it should be Fs / FFTsize = bin resolution.
So in your first case
#define SAMPLE_FREQ 1000
#define BUFFER_SIZE 2048
#define FFT_SIZE 1024
resolution should be 0.9765625Hz, with the 25Hz signal showing up mostly in bins 25 and 26
and the second case
#define SAMPLE_FREQ 2000
#define BUFFER_SIZE 2048
#define FFT_SIZE 1024
resolution should be 1.953125Hz, with the 25Hz signal showing up mostly in bins 12 and 13.
I'm not sure why you're getting roughly double that. I have an inkling this might have more to do with your other code than CMSIS. I feel that it might be appropriate at this point to let you know that I'm here to support you with CMSIS related questions, not to debug any bug in your code you come across. If you're confident that it's CMSIS that is misbehaving in some way, or you're unsure how to correctly use a CMSIS function, please do let me know and I will work through it with you to resolve the issue. As always, I need as much information as possible to debug it; preferably a working example program that involves only CMSIS and nothing else.
edited which bins it should show up in because I fail at basic math
Cancel
Vote up 0 Vote down

Cancel