This discussion has been locked.

You can no longer post new replies to this discussion. If you have a question you can start a new discussion

Program failure at 80+ degrees

Hello,

I was hoping to hear your opinion about a serious problem I have - it is either I solve it or reduce my LPC2478 CPU speed from 72[MHz] to 64[MHz] (11% loss. The problem does not seem to be occurring at lower MHz settings). I posted about this in the past but it was a long time ago.
When I place a controller in an environmental chamber and increase the temperature to 80+ Celsius degrees, I often see data abort exceptions, and sometimes I get the impression that the PC takes a hike (even the firmware LED that blinks every 1 second becomes irregular for a while before it stops). The program is launched by a boot loader and has a lower level supporting firmware layer that handles some interrupts (not all). I also see that if RTX is not started at all (but the application hangs in a "for (;;)" loop instead, hence the bootloader and firmware layer were/are involved, but the application is idle) - the system never crashes! I have excluded, as far as I could tell, the roll of external memory or RTX in this situation. However, I still suspect RTX a little (even though my test programs never crashed).
My question: did you ever encounter such a situation? Where do I look best? can this be the result of a misbehaving peripheral? NXP have confirmed the LPC2478 is not the reason.

Parents

0 Tamir Michael over 16 years ago in reply to ImPer Westermark

Hello,

I have learned a little more about this problem in the mean time and was wondering if you can enlighten me further. I am currently running a weekend test of a controller that utilizes the LCD controller of the LPC2478 vs. a controller that does not. The first one is reduced to 64 [MHz] while the second one still runs at 72[MHz], and they communicate via a RS485 bus. Hopefully this remains stable but either way, I have just reduced the display's processing capacity by 12%...
'Samsung' have promised me that their DRAM (K4S561632J) does not suffer from any issues and that the EMC timing settings used now should apply to the entire range of temperatures (maybe the controller was not warmed up entirely or long enough when I concluded otherwise). I am not sure about the refresh rate, but either way I did try to play with it without any positive results. I am aware that the signals to the DRAM should be measured, but that is not so simple at 80+ degrees.
The latest LPC24xx data sheet elaborates on the AHBCFGx registers which determine the arbitration of the AHB busses (my LCD, DRAM and peripheral(MCI interface uses GPDMA) hang on AHB1) . This is a very fundamental setting that I have no experience changing. Do you think this could help me out? I did a few tests with a negative result, but I feel that I have not exhausted it. Either way, can you think of another system setting that might influence this particular problem? I have, for now, ruled out bad traces and noise as another controller (without an LCD) uses the same hardware design and accesses to external RAM (MCI DMA) ) does not crash.
Cancel
Vote up 0 Vote down

Cancel

Reply

0 Tamir Michael over 16 years ago in reply to ImPer Westermark

Hello,

I have learned a little more about this problem in the mean time and was wondering if you can enlighten me further. I am currently running a weekend test of a controller that utilizes the LCD controller of the LPC2478 vs. a controller that does not. The first one is reduced to 64 [MHz] while the second one still runs at 72[MHz], and they communicate via a RS485 bus. Hopefully this remains stable but either way, I have just reduced the display's processing capacity by 12%...
'Samsung' have promised me that their DRAM (K4S561632J) does not suffer from any issues and that the EMC timing settings used now should apply to the entire range of temperatures (maybe the controller was not warmed up entirely or long enough when I concluded otherwise). I am not sure about the refresh rate, but either way I did try to play with it without any positive results. I am aware that the signals to the DRAM should be measured, but that is not so simple at 80+ degrees.
The latest LPC24xx data sheet elaborates on the AHBCFGx registers which determine the arbitration of the AHB busses (my LCD, DRAM and peripheral(MCI interface uses GPDMA) hang on AHB1) . This is a very fundamental setting that I have no experience changing. Do you think this could help me out? I did a few tests with a negative result, but I feel that I have not exhausted it. Either way, can you think of another system setting that might influence this particular problem? I have, for now, ruled out bad traces and noise as another controller (without an LCD) uses the same hardware design and accesses to external RAM (MCI DMA) ) does not crash.
Cancel
Vote up 0 Vote down

Cancel

Children

0 Simon Eversfield over 16 years ago in reply to Tamir Michael

What does the manual say?

Try www.embeddedrelated.com/.../35996.php
Cancel
Vote up 0 Vote down

Cancel
0 Tamir Michael over 16 years ago in reply to Simon Eversfield

I have found this reference myself, but unfortunately NXP do no explain the impact of modifying these registers. It is of course exceedingly hard to solve a problem that you do not fully understand with tools you do not fully understand...
I believe this has something do to with how DMA/LCD DMA and the processor interact with the AHB bus, which changes slightly when temperature rises. I asked NXP to confirm that they have tested the LCD controller of the LPC2478 at these extreme temperatures but they have not replied yet.
Cancel
Vote up 0 Vote down

Cancel
0 Nevill Dayley over 16 years ago in reply to Tamir Michael

If only you hadn't upset Master Zeusti.
Cancel
Vote up 0 Vote down

Cancel
0 Tamir Michael over 16 years ago in reply to Nevill Dayley

Right now I am willing to use just about any help - Zeusti, that Steve figure from above, anything. It is either I solve this, or (assuming the system survives the weekend test!) CPU speed for the display has to go down to 64[MHz] !
Cancel
Vote up 0 Vote down

Cancel
0 ²erik malund over 16 years ago in reply to Tamir Michael

It is either I solve this, or (assuming the system survives the weekend test!) CPU speed for the display has to go down to 64[MHz] !

I quickread this thread and did not see it mentioned that the internal heat generated by the chip is proportional to the clock speed.

NXP claim that the LPC2478 was tested at their labs at up to 105 degrees, and some applications allow for us to 120 degrees...!
Under which operating conditions??

Erik
Cancel
Vote up 0 Vote down

Cancel
0 Tamir Michael over 16 years ago in reply to ²erik malund

Erik,

Thanks for your comments. The answer to your questions is that I do not know: NXP did not elaborate, as far as I can tell, on the exact environmental conditions used to test the chip in any report I could get my hands on. I just don't have enough data to handle this properly...! And you are right: Going down to 64[MHz] might just mask a still existing problem. But at the moment, I don't have any other choice - product beta (thus, installation at the client site) phase is approaching.
Cancel
Vote up 0 Vote down

Cancel
0 Tamir Michael over 16 years ago in reply to Tamir Michael

OK, twittering continues. The display at 64[MHz] made it though the weekend. There are a couple of display related issues, but it is alive!
Cancel
Vote up 0 Vote down

Cancel
0 S. Steve over 16 years ago in reply to Tamir Michael

"...it is alive!"

I can breathe again. (Not to be confused with a yawn.)
Cancel
Vote up 0 Vote down

Cancel
0 Tamir Michael over 16 years ago in reply to S. Steve

I will sure to keep you up to speed, Stunned Steve. Hang in there!
Cancel
Vote up 0 Vote down

Cancel
0 Tamir Michael over 16 years ago in reply to Tamir Michael
As promised, I have an update that might interest operators of a LPC2470/78 using the LCD controller. I have found that:
1. lowering the CPU clock speed to 64[MHz] at 80+ degrees seems to stabilize the system. There are no additional legal PLL settings between 72[MHz] and 64[MHz] that support USB, I'm afraid.
2. This code

AHBCFG1 &= ~1 ; AHBCFG1 |= (3<<12) ; AHBCFG1 |= (4<<16) ; AHBCFG1 |= (2<<20) ; AHBCFG1 |= (1<<24) ; AHBCFG1 |= (5<<28) ;

when placed in main() {the data sheet does not specify that these fundamental settings are disallowed in application code and indeed they work}, will put the LCD in the most preferred position to access the AHB1 bus. This prevents image jitter and distortion when doing time consuming drawing on the LCD at 64[MHz].
Cancel
Vote up 0 Vote down

Cancel
0 S. Steve over 16 years ago in reply to Tamir Michael

"(I have a new, interesting post)"

Where?

There's a new post, but surely interesting is not a true description?

Or was that an attempt at irony?
Cancel
Vote up 0 Vote down

Cancel
0 Tamir Michael over 16 years ago in reply to S. Steve

Not knowing your true name, I must wonder: have you ever posted something useful on this (or any other) forum? please stop hijacking my thread. I'm trying to be informative (=helpful). if you are in a mood for child's play, I suggest you go to a kindergarten.
Cancel
Vote up 0 Vote down

Cancel
0 S. Steve over 16 years ago in reply to Tamir Michael

Is that you being serious again?

Have you tried looking for the Keil command line option -SetMaxTemp=80

You'll not find it. It's not a Keil related problem.
Cancel
Vote up 0 Vote down

Cancel
0 Jack Sprat over 16 years ago in reply to Tamir Michael

There are no additional legal PLL settings between 72[MHz] and 64[MHz] that support USB, I'm afraid.

Have you eliminated the PLL as the cause of the problem? We do some very high temperature stuff and have had problems with PLLs becoming unstable or ceasing to function altogether in devices that have worked fine with an external oscillator.
Cancel
Vote up 0 Vote down

Cancel
0 Tamir Michael over 16 years ago in reply to S. Steve

this really is beyond you, ha? this is an issue that might effect Keil USERS and the safety of their product since Keil supports the LPC2478, thus very much their concern. Apart from that, I don't remember asking you when, if or what I may or may not post. Just go away, it won't help you.
Cancel
Vote up 0 Vote down

Cancel