I have run out of the 64K code space on my 8051 processor and need to port to something like the Cortex-M3 with 128K. BUT...with a 32 bit processor, in theory I'd need 256K of program memory just to break even. Does anyone have experience with what the real multiplier is between an 8 bit device and one of these? Can I reasonably expect that I'll have more room for code moving this way?
David, For the ARM9, the load and store multiple instructions allow you to save or restore the entire register file or any subset of registers in the one instruction. Take for example the STM and LDM instructions.