24 bit address in 16 bit processor

Please do not say 'paging'; I cannot afford the overhead for all the other stuff that fits nicely within 64k.

I have a 2 Mbyte flash that is accessed occasionally, and timing there is not critical (the system pauses), as opposed to the other processes, which use only RAM.

Is there an elegant way to access structures in that 24-bit address space, or does it have to be bent, folded and mutilated to access?

Currently all addresses are specified with an 8-bit 'page' and a 16-bit 'address' and processed as such. I could gain some readability by having the whole in a long.

Also, this method requires the data to be stored so that no structure crosses a page boundary, and that limitation is a nuisance.
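
As a minimal sketch of the "whole in a long" idea, the page and the 16-bit address can be packed into one 32-bit value and split again only at the point of access. The helper names below are hypothetical and not part of any Keil library.

```c
#include <stdint.h>

/* Hypothetical helpers; the names are illustrative only. */

/* Combine an 8-bit page and a 16-bit address into one 24-bit linear address. */
static uint32_t make_linear(uint8_t page, uint16_t offset)
{
    return ((uint32_t)page << 16) | offset;
}

/* Recover the page that the banking hardware needs. */
static uint8_t linear_page(uint32_t linear)
{
    return (uint8_t)(linear >> 16);
}

/* Recover the 16-bit address within that page. */
static uint16_t linear_offset(uint32_t linear)
{
    return (uint16_t)(linear & 0xFFFFu);
}
```

Arithmetic done on the packed value carries from the address into the page, so stepping past the end of a page yields the first byte of the next one.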

Erik

0 Keil Software Support Intl. over 21 years ago in reply to Drew Davis

The code in XBANKING.A51 does exactly what your ReadFlash routine does. But it is built into the compiler, so when you use pointers you need not decide whether the address is a flash address or a RAM address. The overhead is this decision (which takes 5-6 CPU cycles).

Of course you may implement your own way of doing it.

0 erik malund over 21 years ago in reply to Keil Software Support Intl.

And since the routine only reads one byte, that one byte can't cross a boundary.
No, but the address can.

If the structure is located at fff0 and is 20 bytes long, the last 4 bytes will be accessed at 0000, 0001, 0002 and 0003 with a 16-bit calculation.
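
The wrap described above can be reproduced on any C compiler. This is only an illustration with made-up function names: the same byte index is added to the base address once in 16 bits and once in 32 bits.

```c
#include <stdint.h>

/* Address of byte `index` of an object at `base`, computed in 16 bits.
   The sum wraps around after 0xFFFF. */
static uint16_t byte_addr_16(uint16_t base, uint16_t index)
{
    return (uint16_t)(base + index);
}

/* The same address computed in 32 bits.
   The carry into bit 16 is kept, so the next 64K segment is reached. */
static uint32_t byte_addr_32(uint32_t base, uint16_t index)
{
    return base + index;
}
```

For a base of 0xFFF0, indexes 16 to 19 give 0x0000 to 0x0003 in the 16-bit version and 0x10000 to 0x10003 in the 32-bit version.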

Erik

0 erik malund over 21 years ago in reply to Keil Software Support Intl.

The code in XBANKING.A51 does exactly what your ReadFlash routine does. But it is built into the compiler, so when you use pointers you need not decide whether the address is a flash address or a RAM address. The overhead is this decision (which takes 5-6 CPU cycles).

I DO NOT want the execution routines since they assume all is 'banked' and when operating in "RAM mode" I can not afford ANY overhead.

ALL I WANT is a means of the calculation of the effective address in 32 bit mode.

IF the address of an entry in a structure or array is targeted at a 32 bit entity, the calculation should be 32 bit.

Erik

0 Jon Ward over 21 years ago in reply to erik malund

I DO NOT want the execution routines since they assume all is 'banked' and when operating in "RAM mode" I can not afford ANY overhead.

The XBANKING routines do not add any overhead when accessing variables in the CODE, DATA, XDATA, IDATA, PDATA, or BIT memory areas. They are only invoked when you use far or const far pointers.

IF the address of an entry in a structure or array is targeted at a 32 bit entity, the calculation should be 32 bit.

Far memory types are limited to 64K in size and may not cross a 64K boundary. As such, the address calculations for far memory objects are performed using 16-bit arithmetic which reduces code size and increases execution speed. A limitation is that compiler-managed objects may not cross a 64K boundary.

ALL I WANT is a means of the calculation of the effective address in 32 bit mode.

You can do this using a far pointer with a long typed index but you'll have to do it manually and you'll have to read each byte individually. However, this is only required for those objects that straddle the 64K boundary. And, there are very few of those (only 1 if you're using 128K).
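
A rough, portable sketch of reading an object one byte at a time with a long index follows. It does not use the C51 far keyword: the flash is simulated by an array, and flash_read_byte() is a hypothetical stand-in for whatever single-byte access the real hardware needs.

```c
#include <stdint.h>
#include <stddef.h>

#define FLASH_SIZE 0x20000UL             /* 128K stand-in for the real flash */
static uint8_t flash_image[FLASH_SIZE];  /* simulated flash contents */

/* Hypothetical single-byte access; on the target this would set the
   high address bits and read through a 16-bit pointer. */
static uint8_t flash_read_byte(uint32_t addr)
{
    return flash_image[addr % FLASH_SIZE];
}

/* Copy `len` bytes starting at the 32-bit address `addr` into RAM.
   Each byte address is computed in 32 bits, so a 64K boundary inside
   the object needs no special handling. */
static void flash_read(void *dst, uint32_t addr, size_t len)
{
    uint8_t *out = (uint8_t *)dst;
    size_t i;

    for (i = 0; i < len; i++) {
        out[i] = flash_read_byte(addr + (uint32_t)i);
    }
}
```

Copying the whole structure into a RAM buffer this way lets the rest of the code access its fields normally, at the cost of one call per byte.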

Jon

0 erik malund over 21 years ago in reply to Jon Ward

You can do this using a far pointer with a long typed index but you'll have to do it manually and you'll have to read each byte individually. However, this is only required for those objects that straddle the 64K boundary. And, there are very few of those (only 1 if you're using 128K).
The problem here is that the data is variable and I do not know which units straddle.

Anyhow, I think it has now reached the point where I have to go back to the people that make the software that generates the file I store in flash and say "make a hole in the file so no units straddle 64k". I know it will cost me a hefty fee, but oh well, if nothing else works, pay.

Erik

0 Jon Ward over 21 years ago in reply to erik malund

The problem here is that the data is variable and I do not know which units straddle.

Well, that complicates things a bit, but still, couldn't you look at the address and size of the object to determine if it straddles?
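
One way such a check could look, assuming the object's start address and size are available as 32-bit values (the function name is made up for this sketch): compare the 64K segment of the first byte with that of the last byte.

```c
#include <stdint.h>

/* Returns 1 if an object of `size` bytes at linear address `addr`
   has its first and last byte in different 64K segments, else 0. */
static int straddles_64k(uint32_t addr, uint32_t size)
{
    if (size == 0) {
        return 0;   /* an empty object cannot straddle anything */
    }
    return (addr >> 16) != ((addr + size - 1) >> 16);
}
```

An object ending exactly at xFFFF does not straddle; one that starts at xFFFF and is two bytes long does.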

Jon

0 erik malund over 21 years ago in reply to Jon Ward

Well, that complicates things a bit, but still, couldn't you look at the address and size of the object to determine if it straddles?

There is more to it. There are 32 copies of struct a, 64 copies of array b, etc.

To process the same struct differently depending on its location would create a piece of code that would be a nightmare to debug.

Anyhow, I'll see what the cost of requesting a gap would be, and if it is exorbitant, I'll try the suggestions here.

Thanks all,

Erik

0 Drew Davis over 21 years ago in reply to erik malund
Sounds like the offsetof() macro could come in handy.

if ((U16)((U16)addr + offsetof(structType, fieldName)) < (U16)addr)
{
    // 16-bit sum wrapped: field straddles 64k boundary
}
else
{
    // field lies within one 64k segment
}

Assuming none of your fields are bigger than 32k, that is.

But isn't such a test at runtime again going to be more expensive than just setting the bank register? I suppose you could figure out at initialization time where the break comes, and store that.

Is this operation really so time-critical that saving a couple of instructions is worthwhile? Flash access is often slower than RAM access. If you're writing to the flash, it's many orders of magnitude slower than the access time.

0 erik malund over 21 years ago in reply to Drew Davis

Is this operation really so time-critical that saving a couple of instructions is worthwhile? Flash access is often slower than RAM access.
When reading flash, timing is of no concern; when NOT reading flash, it is extremely critical.
Basically the unit runs in two modes:
Haul @$$ (99.9% of the time)
work with flash

Erik

0 Drew Davis over 21 years ago in reply to erik malund

As Jon mentioned, the extra instructions to set up the high 8 bits of the address apply only to far and const far data. Regular xdata access does not go through these routines, and will not be slowed down by the code in XBANKING.A51. These routines are essentially the "far access library". So long as you don't declare your normal xdata items far or access them via a far pointer, you should be safe.

The remaining question seems to be whether or not the actual access pattern is such that detecting the segment boundary is worthwhile. If there's a whole lot of 1-byte reads in the same segment, then you could (in theory) optimize out most of the high-order byte setup, as in the ReadManual4 routine I posted above. It's just a matter of whether the time and code it takes to figure out whether you need to set the high byte is less than the time it takes you just to do it every time. Also, it's perhaps worth considering whether you need consistent execution time for every access, or whether it's okay for some of them to be much longer than others as long as the amortized total is less overall.
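
To make the trade-off concrete, here is a hedged sketch that simulates the bank port with a variable and counts how often it is written. All names are invented for the illustration; this is not the XBANKING.A51 code, and on real hardware the compare in read_cached() has its own cost, which is the point of the question above.

```c
#include <stdint.h>

#define NUM_BANKS 2
static uint8_t banked_mem[NUM_BANKS][0x10000UL]; /* simulated 2 x 64K memory */
static uint8_t bank_port;            /* stand-in for the port driving A16 and up */
static unsigned long port_writes;    /* number of writes to the bank port */

static void set_bank(uint8_t bank)
{
    bank_port = bank;
    port_writes++;
}

/* Variant 1: write the bank port on every access. */
static uint8_t read_always(uint32_t addr)
{
    set_bank((uint8_t)(addr >> 16));
    return banked_mem[bank_port % NUM_BANKS][addr & 0xFFFFu];
}

/* Variant 2: write the bank port only when the bank actually changes. */
static uint8_t read_cached(uint32_t addr)
{
    uint8_t bank = (uint8_t)(addr >> 16);

    if (bank != bank_port) {
        set_bank(bank);
    }
    return banked_mem[bank_port % NUM_BANKS][addr & 0xFFFFu];
}
```

Reading a run of bytes that crosses one 64K boundary writes the port once with read_cached() and once per byte with read_always().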

0 erik malund over 21 years ago in reply to Drew Davis

Also, it's perhaps worth considering whether you need consistent execution time for every access, or whether it's okay for some of them to be much longer than others as long as the amortized total is less overall.
Varying execution time is irrelevant, but code that reads some things by method a and others by method b is what I consider 'messy', and it is outlawed.

Again, once the flash is in the loop, timing is totally non-critical.

I will play a bit with far and !far and see what happens.

Erik

0 erik malund over 21 years ago in reply to Drew Davis

One question re banking:
can the 'main' bank be 64k? All I have seen says 'home bank' 32k, bank 1 32k.

Erik

0 Jon Ward over 21 years ago in reply to erik malund

If you're referring to code banking, the bank size may be anything from 0 to 64K.

Typically, you'll have a fixed common area which is stored in a 32K ROM (or something like that) and you'll have banking hardware that switches the upper 32K (or whatever's left).

But, there's nothing that prevents you from using only 8K for the common area and 56K for the banked area.

If the common area is TOO small, the compiler just merges it into each of the code banks.

Using that, you could just have 64K banks and let the compiler use whatever it needed for the common area. Of course, that area would be duplicated in each code bank (but if you keep it small, that's not really an issue). But, that may reduce the amount of development work involved.

Jon

0 Drew Davis over 21 years ago in reply to erik malund

I think you're thinking of code banking. The L51_BANK.A51 file contains routines for code banking, and also for data banking. Code banking often has a 32k common section that doesn't get swapped out, plus a 32k window that accesses different portions of the code space as need be.

XBANKING.A51 is just data banking. I suppose it could be implemented to operate on 32k windows with some appropriate shifts, though I'm not sure what the point would be with only one DPTR. (If you had two DPTRs, I could see having two windows to copy between widely separated regions of physical address space.)

Normally you just add on some high order address bits in custom hardware, perhaps an I/O port, and thus have full 64k segments (addressed by the DPTR) plus another byte that holds the "segment register" (if you want to think of it that way). Segment 0 would typically decode to the "normal" xdata space.

In your case, it sounds like P4 == 0 produces the chip select for a RAM, and P4 != 0 produces a chip select for the flash, as well as A??..A16, while A15..A0 come from the usual pins (connected to the DPTR). So you would configure XBANKING.A51 to

EXT_IN_SFR EQU 1
IF EXT_IN_SFR = 1
?C?XPAGE1SFR DATA 0B0H ; SFR Address of XPAGE1 register P4
ELSE
?C?XPAGE1ADDR EQU 0FFFAH ; XDATA address of XDATA bank register
ENDIF

?C?XPAGE1RST EQU 0 ; XPAGE1 register value to address X:0 region

and away you go.

0 Andy Neil over 21 years ago in reply to Drew Davis

Hi Erik,

Random musing here:

the 8086 was a 16-bit processor with 20-bit addressing; it formed a 20-bit address from two 16-bit registers (Segment & Offset) and a cunning shift-and-add.

Could you maybe devise a similar scheme - probably using an external CPLD or something?
Or maybe a Trisc... oops! not any more! :-(
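
For reference, the 8086 shift-and-add mentioned above can be written out in C. This only models the address arithmetic (the function name is made up); it says nothing about how a CPLD would implement it.

```c
#include <stdint.h>

/* 8086 real-mode address formation: physical = segment * 16 + offset,
   truncated to the 20 address lines of the original 8086. */
static uint32_t seg_off_to_physical(uint16_t segment, uint16_t offset)
{
    return (((uint32_t)segment << 4) + offset) & 0xFFFFFUL;
}
```

Note that many segment:offset pairs map to the same physical address, for example 1000:0000 and 0FFF:0010.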

0 erik malund over 21 years ago in reply to Jon Ward

If you're referring to code banking,
I am obviously referring to data banking

Erik