Hi,
how is it possible to add Cyrillic scripts in one of my c-files (working with mdk version 3.4)?
const char text_cyrillic = {"Cyrillic script"};
best regards Arne
UTF8 only makes use of safe break characters. It doesn't interfere with the original 7-bit ASCII standard. To the compiler, it will just be a sequence of single-byte characters to copy from source into string constants. If the high bit is zero it's a one-byte character. if the two high bits are 11, it's a break character, signalling that two, three or four bytes are needed. if the two high bits are 10, it's one of the following bytes in a multi-byte character. In no way will this interfere with the compiler, as long as the compiler supports 8-bit data.
It is not possible to create a UTF8 text string where any character but the " will have a byte with the numercial value of the ". Same with all other critical characters in C, since all tokens in C are within the 7-bit ASCII set.
I don't think I have seen a compiler that doesn't handles 8-bit data in a long, long time, since such a compiler would not even be able the 8-bit code pages used in text mode on a PC.
Generic multibyte character sets on the other hand may not be safe since there is no guarantee that the follow-up bytes stays out of the old 7-bit ASCII range.