"Björn Persson" <spam-away@nowhere.nil> wrote in message
news:Sxaxg.10370$E02.3445@newsb.telia.net...
>...
> How about a *hypothetical* counter-example? If you had a character with
> the code point 100000000 hexadecimal, how would you encode it in UTF-32?
> I believe it's impossible; I believe UTF-32 is a fixed-width encoding.

It would have to be hypothetical: Unicode is a 31-bit character set. Note
that I said 31 bits, not 32 bits.

Thus, UTF-32 and UCS-4 are the same *if encoding Unicode characters*.
(UTF-32 would need extra bytes to encode 32-bit characters with the high bit
on, but those would not be Unicode characters.)
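
To make the arithmetic concrete, here is a rough Python sketch that takes the
31-bit limit stated above as given. The names UCS4_MAX, UNIT_MAX, classify and
pack_unit are made up for illustration; they are not part of any standard or
library.

```python
import struct

# Assumption taken from the post above: characters occupy at most 31 bits.
UCS4_MAX = 0x7FFFFFFF
# Largest value that a single 32-bit code unit can hold.
UNIT_MAX = 0xFFFFFFFF


def classify(code_point: int) -> str:
    """Report where a value falls relative to the 31-bit and 32-bit limits."""
    if code_point < 0:
        raise ValueError("code points are non-negative")
    if code_point <= UCS4_MAX:
        return "fits in 31 bits"
    if code_point <= UNIT_MAX:
        return "needs the high bit of a 32-bit unit"
    return "does not fit in one 32-bit unit"


def pack_unit(code_point: int) -> bytes:
    """Pack a value into one big-endian 32-bit unit.

    struct.pack raises struct.error when the value exceeds 32 bits.
    """
    return struct.pack(">I", code_point)
```

The hypothetical code point 100000000 hexadecimal needs 33 bits, so classify
puts it in the last category and pack_unit refuses it, while an ordinary
character such as 41 hexadecimal packs into four bytes.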

Of course, some future character set could use more than 31 bits, but that
seems well into the future.

                         Randy Brukardt.