"Bj�rn Persson" wrote in message news:Sxaxg.10370$E02.3445@newsb.telia.net... >... > How about a *hypothetical* counter-example? If you had a character with > the code point 100000000 hexadecimal, how would you encode it in UTF-32? > I believe it's impossible; I believe UTF-32 is a fixed-width encoding. It would have to be hypothetical: Unicode is a 31-bit character set. Note that I said 31 bits, not 32-bits. Thus, UTF-32 and UCS-4 are the same *if encoding Unicode characters*. (UTF-32 would need extra bytes to encode 32-bit characters with the high bit on, but those would not be Unicode characters.) Of course, some future character set could use more then 31-bits, but that seems well into the future. Randy Brukardt.