* RE: Supporting full Unicode
@ 2004-05-12 12:40 amado.alves
2004-05-12 14:34 ` Martin Krischik
0 siblings, 1 reply; 10+ messages in thread
From: amado.alves @ 2004-05-12 12:40 UTC (permalink / raw)
To: comp.lang.ada
>> An ugly fact about Unicode is that the code space is 24-bit...
> Not quite right. The current code space is 32 bit of which only 24 bits are used...
You made me go for the book to see if this had changed. It has not. The codespace is 24 bit. Actually it's 21.
"In the Unicode Standard, the codespace consists of the integers from 0 to 10FFFF..."
(The Unicode Standard, Version 4.0.1, section 2.4)
^ permalink raw reply [flat|nested] 10+ messages in thread
* RE: Supporting full Unicode
2004-05-12 12:40 Supporting full Unicode amado.alves
@ 2004-05-12 14:34 ` Martin Krischik
2004-05-12 18:24 ` David Starner
2004-05-12 20:04 ` Florian Weimer
0 siblings, 2 replies; 10+ messages in thread
From: Martin Krischik @ 2004-05-12 14:34 UTC (permalink / raw)
amado.alves wrote:
>>> An ugly fact about Unicode is that the code space is 24-bit...
>
>> Not quite right. The current code space is 32 bit of which only 24 bits
>> are used...
>
> You made me go for the book to see if this had changed. It has not. The
> codespace is 24 bit. Actually it's 21.
>
> "In the Unicode Standard, the codespace consists of the integers from 0 to
> 10FFFF..."
>
> (The Unicode Standard, Version 4.0.1, section 2.4)
If you mean the currently used code space then yes. But they extend the
codepace from time to time
With Regards
Martin
--
mailto://krischik@users.sourceforge.net
http://www.ada.krischik.com
^ permalink raw reply [flat|nested] 10+ messages in thread
* RE: Supporting full Unicode
2004-05-12 14:34 ` Martin Krischik
@ 2004-05-12 18:24 ` David Starner
2004-05-13 10:09 ` [OT:fun] " Marius Amado Alves
2004-05-12 20:04 ` Florian Weimer
1 sibling, 1 reply; 10+ messages in thread
From: David Starner @ 2004-05-12 18:24 UTC (permalink / raw)
On Wed, 12 May 2004 16:34:34 +0200, Martin Krischik wrote:
> amado.alves wrote:
>
>>
>> You made me go for the book to see if this had changed. It has not. The
>> codespace is 24 bit. Actually it's 21.
>>
>> "In the Unicode Standard, the codespace consists of the integers from 0 to
>> 10FFFF..."
>>
>> (The Unicode Standard, Version 4.0.1, section 2.4)
>
> If you mean the currently used code space then yes. But they extend the
> codepace from time to time
No, they don't. They extended it once, in the Unicode/ISO-10646 merge in
1996, as well as moving characters around. There is no plan to do either
ever again.
There's no reason to extend the code space, either. There's over a million
code points, and in a decade they've managed to fill about a hundred
thousand of them. The Unicode roadmaps <http://www.unicode.org/roadmaps/>
have space blocked out for every character they think they might want to
encode, from Rongorong to Egyptian Hieroglyphics. All of it fits in
three planes of 65,536 characters with plenty of room to spare on one of
them. It's though that Chinese character might exceed their current plane,
and need a new one. Even with two planes for Chinese, there's still only
four planes in use, plus two planes permanently dedicated to Private Use
characters, leaving 650,000+ characters with no conceivable use. Short of
extraterrestrials, there's no point in extending the code space.
^ permalink raw reply [flat|nested] 10+ messages in thread
* [OT:fun] Re: Supporting full Unicode
2004-05-12 18:24 ` David Starner
@ 2004-05-13 10:09 ` Marius Amado Alves
2004-05-13 10:46 ` Ludovic Brenta
0 siblings, 1 reply; 10+ messages in thread
From: Marius Amado Alves @ 2004-05-13 10:09 UTC (permalink / raw)
To: comp.lang.ada
[when the extraterrestrials come...]
Unicode will be renamed Earthcode, and become a part of Galacticode, a
42-bit code space managed from Vulcan, which will eventually be renamed to
Lacteacode and become a part of Clustercode. By then language is not
evolving anymore so there is a code for each word. Clustercode will finally
become a part of Unicode--this time named right. By then everything has been
said so there is a code for each sentence. At some point in this timeline
the ISO (Intergalactical Standards Organization) went out of business
because 1,000,000,000,000-page printed volumes became unpractical to ship.
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: Supporting full Unicode
2004-05-12 14:34 ` Martin Krischik
2004-05-12 18:24 ` David Starner
@ 2004-05-12 20:04 ` Florian Weimer
1 sibling, 0 replies; 10+ messages in thread
From: Florian Weimer @ 2004-05-12 20:04 UTC (permalink / raw)
* Martin Krischik:
> If you mean the currently used code space then yes. But they extend the
> codepace from time to time
There's a promise from both the Unicode Consortium and the WG behind
ISO 10646-1 that they won't assign characters outside the first
seventeen planes.
--
Current mail filters: many dial-up/DSL/cable modem hosts, and the
following domains: atlas.cz, bigpond.com, di-ve.com, hotmail.com,
jumpy.it, libero.it, netscape.net, postino.it, simplesnet.pt,
tiscali.co.uk, tiscali.cz, tiscali.it, voila.fr, yahoo.com.
^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: [OT:fun] Re: Supporting full Unicode
@ 2004-05-13 17:03 Marius Amado Alves
0 siblings, 0 replies; 10+ messages in thread
From: Marius Amado Alves @ 2004-05-13 17:03 UTC (permalink / raw)
To: comp.lang.ada
> Yes, I long for the Babel fish.
Which is of course an embedded application written in Ada 100000 integrating
the Clustercode database as a Mneson graph of 300Z links stored in a quantum
chip.
>
^ permalink raw reply [flat|nested] 10+ messages in thread
end of thread, other threads:[~2004-05-17 0:49 UTC | newest]
Thread overview: 10+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2004-05-12 12:40 Supporting full Unicode amado.alves
2004-05-12 14:34 ` Martin Krischik
2004-05-12 18:24 ` David Starner
2004-05-13 10:09 ` [OT:fun] " Marius Amado Alves
2004-05-13 10:46 ` Ludovic Brenta
[not found] ` <00b101c4390b$c2a8ea90$310d5451@BACUS>
[not found] ` <87fza4vtxj.fsf@insalien.org>
2004-05-14 11:58 ` Marius Amado Alves
2004-05-16 13:47 ` Robert I. Eachus
2004-05-17 0:49 ` Alexander Kopilovitch
2004-05-12 20:04 ` Florian Weimer
-- strict thread matches above, loose matches on Subject: below --
2004-05-13 17:03 [OT:fun] " Marius Amado Alves
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox