comp.lang.ada
 help / color / mirror / Atom feed
From: Martin Krischik <krischik@users.sourceforge.net>
Subject: Re: Supporting full Unicode
Date: Wed, 12 May 2004 12:43:22 +0200
Date: 2004-05-12T12:43:22+02:00	[thread overview]
Message-ID: <1326387.jKy1seHaPs@linux1.krischik.com> (raw)
In-Reply-To: mailman.115.1084354437.313.comp.lang.ada@ada-france.org

Marius  wrote:

>> But I would favour using UTF-8 as the internal encoding anyway.  It is
>> easy to define a UTF8_String type similar to the above.  GtkAda has
>> such a type, as GTK+ uses UTF-8 as both internal and external
>> encoding.
 
> Indeed UTF-8 seems to rule. Probably because there are more ready-to-use
> low level tools for 8-bit characters. Actually the proper tools for
> Unicode should be 24-bit based. An ugly fact about Unicode is that the
> code space is 24-bit and the encodings are all but 24 (8, 16, 32).

Not quite right. The current code space is 32 bit of which only 24 bits are
used. That of corse means that in UTF-8 a max of 4 character are used.

However, this may change when the extrateristials arrive ;-). Any program
with only 24 bit will break then.

Won't happen. Well up until recently only 16 bit where used and programmers
freely mixed UTF-16 and UCS-16. But then the archaeologist came.

Of corse currently we repeat that mistake: UTF-32 is variable length as well
and should not be mixed with UCS-32.

With regards

Martin

-- 
mailto://krischik@users.sourceforge.net
http://www.ada.krischik.com




  reply	other threads:[~2004-05-12 10:43 UTC|newest]

Thread overview: 25+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2004-05-11 17:45 Supporting full Unicode Brian Catlin
2004-05-12  7:44 ` Ludovic Brenta
2004-05-12  8:23   ` Marius Amado Alves
2004-05-12 10:43     ` Martin Krischik [this message]
2004-05-12 14:56       ` Björn Persson
2004-05-12 19:09       ` David Starner
2004-05-12 19:25     ` David Starner
2004-05-12  9:41   ` David Starner
2004-05-12 10:16     ` Björn Persson
2004-05-12 10:57       ` Ludovic Brenta
2004-05-12 14:53         ` Björn Persson
2004-05-12 18:55           ` David Starner
2004-05-12  9:30 ` Martin Krischik
2004-05-13  1:15 ` Randy Brukardt
2004-05-13 17:58   ` Brian Catlin
2004-05-13 19:42     ` Randy Brukardt
2004-05-14  8:40       ` Andersen Jacob Sparre
2004-05-14 20:20         ` Randy Brukardt
2004-05-14  4:00 ` Vadim Godunko
2004-05-14 17:51   ` Brian Catlin
  -- strict thread matches above, loose matches on Subject: below --
2004-05-12 12:40 amado.alves
2004-05-12 14:34 ` Martin Krischik
2004-05-12 18:24   ` David Starner
2004-05-12 20:04   ` Florian Weimer
2004-05-12 14:12 amado.alves
replies disabled

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox