comp.lang.ada
 help / color / mirror / Atom feed
From: Martin Krischik <krischik@users.sourceforge.net>
Subject: Re: UTF-8 in strings - a bug?
Date: Mon, 10 May 2004 08:29:50 +0200
Date: 2004-05-10T08:29:50+02:00	[thread overview]
Message-ID: <3878175.nfHeE0N58X@linux1.krischik.com> (raw)
In-Reply-To: c7l7e7$pdn$1@a1-hrz.uni-duisburg.de

Georg Bauhaus wrote:

> Martin Krischik <krischik@users.sourceforge.net> wrote:
>  
> : The UTF-X encodings can start with a BOM "Byte-order mark".
> 
> However, systems are allowed to define protocols which may
> restrict the use of a BOM in case of UTF-8 (require/forbid).
> A #!/shell script is an example.
> 
> A BOM is said to be useful to distinguish a UTF-8 Unicode file
> from a file using another 8bit encoding. Though I wonder how by
> the absence of the Unicode BOM they think a program can find
> out which of the other encodings has been used...

XML/Ada does some guessing on the the usual beginning of an xml file.

Apart from that I guess they can't..

With Regards

Martin
-- 
mailto://krischik@users.sourceforge.net
http://www.ada.krischik.com




  reply	other threads:[~2004-05-10  6:29 UTC|newest]

Thread overview: 16+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2004-05-05 22:12 UTF-8 in strings - a bug? Björn Persson
2004-05-05 23:31 ` Robert I. Eachus
2004-05-06  8:34   ` Björn Persson
2004-05-06  9:25     ` Ludovic Brenta
2004-05-06 17:13       ` Björn Persson
2004-05-06 18:24       ` Martin Krischik
2004-05-07 23:32         ` Björn Persson
2004-05-08  6:38           ` Martin Krischik
2004-05-08  7:44           ` Jacob Sparre Andersen
2004-05-08 11:06             ` Björn Persson
2004-05-08 16:25               ` Martin Krischik
2004-05-09 12:16                 ` Georg Bauhaus
2004-05-10  6:29                   ` Martin Krischik [this message]
2004-05-08 12:10           ` Georg Bauhaus
2004-05-06  9:06 ` David Starner
2004-05-06 17:36   ` Björn Persson
replies disabled

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox