From: Georg Bauhaus <sb463ba@l1-hrz.uni-duisburg.de>
Subject: Re: UTF-8 in strings - a bug?
Date: Sun, 9 May 2004 12:16:07 +0000 (UTC)
Message-ID: <c7l7e7$pdn$1@a1-hrz.uni-duisburg.de>
In-Reply-To: 3171026.RJblE7u9LK@linux1.krischik.com
Martin Krischik <krischik@users.sourceforge.net> wrote:
: The UTF-X encodings can start with a BOM "Byte-order mark".
However, systems are allowed to define protocols that restrict
the use of a BOM in the case of UTF-8 (require it or forbid it).
A #! shell script is an example: the #! must be the first two
bytes of the file, so a BOM is effectively forbidden there.
A BOM is said to be useful to distinguish a UTF-8 Unicode file
from a file using another 8-bit encoding. Though I wonder how,
when the BOM is absent, they think a program can find out which
of the other encodings has been used...
-- Georg
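
To make the two points above concrete, here is a minimal Python sketch
(not part of the original message). The helper name sniff_utf8_bom and
the sample strings are hypothetical; only codecs.BOM_UTF8 comes from
the standard library. It shows that a leading BOM positively marks
UTF-8, that its absence says nothing about which encoding was used,
and that a BOM in front of #! means the file no longer starts with
those two bytes.

```python
import codecs
from typing import Tuple


def sniff_utf8_bom(data: bytes) -> Tuple[bool, bytes]:
    """Hypothetical helper: return (has_bom, payload_without_bom)."""
    if data.startswith(codecs.BOM_UTF8):
        # The three bytes EF BB BF are the UTF-8 encoded BOM.
        return True, data[len(codecs.BOM_UTF8):]
    # No BOM: could be UTF-8 without a BOM, Latin-1, or anything else.
    return False, data


utf8_with_bom = codecs.BOM_UTF8 + "Björn".encode("utf-8")
utf8_no_bom = "Björn".encode("utf-8")
latin1 = "Björn".encode("latin-1")

# Only the first sample is positively identified.
has_bom_flags = [sniff_utf8_bom(d)[0] for d in (utf8_with_bom, utf8_no_bom, latin1)]

# A script with a BOM in front no longer begins with the bytes "#!".
script_with_bom = codecs.BOM_UTF8 + b"#!/bin/sh\necho hello\n"
starts_with_shebang = script_with_bom.startswith(b"#!")
```

The two BOM-less samples are indistinguishable to this check, which is
the objection raised above: the missing BOM rules nothing in.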