comp.lang.ada
 help / color / mirror / Atom feed
From: Georg Bauhaus <rm.dash-bauhaus@futureapps.de>
Subject: Re: strange behaviour of utf-8 files
Date: Fri, 06 Dec 2013 03:17:02 +0100
Date: 2013-12-06T03:17:02+01:00	[thread overview]
Message-ID: <52a1339e$0$9505$9b4e6d93@newsspool1.arcor-online.net> (raw)
In-Reply-To: <l6pa3q$h87$1@loke.gir.dk>

On 23.11.13 05:14, Randy Brukardt wrote:
> "Shark8" <onewingedshark@gmail.com> wrote
>> Not a lot of demand for UTF-8, or not a lot of demand for Ada-2012 [from
>> the customers]?
>
> Not a lot of demand for UTF-8 or wide characters in general. As far as Ada
> 2012 goes, if I want to use a feature, it somehow gets in the compiler. :-)
> Customer demand not required (but it always helps).

Actually, programmers seem to suppress existing demand.

Equating "customers" to "consumers" of software for the moment
(who pays?), customers suffer from ASCII-fied communication in
ways that would not be accepted if written on paper.
I got a terribly malformed computer generated messages from no lesser
company than DHL (inspiring this follow up).

"??" in the mail text quoted below has obviously been put in place
of what was perfectly UTF-8 encoded character data. (In the mail's
source text, to be sure.)  The non-ASCII character is 'ü' (16#FC#)
in both cases (L.8, L.10):

+======================================================================+
Subject: Ihre Sendung wurde in eine FILIALE umgeleitet
MIME-Version: 1.0
Content-Type: text/plain; charset=ANSI_X3.4-1968
Content-Transfer-Encoding: 7bit

Guten Tag Herr Georg Bauhaus,

leider konnte Ihre Sendung  NICHT in die gew??nschte PACKSTATION eingestellt werden.

Die Sendung liegt f??r Sie in der FILIALE (...)
+======================================================================+

Ironically, the messages are produced using an industry standard Java
framework while Java's char data are not 7bit ASCII:
  
Message-ID: <...48667.JavaMail.ypqbson@HANPQ021>

These messages used to be o.K. in the past. Judging by the count of excess
spaces and long and empty lines in the message, I guess they are having some
competitive programming shop streamline their software.

Character set support can be a real issue when the use of ASCII leads
to misprints of addresses, or to ambiguity in legal documents. Consider
families
    Joseph Müller (16#FC#)
and
   Joseph Möller (16#F6#)
each owning a flat in the same house. If rendered

   Fam. Joseph M??ller
   X Str. 15
   ...

and

   Fam. Joseph M??ller
   X Str. 15
   ...

respectively, what is the postman to do?

Proper support for encoding all characters is a necessity!

  reply	other threads:[~2013-12-06  2:17 UTC|newest]

Thread overview: 33+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-11-16 13:12 strange behaviour of utf-8 files Stoik
2013-11-16 13:34 ` Dmitry A. Kazakov
2013-11-16 15:09   ` Stoik
2013-11-16 15:55     ` Dmitry A. Kazakov
2013-11-17 13:32       ` Georg Bauhaus
2013-11-17 14:07         ` Dmitry A. Kazakov
2013-11-17 17:19           ` Dennis Lee Bieber
2013-11-17 18:07             ` Dmitry A. Kazakov
2013-11-17 19:05           ` Georg Bauhaus
2013-11-17 20:38             ` Dmitry A. Kazakov
2013-11-18  8:38               ` Georg Bauhaus
2013-11-18  9:01                 ` Dmitry A. Kazakov
2013-11-18 10:06                   ` Georg Bauhaus
2013-11-18  8:44               ` Georg Bauhaus
2013-11-18 10:24                 ` Dmitry A. Kazakov
2013-11-18 13:05                   ` G.B.
2013-11-18 15:25                     ` Dmitry A. Kazakov
2013-11-18 15:51                       ` G.B.
2013-11-18 17:34                         ` Dmitry A. Kazakov
2013-11-18  0:34           ` Stoik
2013-11-16 17:01     ` Georg Bauhaus
2013-11-17 10:38       ` Stoik
2013-11-16 15:12   ` Stoik
2013-11-16 15:57     ` Dmitry A. Kazakov
2013-11-17 11:12       ` Stoik
2013-11-22  1:03         ` Randy Brukardt
2013-11-22  3:02           ` Shark8
2013-11-22 11:54             ` Georg Bauhaus
2013-11-23  4:14             ` Randy Brukardt
2013-12-06  2:17               ` Georg Bauhaus [this message]
2013-11-16 20:06     ` Peter C. Chapin
2013-11-17 10:34       ` Stoik
2013-11-22  0:53       ` Randy Brukardt
replies disabled

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox