From: Manuel Collado <m.collado@fi.upm.es>
Subject: Re: Avatox 1.1: Trouble with encoding in Windows
Date: Mon, 11 Sep 2006 15:49:34 +0200
Date: 2006-09-11T15:49:34+02:00 [thread overview]
Message-ID: <4505696b@news.upm.es> (raw)
In-Reply-To: <45053aec$0$5142$9b4e6d93@newsspool1.arcor-online.net>
Georg Bauhaus escribió:
> Manuel Collado wrote:
>
>> 1. The ASIS API should provide a way to know the character encoding of
>> the source file (I think it doesn't).
>
> Yes! This will help a lot in avoiding character set issues.
> And it might help prevent dodgy arguments like the ones presented
> by implementers against the clever requirement to write the
> identifier π in the Ada 2005 library. :-)
Spanish identifiers like 'tamaño' (size) or 'año' (year) are currently
accepted by GNAT.
>
>
>> 2. The non-ASCII characters could be converted to XML character
>> references (&#nnn;) by Avatox.
>
> This is beyond my comprehension, in particular when XML does
> have standardized character set support. Numeric character
> entities will force me to look at geographical names
> like España (Spain) or Łódź (in Poland)
> written España and Łódź
> respectively.
XML markup is meant to be written and read mostly by tools, not by humans.
So it doesn't matter if a text fragment is coded as 'España' or as
'España'. In fact, after parsing, an XML processing agent cannot know
how it was coded.
My suggestion is that the Avatox encoding issue can be solved by simply
writing non-ASCII characters as XML character references just when the
final XML output is generated.
>
> Will any Ada programmer find it pleasing to even write them as
> character strings in this equivalent way?
>
> Country: Wide_String := "Espa" & Wide_Character'Val(241) & "a";
> Town: Wide_String := (
> Wide_Character'Val(16#141#),
> Wide_Character'Val(16#F3#),
> 'd',
> Wide_Character'Val(16#17A#));
>
> You could then go on and recommend writing Ada in the following style,
> just because some text editing tool that you remember might not properly
> handle white space:
>
> Town: String := "New" & Character'Val(32) & "York";
>
This is outside of scope. I've not spoken about adequate character
representation in Ada sources, just in XML documents.
> <steam>
> If programmers don't start accepting that there are more
> characters than can be expressed in 7bin ASCII and start
> making less buggy text tools (including Ada tools)
> then anyone will continue to have difficulties in international
> communications.
>
> I hope that ASIS is a chance to get this done.
> </>
Amen to that.
>
> -- Georg
Regards.
--
Manuel Collado
next prev parent reply other threads:[~2006-09-11 13:49 UTC|newest]
Thread overview: 50+ messages / expand[flat|nested] mbox.gz Atom feed top
2006-09-11 8:24 Avatox 1.0: Trouble with encoding in Windows Manuel Collado
2006-09-11 10:35 ` Georg Bauhaus
2006-09-11 13:49 ` Manuel Collado [this message]
2006-09-11 16:43 ` Avatox 1.1: " Georg Bauhaus
2006-09-11 17:50 ` Björn Persson
2006-09-12 0:06 ` Marc A. Criley
2006-09-12 8:35 ` Manuel Collado
2006-09-13 0:01 ` Avatox 1.0: " Randy Brukardt
2006-09-13 9:01 ` Georg Bauhaus
2006-09-13 19:28 ` Björn Persson
2006-09-14 6:34 ` Georg Bauhaus
2006-09-14 23:09 ` Björn Persson
2006-09-14 22:13 ` Björn Persson
2006-09-16 7:40 ` Martin Krischik
2006-09-16 9:43 ` Björn Persson
2006-09-16 9:59 ` Georg Bauhaus
2006-09-16 11:15 ` Björn Persson
2006-09-17 9:30 ` Martin Krischik
2006-09-13 10:32 ` Manuel Collado
2006-09-13 18:28 ` Björn Persson
2006-09-14 8:11 ` Manuel Collado
2006-09-13 23:05 ` Randy Brukardt
2006-09-13 11:04 ` vgodunko
2006-09-14 8:56 ` Martin Krischik
2006-09-14 21:16 ` Jeffrey R. Carter
2006-09-14 22:55 ` Björn Persson
2006-09-15 23:15 ` Jeffrey R. Carter
2006-09-16 7:38 ` Martin Krischik
2006-09-17 19:41 ` Jeffrey R. Carter
2006-09-15 5:47 ` Martin Krischik
2006-09-15 23:16 ` Jeffrey R. Carter
2006-09-16 7:31 ` Martin Krischik
2006-09-17 19:43 ` Jeffrey R. Carter
2006-09-15 9:41 ` Georg Bauhaus
2006-09-15 23:28 ` Jeffrey R. Carter
2006-09-16 9:52 ` Georg Bauhaus
2006-09-16 10:31 ` Björn Persson
2006-09-17 19:57 ` Jeffrey R. Carter
2006-09-18 0:06 ` Björn Persson
2006-09-18 20:14 ` Jeffrey R. Carter
2006-09-16 5:10 ` Simon Wright
2006-09-15 18:11 ` Pascal Obry
2006-09-15 18:53 ` Dmitry A. Kazakov
2006-09-15 22:29 ` Georg Bauhaus
2006-09-16 7:46 ` Dmitry A. Kazakov
2006-09-15 23:35 ` Jeffrey R. Carter
2006-09-15 5:34 ` Simon Wright
2006-09-12 9:52 ` Stephen Leake
2006-09-19 1:16 ` Marc A. Criley
2006-09-19 9:20 ` Stephen Leake
replies disabled
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox