comp.lang.ada
From: Manuel Collado <m.collado@fi.upm.es>
Subject: Re: Avatox 1.1: Trouble with encoding in Windows
Date: Mon, 11 Sep 2006 15:49:34 +0200
Message-ID: <4505696b@news.upm.es>
In-Reply-To: <45053aec$0$5142$9b4e6d93@newsspool1.arcor-online.net>

Georg Bauhaus wrote:
> Manuel Collado wrote:
> 
>> 1. The ASIS API should provide a way to know the character encoding of
>> the source file (I think it doesn't).
> 
> Yes! This will help a lot in avoiding character set issues.
> And it might help prevent dodgy arguments like the ones presented
> by implementers against the clever requirement to write the
> identifier π in the Ada 2005 library. :-)

Spanish identifiers like 'tamaño' (size) or 'año' (year) are currently 
accepted by GNAT.

> 
> 
>> 2. The non-ASCII characters could be converted to XML character
>> references (&#nnn;) by Avatox.
> 
> This is beyond my comprehension, in particular when XML does
> have standardized character set support. Numeric character
> entities will force me to look at geographical names
> like España (Spain) or Łódź (in Poland)
> written  Espa&#xF1;a  and  &#x141;&#xF3;d&#x17A;
> respectively.

XML markup is meant to be written and read mostly by tools, not by humans. 
So it doesn't matter if a text fragment is coded as 'España' or as 
'Espa&#xF1;a'. In fact, after parsing, an XML processing agent cannot know 
how it was coded.

My suggestion is that the Avatox encoding issue can be solved simply by
writing non-ASCII characters as XML character references, and doing so
only at the point where the final XML output is generated.
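
As a rough sketch of that suggestion (not Avatox's actual code), the
conversion could be one function applied to text just before it is written
out. The name To_XML_Text is hypothetical, and the sketch assumes the text
is already available as a Wide_String. It also escapes the markup
characters '&', '<' and '>', which any XML writer has to do anyway:

```ada
with Ada.Strings.Fixed;
with Ada.Strings.Unbounded; use Ada.Strings.Unbounded;

--  Hypothetical helper, for illustration only: returns a pure-ASCII
--  String in which every non-ASCII character of Item is replaced by a
--  decimal XML character reference (&#nnn;).
function To_XML_Text (Item : Wide_String) return String is
   Result : Unbounded_String;
begin
   for I in Item'Range loop
      declare
         Code : constant Natural := Wide_Character'Pos (Item (I));
      begin
         if Code < 128 then
            --  ASCII: copy as-is, except for XML markup characters.
            case Character'Val (Code) is
               when '&'    => Append (Result, "&amp;");
               when '<'    => Append (Result, "&lt;");
               when '>'    => Append (Result, "&gt;");
               when others => Append (Result, Character'Val (Code));
            end case;
         else
            --  Non-ASCII: emit &#nnn; ('Image has a leading blank).
            Append
              (Result,
               "&#"
               & Ada.Strings.Fixed.Trim
                   (Natural'Image (Code), Ada.Strings.Left)
               & ";");
         end if;
      end;
   end loop;
   return To_String (Result);
end To_XML_Text;
```

The result contains only ASCII, so it reads the same under any
ASCII-compatible encoding declared for the output file. The sketch does not
handle control characters that XML 1.0 forbids, nor surrogate pairs in a
Wide_String; both would need extra care in a real tool.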

> 
> Will any Ada programmer find it pleasing to even write them as
> character strings in this equivalent way?
> 
>    Country: Wide_String := "Espa" & Wide_Character'Val(241) & "a";
>    Town: Wide_String := (
>       Wide_Character'Val(16#141#),
>       Wide_Character'Val(16#F3#),
>       'd',
>       Wide_Character'Val(16#17A#));
> 
> You could then go on and recommend writing Ada in the following style,
> just because some text editing tool that you remember might not properly
> handle white space:
> 
>    Town: String := "New" & Character'Val(32) & "York";
> 

That is out of scope. I was not talking about how characters should be
represented in Ada sources, only in XML documents.


> <steam>
> If programmers don't start accepting that there are more
> characters than can be expressed in 7-bit ASCII and start
> making less buggy text tools (including Ada tools)
> then anyone will continue to have difficulties in international
> communications.
> 
> I hope that ASIS is a chance to get this done.
> </>

Amen to that.

> 
> -- Georg

Regards.
-- 
Manuel Collado


