comp.lang.ada
From: "Dmitry A. Kazakov" <mailbox@dmitry-kazakov.de>
Subject: Re: Avatox 1.0: Trouble with encoding in Windows
Date: Sat, 16 Sep 2006 09:46:21 +0200
Date: 2006-09-16T09:46:05+02:00
Message-ID: <d2k1w4avbl71.a3iefvlvh858.dlg@40tude.net>
In-Reply-To: 1158359363.29388.36.camel@localhost.localdomain

On Sat, 16 Sep 2006 00:29:24 +0200, Georg Bauhaus wrote:

> On Fri, 2006-09-15 at 20:53 +0200, Dmitry A. Kazakov wrote:
> 
>> IMO, the idea to use Unicode for program sources is wrong. The language (be
>> it formal or natural) should have a finite and reasonably small alphabet.
>> Unicode is practically an open-end set of symbols most of them you wouldn't
>> be able to either recognize or remember again.
> 
> Unicode is quite flexible and allows a project to choose a reasonable
> subset of characters. A portable subset is fairly easy to describe
> because both Ada and UCS define a common character set from which you
> can choose. No lengthy discussions of how to interpret 8 bits,
> no issues with conforming compilers.

Do you disagree with the point? How can a language be based on multiple
alphabets? [you are talking about subsets] Would it still be one language?
In history there are examples of written natural languages changing
alphabets.

> Greek.Ω /= Electric.Ω is an issue in Ada 95, too, when you
> use local character sets for two different files.
> 
> Shou1d the number l, sorry, 1, not occur in source text, because it
> is too easy to miss the difference, so please, remove it from the
> Ada grammar? ;-)

That is an issue of choosing a proper typeface. But the Omega glyph is the
same; only the code positions (the semantic meaning of the symbol, Ohm vs.
Greek Omega) are different. Exactly this is wrong, because the semantics of
a symbol is to be defined solely by the language, by Ada in our case.
Unicode is not a language so far; however, nothing would prevent us from
defining a Unicode position for any possible Ada program... (:-))
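
To make the glyph vs. code position distinction concrete, here is a minimal
sketch. It is in Python rather than Ada, only because Python's standard
unicodedata module exposes character names and normalization directly. The
helper names describe and same_after_nfc are hypothetical, made up for this
illustration. It shows that Greek Omega and the Ohm sign are distinct code
positions, and that canonical normalization (NFC) maps the Ohm sign onto
Greek Omega.

```python
import unicodedata

GREEK_OMEGA = "\u03a9"  # GREEK CAPITAL LETTER OMEGA
OHM_SIGN = "\u2126"     # OHM SIGN; usually drawn with the same glyph


def describe(ch: str) -> str:
    """Return the code position and Unicode name of one character."""
    return f"U+{ord(ch):04X} {unicodedata.name(ch)}"


def same_after_nfc(a: str, b: str) -> bool:
    """Compare two strings after canonical composition (NFC)."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


if __name__ == "__main__":
    print(describe(GREEK_OMEGA))
    print(describe(OHM_SIGN))
    # Raw comparison: two different code positions.
    print(GREEK_OMEGA == OHM_SIGN)
    # After NFC the Ohm sign is replaced by Greek Omega.
    print(same_after_nfc(GREEK_OMEGA, OHM_SIGN))
```

Whether a compiler or a project style checker should apply such a
normalization to identifiers is a separate question; the sketch only shows
what the character data says.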

> You can extend the Unicode subset chosen for the project later, without
> introducing ambiguity or a configuration issue. Using Unicode for
> program source text lets you write identifiers that just cannot coexist
> in Latin_1, or any 8bit character set.

There are many ways to make code unmaintainable, like writing identifiers
in the Linear B syllabary...

-- 
Regards,
Dmitry A. Kazakov
http://www.dmitry-kazakov.de




