comp.lang.ada
 help / color / mirror / Atom feed
From: "Randy Brukardt" <randy@rrsoftware.com>
Subject: Re: GNAT vs UTF-8 source file names
Date: Wed, 5 Jul 2017 13:42:14 -0500
Date: 2017-07-05T13:42:14-05:00	[thread overview]
Message-ID: <ojjbu7$mga$1@franka.jacob-sparre.dk> (raw)
In-Reply-To: ojihrl$qu2$1@dont-email.me

[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1: Type: text/plain, Size: 1503 bytes --]

"J-P. Rosen" <rosen@adalog.fr> wrote in message 
news:ojihrl$qu2$1@dont-email.me...
> Le 05/07/2017 à 11:47, Simon Wright a écrit :
>> That reference specifies NFKC which I suppose is near!
> Not that near when it comes to ligatures and other crazy characters...
> But you are right, it's NFKC.

Actually, you were right the first time, but it doesn't show up in the Ada 
2012 as this is a recent correction (recall AI12-0004-1? It was just 
approved by WG 9 at the June meeting). NFKC is *definitely* the wrong rule.

Note that we chose NFC in part because WC3 recommends that all Internet 
content be in NFC, and because it is the more compact representation. I'm 
surprised that anyone would use NFD (since it can be three times larger than 
NFC), but I suppose I shouldn't ever be surprised by the choices of others. 
;-)

As always, you can see the *current* state of Ada by using the working draft 
RM (see http://www.ada-auth.org/standards/ada2x.html). For this rule, that 
is 2.1(4.1/5).

I suppose the working draft is a bit confusing for this use (that is, 
Ada-Comment) as corrections (like this) take effect immediately upon WG 9 
approval while amendments don't take effect until the next Standard update. 
You can tell them apart by looking at the bottom of each subclause at the 
"<something> from Ada 2012" (for instance, "Wording Changes from Ada 
2012") -- "corrections" are identified that way, while amendments are not 
identified specially.

                               Randy. 


  reply	other threads:[~2017-07-05 18:42 UTC|newest]

Thread overview: 22+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2017-04-30 17:10 GNAT vs UTF-8 source file names Simon Wright
2017-06-17 17:20 ` Simon Wright
2017-06-27 13:22   ` Jacob Sparre Andersen
2017-06-27 21:45     ` Niklas Holsti
2017-06-28  5:05       ` G.B.
2017-07-04 13:57   ` Simon Wright
2017-07-04 17:30     ` Shark8
2017-07-04 18:08       ` Dennis Lee Bieber
2017-07-05  5:25       ` J-P. Rosen
2017-07-06 15:18         ` Shark8
2017-07-07  8:19           ` J-P. Rosen
2017-07-05  5:21     ` J-P. Rosen
2017-07-05  9:47       ` Simon Wright
2017-07-05 11:20         ` J-P. Rosen
2017-07-05 18:42           ` Randy Brukardt [this message]
2017-07-06 18:43           ` Simon Wright
2017-07-07  8:26             ` J-P. Rosen
2017-07-07 11:01               ` Simon Wright
2017-07-07 11:49                 ` Jacob Sparre Andersen
2017-07-07 19:44                   ` Randy Brukardt
2017-07-07 19:40                 ` Randy Brukardt
2017-07-07 21:02                   ` Simon Wright
replies disabled

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox