comp.lang.ada
 help / color / mirror / Atom feed
* Tiny error in ARM 2012 ? (about characters)
@ 2011-01-23 23:43 Yannick Duchêne (Hibou57)
  2011-01-24  0:08 ` Yannick Duchêne (Hibou57)
  0 siblings, 1 reply; 11+ messages in thread
From: Yannick Duchêne (Hibou57) @ 2011-01-23 23:43 UTC (permalink / raw)


Hillo,

I do not know if I've missed something. I am currently adding an Ada  
colorizer to an editor, and I decided to use the ARM 2012 instead of ARM  
2005 (for fun). It appears the wording in 2.1 are not exactly the same any  
more as in ARM 2005 (didn't suspected this section could be updated).

Unless I've missed something, is appears to exclude the space in source  
text.

2.1 is here:
http://www.ada-auth.org/standards/12rm/html/RM-2-1.html

2.1(4/3) says
> The only characters allowed outside of comments are those in
> categories other_format, format_effector, and graphic_character.

Other,Format:
http://www.fileformat.info/info/unicode/category/Cf/list.htm
No space

Format_Effector, defined by 2.1(13/2) to be
U+09..U+0D,U+85, and Separator,Line and Separator,Paragraph.

Separator,Line
http://www.fileformat.info/info/unicode/category/Zl/list.htm
is U+2028
Separator,Paragraph
http://www.fileformat.info/info/unicode/category/Zp/list.htm
is U+2029

 From a strict interpretation (in my possibly erroneous interpretation) of  
the ARM 2012, the character U+20 is not allowed outside of comments.

Seem the category Separator,Space
http://www.fileformat.info/info/unicode/category/Zs/list.htm
is missing from Format_Effector. It is safe, as luckily this does not  
include the zero-width space (U=200B) nor the zero-width no-break space  
(U+FEFF), which would be both unsafe. Separator,Space is defined in  
2.1(11/2), but not referred to.

Guess if I missed something, I am really foolish here :D (oops)


-- 
Si les chats miaulent et font autant de vocalises bizarres, c’est pas pour  
les chiens.

“I am fluent in ASCII” [Warren 2010]



^ permalink raw reply	[flat|nested] 11+ messages in thread

* Tiny error in ARM 2012 ? (about characters)
@ 2011-01-23 23:43 Yannick Duchêne (Hibou57)
  2011-01-24  2:16 ` Jeffrey Carter
                   ` (2 more replies)
  0 siblings, 3 replies; 11+ messages in thread
From: Yannick Duchêne (Hibou57) @ 2011-01-23 23:43 UTC (permalink / raw)


Hillo,

I do not know if I've missed something. I am currently adding an Ada  
colorizer to an editor, and I decided to use the ARM 2012 instead of ARM  
2005 (for fun). It appears the wording in 2.1 are not exactly the same any  
more as in ARM 2005 (didn't suspected this section could be updated).

Unless I've missed something, is appears to exclude the space in source  
text.

2.1 is here:
http://www.ada-auth.org/standards/12rm/html/RM-2-1.html

2.1(4/3) says
> The only characters allowed outside of comments are those in
> categories other_format, format_effector, and graphic_character.

Other,Format:
http://www.fileformat.info/info/unicode/category/Cf/list.htm
No space

Format_Effector, defined by 2.1(13/2) to be
U+09..U+0D,U+85, and Separator,Line and Separator,Paragraph.

Separator,Line
http://www.fileformat.info/info/unicode/category/Zl/list.htm
is U+2028
Separator,Paragraph
http://www.fileformat.info/info/unicode/category/Zp/list.htm
is U+2029

 From a strict interpretation (in my possibly erroneous interpretation) of  
the ARM 2012, the character U+20 is not allowed outside of comments.

Seem the category Separator,Space
http://www.fileformat.info/info/unicode/category/Zs/list.htm
is missing from Format_Effector. It is safe, as luckily this does not  
include the zero-width space (U=200B) nor the zero-width no-break space  
(U+FEFF), which would be both unsafe. Separator,Space is defined in  
2.1(11/2), but not referred to.

Guess if I missed something, I am really foolish here :D (oops)


-- 
Si les chats miaulent et font autant de vocalises bizarres, c’est pas pour  
les chiens.

“I am fluent in ASCII” [Warren 2010]



^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Tiny error in ARM 2012 ? (about characters)
  2011-01-23 23:43 Yannick Duchêne (Hibou57)
@ 2011-01-24  0:08 ` Yannick Duchêne (Hibou57)
  0 siblings, 0 replies; 11+ messages in thread
From: Yannick Duchêne (Hibou57) @ 2011-01-24  0:08 UTC (permalink / raw)


Le Mon, 24 Jan 2011 00:43:26 +0100, Yannick Duchêne (Hibou57)  
<yannick_duchene@yahoo.fr> a écrit:
> Seem the category Separator,Space
> http://www.fileformat.info/info/unicode/category/Zs/list.htm
> is missing from Format_Effector. It is safe, as luckily this does not  
> include the zero-width space (U=200B) nor the zero-width no-break space  
> (U+FEFF), which would be both unsafe. Separator,Space is defined in  
> 2.1(11/2), but not referred to.
No, the Separator_Space should not be part or Format_Effector, but should  
be listed in 2.1(4/3).

-- 
Si les chats miaulent et font autant de vocalises bizarres, c’est pas pour  
les chiens.

“I am fluent in ASCII” [Warren 2010]



^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Tiny error in ARM 2012 ? (about characters)
  2011-01-23 23:43 Tiny error in ARM 2012 ? (about characters) Yannick Duchêne (Hibou57)
@ 2011-01-24  2:16 ` Jeffrey Carter
  2011-01-24  2:32   ` Yannick Duchêne (Hibou57)
  2011-01-24  8:17 ` Yannick Duchêne (Hibou57)
  2011-01-25  1:52 ` Randy Brukardt
  2 siblings, 1 reply; 11+ messages in thread
From: Jeffrey Carter @ 2011-01-24  2:16 UTC (permalink / raw)


On 01/23/2011 04:43 PM, Yannick Duchêne (Hibou57) wrote:
>
> Seem the category Separator,Space
> http://www.fileformat.info/info/unicode/category/Zs/list.htm
> is missing from Format_Effector. It is safe, as luckily this does not include
> the zero-width space (U=200B) nor the zero-width no-break space (U+FEFF), which
> would be both unsafe. Separator,Space is defined in 2.1(11/2), but not referred to.

 From http://www.ada-auth.org/standards/12aarm/html/AA-2-1.html:

11/2
  {AI95-00285-01} {separator_space} separator_space

Any character whose General Category is defined to be “Separator, Space”.

-- 
Jeff Carter
"We use a large, vibrating egg."
Annie Hall
44



^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Tiny error in ARM 2012 ? (about characters)
  2011-01-24  2:16 ` Jeffrey Carter
@ 2011-01-24  2:32   ` Yannick Duchêne (Hibou57)
  0 siblings, 0 replies; 11+ messages in thread
From: Yannick Duchêne (Hibou57) @ 2011-01-24  2:32 UTC (permalink / raw)


Le Mon, 24 Jan 2011 03:16:26 +0100, Jeffrey Carter  
<spam.jrcarter.not@spam.not.acm.org> a écrit:

>  From http://www.ada-auth.org/standards/12aarm/html/AA-2-1.html
>
> 11/2
>   {AI95-00285-01} {separator_space} separator_space
>
> Any character whose General Category is defined to be “Separator, Space”.
Hello Jeffrey,

Yes I saw this one, but as I then said in a reply to my own post (in the  
other instance of this thread, sorry I accidentally posted this thread  
twice), I noticed although this Separator_Space is defined, it is not  
referred to. If you follow the rules given in 2.1, Separator_Space,  
although defined, is not made part of valid character outside of comments  
(nor inside of comments BTW).


-- 
Si les chats miaulent et font autant de vocalises bizarres, c’est pas pour  
les chiens.

“I am fluent in ASCII” [Warren 2010]



^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Tiny error in ARM 2012 ? (about characters)
  2011-01-23 23:43 Tiny error in ARM 2012 ? (about characters) Yannick Duchêne (Hibou57)
  2011-01-24  2:16 ` Jeffrey Carter
@ 2011-01-24  8:17 ` Yannick Duchêne (Hibou57)
  2011-01-24  8:36   ` Yannick Duchêne (Hibou57)
  2011-01-25  1:52 ` Randy Brukardt
  2 siblings, 1 reply; 11+ messages in thread
From: Yannick Duchêne (Hibou57) @ 2011-01-24  8:17 UTC (permalink / raw)


Le Mon, 24 Jan 2011 00:43:27 +0100, Yannick Duchêne (Hibou57)  
<yannick_duchene@yahoo.fr> a écrit:
> http://www.ada-auth.org/standards/12rm/html/RM-2-1.html

Then
http://www.ada-auth.org/standards/12rm/html/RM-2-2.html
says

> 3/2 In some cases an explicit separator is required to separate
> adjacent lexical elements. A separator is any of a separator_space,
> a format_effector, or the end of a line, as follows:

and later


> 7.1/3 One of more other_format characters are allowed anywhere that a
> separator is; any such characters have no effect on the meaning of an
> Ada program.

So why not simply remove 7.1/3 and add other_format in the list which  
forms separators in 3/2 ?

-- 
Si les chats miaulent et font autant de vocalises bizarres, c’est pas pour  
les chiens.

“I am fluent in ASCII” [Warren 2010]



^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Tiny error in ARM 2012 ? (about characters)
  2011-01-24  8:17 ` Yannick Duchêne (Hibou57)
@ 2011-01-24  8:36   ` Yannick Duchêne (Hibou57)
  2011-01-24  9:00     ` AdaMagica
  2011-01-25  1:48     ` Randy Brukardt
  0 siblings, 2 replies; 11+ messages in thread
From: Yannick Duchêne (Hibou57) @ 2011-01-24  8:36 UTC (permalink / raw)


Le Mon, 24 Jan 2011 09:17:21 +0100, Yannick Duchêne (Hibou57)  
<yannick_duchene@yahoo.fr> a écrit:
> So why not simply remove 7.1/3 and add other_format in the list which  
> forms separators in 3/2 ?

May be an other_format is simply not enough as an explicit separator,  
which is the purpose of 3/2 (don't remember what these code-points are,  
will have a look-back at it).


-- 
Si les chats miaulent et font autant de vocalises bizarres, c’est pas pour  
les chiens.

“I am fluent in ASCII” [Warren 2010]



^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Tiny error in ARM 2012 ? (about characters)
  2011-01-24  8:36   ` Yannick Duchêne (Hibou57)
@ 2011-01-24  9:00     ` AdaMagica
  2011-01-24  9:37       ` Yannick Duchêne (Hibou57)
  2011-01-25  1:48     ` Randy Brukardt
  1 sibling, 1 reply; 11+ messages in thread
From: AdaMagica @ 2011-01-24  9:00 UTC (permalink / raw)


Can' see a problem:

(4/3)
... The only characters allowed outside of comments are those in
categories other_format, format_effector, and graphic_character.

(11/2)
separator_space
Any character whose General Category is defined to be “Separator,
Space”.

(14/2)
graphic_character
Any character that is not in the categories other_control,
other_private_use, other_surrogate, format_effector, and whose
relative code position in its plane is neither 16#FFFE# nor 16#FFFF

Thus separator_space is in graphic_character. Where's the problem?



^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Tiny error in ARM 2012 ? (about characters)
  2011-01-24  9:00     ` AdaMagica
@ 2011-01-24  9:37       ` Yannick Duchêne (Hibou57)
  0 siblings, 0 replies; 11+ messages in thread
From: Yannick Duchêne (Hibou57) @ 2011-01-24  9:37 UTC (permalink / raw)


Le Mon, 24 Jan 2011 10:00:15 +0100, AdaMagica  
<christoph.grein@eurocopter.com> a écrit:

> Can' see a problem:
>
> (4/3)
> ... The only characters allowed outside of comments are those in
> categories other_format, format_effector, and graphic_character.
>
> (11/2)
> separator_space
> Any character whose General Category is defined to be “Separator,
> Space”.
>
> (14/2)
> graphic_character
> Any character that is not in the categories other_control,
> other_private_use, other_surrogate, format_effector, and whose
> relative code position in its plane is neither 16#FFFE# nor 16#FFFF
>
> Thus separator_space is in graphic_character. Where's the problem?

You're right.

I was fooled by the name, could not think space was part of  
graphic-character (that's not I would call graphic), so did not nocited.

I feel better now, there is no error.

-- 
Si les chats miaulent et font autant de vocalises bizarres, c’est pas pour  
les chiens.

“I am fluent in ASCII” [Warren 2010]



^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Tiny error in ARM 2012 ? (about characters)
  2011-01-24  8:36   ` Yannick Duchêne (Hibou57)
  2011-01-24  9:00     ` AdaMagica
@ 2011-01-25  1:48     ` Randy Brukardt
  1 sibling, 0 replies; 11+ messages in thread
From: Randy Brukardt @ 2011-01-25  1:48 UTC (permalink / raw)


[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1: Type: text/plain, Size: 750 bytes --]

"Yannick Duch�ne (Hibou57)" <yannick_duchene@yahoo.fr> wrote in message 
news:op.vps0u50nule2fv@garhos...
Le Mon, 24 Jan 2011 09:17:21 +0100, Yannick Duch�ne (Hibou57)
<yannick_duchene@yahoo.fr> a �crit:
>> So why not simply remove 7.1/3 and add other_format in the list which 
>> forms separators in 3/2 ?
>
>May be an other_format is simply not enough as an explicit separator, 
>which is the purpose of 3/2 (don't remember what these code-points are, 
>will have a look-back at it).

Right. An "other_format" (like a soft-hyphen) is not considered a separator 
by itself.

                        Randy.



-- 
Si les chats miaulent et font autant de vocalises bizarres, c'est pas pour
les chiens.

"I am fluent in ASCII" [Warren 2010] 





^ permalink raw reply	[flat|nested] 11+ messages in thread

* Re: Tiny error in ARM 2012 ? (about characters)
  2011-01-23 23:43 Tiny error in ARM 2012 ? (about characters) Yannick Duchêne (Hibou57)
  2011-01-24  2:16 ` Jeffrey Carter
  2011-01-24  8:17 ` Yannick Duchêne (Hibou57)
@ 2011-01-25  1:52 ` Randy Brukardt
  2 siblings, 0 replies; 11+ messages in thread
From: Randy Brukardt @ 2011-01-25  1:52 UTC (permalink / raw)


[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1: Type: text/plain, Size: 854 bytes --]

>"Yannick Duch�ne (Hibou57)" <yannick_duchene@yahoo.fr> wrote in message 
>news:op.vpsb6piaule2fv@garhos...
Hillo,
>
>I do not know if I've missed something. I am currently adding an Ada 
>colorizer to an editor, and I decided to use the ARM 2012 instead of ARM 
>2005 (for fun). It appears the wording in 2.1 are not exactly the same any 
>more as in ARM 2005 (didn't suspected this section could be updated).
>
>Unless I've missed something, is appears to exclude the space in source 
>text.

>2.1 is here:
>http://www.ada-auth.org/standards/12rm/html/RM-2-1.html
>
>2.1(4/3) says
>> The only characters allowed outside of comments are those in
>> categories other_format, format_effector, and graphic_character.

Space is a graphic_character, of course. So are letters and numbers and so 
on. See 2.1(14/2).

                             Randy.





^ permalink raw reply	[flat|nested] 11+ messages in thread

end of thread, other threads:[~2011-01-25  1:52 UTC | newest]

Thread overview: 11+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2011-01-23 23:43 Tiny error in ARM 2012 ? (about characters) Yannick Duchêne (Hibou57)
2011-01-24  2:16 ` Jeffrey Carter
2011-01-24  2:32   ` Yannick Duchêne (Hibou57)
2011-01-24  8:17 ` Yannick Duchêne (Hibou57)
2011-01-24  8:36   ` Yannick Duchêne (Hibou57)
2011-01-24  9:00     ` AdaMagica
2011-01-24  9:37       ` Yannick Duchêne (Hibou57)
2011-01-25  1:48     ` Randy Brukardt
2011-01-25  1:52 ` Randy Brukardt
  -- strict thread matches above, loose matches on Subject: below --
2011-01-23 23:43 Yannick Duchêne (Hibou57)
2011-01-24  0:08 ` Yannick Duchêne (Hibou57)

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox