From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=unavailable autolearn_force=no version=3.4.4 Path: eternal-september.org!reader01.eternal-september.org!reader02.eternal-september.org!news.eternal-september.org!mx02.eternal-september.org!feeder.eternal-september.org!feeder.erje.net!1.eu.feeder.erje.net!newspeer1.nac.net!border2.nntp.dca1.giganews.com!nntp.giganews.com!enother.net!enother.net!peer02.iad.highwinds-media.com!news.highwinds-media.com!feed-me.highwinds-media.com!post02.iad.highwinds-media.com!news.flashnewsgroups.com-b7.4zTQh5tI3A!not-for-mail From: Stephen Leake Newsgroups: comp.lang.ada Subject: Re: OpenToken: Parsing Ada (subset)? References: <878uc3r2y6.fsf@adaheads.sparre-andersen.dk> <85twupvjxo.fsf@stephe-leake.org> <81ceb070-16fe-4578-a09a-eb11a2bbb664@googlegroups.com> <162zj7c2l0ykp$.1rxias18vby83.dlg@40tude.net> <856172bk80.fsf@stephe-leake.org> <26ccc147-7a15-48d7-8808-3248edfbf433@googlegroups.com> <85k2v3aeyv.fsf@stephe-leake.org> Date: Wed, 17 Jun 2015 12:44:17 -0500 Message-ID: <85lhfi8c26.fsf@stephe-leake.org> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/24.4 (windows-nt) Cancel-Lock: sha1:rCgjp+UQIVWbs9zIWADbFM9fU44= MIME-Version: 1.0 Content-Type: text/plain X-Complaints-To: abuse@flashnewsgroups.com Organization: FlashNewsgroups.com X-Trace: 89f485581b1f3e97f808412844 X-Received-Bytes: 1721 X-Received-Body-CRC: 3180495263 Xref: news.eternal-september.org comp.lang.ada:26360 Date: 2015-06-17T12:44:17-05:00 List-Id: "G.B." writes: > On 16.06.15 16:46, Stephen Leake wrote: >> Using the regular expression "'[^']'|''''" for CHARACTER_LITERAL > > ISO/IEC 10646 question: Is [^'] matching a single wide character? I don't think this is an ISO question; the Emacs regexp engine does not claim to match an ISO regexp spec (at least, not that I'm aware of). In practice, this does work for non-ASCII characters in Emacs, so the answer is "yes" for Emacs. I haven't tested this with Aflex yet; I don't think Aflex will work with non-ASCII text (one of the reasons I'm not planning to use it with Emacs). I suspect it would require a major rewrite to get Aflex to work with UTF-8 or some other non-ASCII representation. -- -- Stephe