From: Jean-Marc Bourguet
Newsgroups: comp.lang.ada
Subject: Re: config files proposal
Date: 03 Jun 2002 18:24:57 +0200
Message-ID: <3cfb9866@news.cadence.com>
References: <3CFA8E42.B7844253@san.rr.com> <4519e058.0206030656.34c424ff@posting.google.com> <3CFB94A7.A455B8DD@san.rr.com>

Darren New writes:

> While it's admirable to ignore the case of the letters, I think trying to
> implement a general "to_lower" that'll handle Unicode properly is going to
> be more of a nightmare than writing the entire rest of the config file
> parser. Unless you restrict the keys to US-ASCII, that is.

Well, Unicode defines normalization forms (4 of them: x2 for composed
versus decomposed, x2 for determining whether things like "superscript 2"
and 2 are equivalent). Case folding is a separate step on top of these.
And you are right, case-insensitive comparison is dependent on the locale.
But even for case-sensitive comparisons, normalizing to one of these forms
is needed, as some characters have multiple representations (for example,
a precomposed "e with acute" versus "e" followed by a combining accent).

Yours,

--
Jean-Marc
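
A minimal sketch of the comparison issue discussed above, in Python, since the post names no library. It uses the standard unicodedata module; the helper names normalized_key and same_key are hypothetical, and the choice of NFKC as the default form is an assumption, not something the thread settles. Note that str.casefold() is locale-independent, so locale-specific rules (such as Turkish dotted and dotless i) are not handled here.

```python
import unicodedata


def normalized_key(key: str, form: str = "NFKC") -> str:
    """Return a comparison key: normalize, case-fold, then normalize again.

    The second normalization is there because case folding can produce
    text that is no longer in normalized form.
    """
    folded = unicodedata.normalize(form, key).casefold()
    return unicodedata.normalize(form, folded)


def same_key(a: str, b: str, form: str = "NFKC") -> bool:
    """Compare two config keys, ignoring case and representation."""
    return normalized_key(a, form) == normalized_key(b, form)


composed = "caf\u00e9"      # precomposed "e with acute"
decomposed = "cafe\u0301"   # "e" followed by a combining acute accent

# The two strings display the same but differ code point by code point.
print(composed == decomposed)  # False

# Canonical normalization makes even a case-sensitive comparison work.
nfc_equal = (unicodedata.normalize("NFC", composed)
             == unicodedata.normalize("NFC", decomposed))
print(nfc_equal)  # True

# Only the compatibility forms treat "superscript 2" as a plain 2.
print(unicodedata.normalize("NFC", "x\u00b2") == "x2")   # False
print(unicodedata.normalize("NFKC", "x\u00b2") == "x2")  # True

# Case-insensitive comparison using full case folding.
print(same_key("Stra\u00dfe", "STRASSE"))  # True
```

Whether config keys should use a canonical form (NFC/NFD) or a compatibility form (NFKC/NFKD) is a design choice: the compatibility forms merge more look-alike keys but also discard distinctions some users may want to keep.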