From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,FREEMAIL_FROM autolearn=unavailable autolearn_force=no version=3.4.4 X-Received: by 10.129.92.198 with SMTP id q189mr11788025ywb.40.1447301182358; Wed, 11 Nov 2015 20:06:22 -0800 (PST) X-Received: by 10.182.191.5 with SMTP id gu5mr128038obc.20.1447301182325; Wed, 11 Nov 2015 20:06:22 -0800 (PST) Path: eternal-september.org!reader01.eternal-september.org!reader02.eternal-september.org!news.eternal-september.org!mx02.eternal-september.org!feeder.eternal-september.org!feeder.erje.net!us.feeder.erje.net!enother.net!enother.net!peer02.iad.highwinds-media.com!news.highwinds-media.com!feed-me.highwinds-media.com!f78no73852qge.1!news-out.google.com!f6ni1737igq.0!nntp.google.com!i2no2046381igv.0!postnews.google.com!glegroupsg2000goo.googlegroups.com!not-for-mail Newsgroups: comp.lang.ada Date: Wed, 11 Nov 2015 20:06:22 -0800 (PST) Complaints-To: groups-abuse@google.com Injection-Info: glegroupsg2000goo.googlegroups.com; posting-host=174.28.149.7; posting-account=lJ3JNwoAAAAQfH3VV9vttJLkThaxtTfC NNTP-Posting-Host: 174.28.149.7 User-Agent: G2/1.0 MIME-Version: 1.0 Message-ID: <00aab01c-7d18-408a-9a4c-feb80ac9a1e1@googlegroups.com> Subject: Unicode string comparision functions From: Shark8 Injection-Date: Thu, 12 Nov 2015 04:06:22 +0000 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Received-Bytes: 1698 X-Received-Body-CRC: 1335702826 Xref: news.eternal-september.org comp.lang.ada:28315 Date: 2015-11-11T20:06:22-08:00 List-Id: I thought I had come across a unicode Equals_Case_Insensitive (and less tha= n) for unicode using Wide_Wide_Strings some time ago, but I cannot seem to = find them again; am I misremembering, or were they in a really odd place? For this particular application I would rather use Wide_Wide_String than Wi= de_String so I wouldn't have to worry about invalid character [sequences] f= or the non-ASCII characters. (And, while UTF-8 encoded strings have the nic= e property of being endian agnostic, they still have that property.) -- But= I suppose the main thing is to have a good case insensitive compare such t= hat PRUSSIAN and Pru=DFian are considered equal. Thanks.