From: Georg Bauhaus
Newsgroups: comp.lang.ada
Subject: Re: The letter Sharp S and the English language
References: <514e1d12$0$6579$9b4e6d93@newsspool3.arcor-online.net> <792f8298-4502-40cf-acef-bda706555738@googlegroups.com>
Message-ID: <5150a9f2$0$6567$9b4e6d93@newsspool4.arcor-online.net>
Date:
2013-03-25T20:48:02+01:00

On 25.03.13 16:23, Adam Beneschan wrote:
> On Saturday, March 23, 2013 2:22:26 PM UTC-7, Georg Bauhaus wrote:
>> In case you remember a heated discussion of what ß is,
>> whether it is an S-Z ligature or an S-S, and how to (not)
>> downcase "ACCESS", more evidence comes from Ireland of 1759,
>> in the signature of Arthur Guinneß,
>>
>> http://home.arcor.de/bauhaus/Ada/GUINNESS.jpg
>
> That pretty clearly looks like two separate letters to me, although
> the two s's are in different styles.  But it isn't a ligature.  I'm
> not sure what your point is since I don't remember the original
> thread very well.

I'm investigating how Unicode-enabled Ada can help me "export"
street names to Switzerland. Thus,

   To_Upper ("Xyz-Straße");  -- String or Wide_String

What interests me is whether this might work in the future,
i.e. with Ada 2012, in the light of recent developments
of ISO/IEC 10646:

First, you'd typically not be writing 'ß' in Switzerland and
would instead replace every occurrence with "ss". That's for both
lower case and upper case (and also when using small caps).
So, To_Upper's definition won't help.

But! Since Ada 2005 there are two new twists. In 2008, ISO/IEC 10646
gained an official upper case character for 'ß', U+1E9E.
And in 2010, official spelling (read: government; "amtlich")
requires U+1E9E in geographical names. These include street names.

http://141.74.33.52/stagn/Portals/0/101125_TopR5.pdf

Currently, GNAT's implementation of
Ada.Wide_Characters.Handling.To_Upper gives Wide_Character'Val (223)
for To_Upper ('ß'), AFAICS.

Unicode's CaseFolding.txt, if applicable, has two lines pertaining
to the matter,

   1E9E; F; 0073 0073; # LATIN CAPITAL LETTER SHARP S
   1E9E; S; 00DF; # LATIN CAPITAL LETTER SHARP S

So I'm wondering if Simple Case Mapping might mean that
To_Upper (Wide_Character'('ß')) should return
Wide_Character'Val (16#1E9E#) in Ada 2012.
('ß' will thus continue to cause problems originating in web based
form entry fields and elsewhere, I'm almost sure. Just one out of
many experiences: a major fruit company's customer invoices
have consistently shown what looks like junk HTML right after
"Stra" in my address for years.)
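
As long as To_Upper leaves 'ß' alone, the two conventions mentioned above (Swiss "ss"/"SS" replacement, and U+1E9E for German geographical names) can be handled explicitly before calling To_Upper. What follows is only a minimal sketch under assumptions: the procedure Export_Street_Name, the type Convention and the function Upper_Name are hypothetical names made up for illustration, and nothing here is what the Ada standard or GNAT prescribes for 'ß'. Only Ada.Wide_Characters.Handling.To_Upper and Ada.Strings.Wide_Unbounded are taken from the language library.

```ada
with Ada.Wide_Text_IO;
with Ada.Wide_Characters.Handling;
with Ada.Strings.Wide_Unbounded;

procedure Export_Street_Name is

   use Ada.Strings.Wide_Unbounded;

   --  U+00DF, LATIN SMALL LETTER SHARP S
   Sharp_S         : constant Wide_Character := Wide_Character'Val (16#00DF#);
   --  U+1E9E, LATIN CAPITAL LETTER SHARP S
   Capital_Sharp_S : constant Wide_Character := Wide_Character'Val (16#1E9E#);

   --  Hypothetical: which spelling rule to apply to sharp s.
   type Convention is (Swiss, German_Geographical);

   --  Hypothetical helper: upper-cases Name character by character,
   --  treating sharp s explicitly instead of relying on To_Upper.
   function Upper_Name
     (Name : Wide_String;
      Rule : Convention) return Wide_String
   is
      Result : Unbounded_Wide_String;
   begin
      for I in Name'Range loop
         if Name (I) = Sharp_S then
            case Rule is
               when Swiss =>
                  --  Swiss usage: sharp s is written as double s.
                  Append (Result, Wide_String'("SS"));
               when German_Geographical =>
                  --  Assumption taken from the post: U+1E9E is wanted
                  --  in geographical names.
                  Append (Result, Capital_Sharp_S);
            end case;
         else
            Append
              (Result, Ada.Wide_Characters.Handling.To_Upper (Name (I)));
         end if;
      end loop;
      return To_Wide_String (Result);
   end Upper_Name;

   Name : constant Wide_String := "Xyz-Stra" & Sharp_S & "e";

begin
   Ada.Wide_Text_IO.Put_Line (Upper_Name (Name, Swiss));
   Ada.Wide_Text_IO.Put_Line (Upper_Name (Name, German_Geographical));
end Export_Street_Name;
```

The sharp s is built with Wide_Character'Val rather than typed as a literal, so the source file does not depend on the compiler's source encoding setting. How the second line appears on output depends on the encoding chosen for Wide_Text_IO, so no output is shown here.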