From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,FREEMAIL_FROM autolearn=unavailable autolearn_force=no version=3.4.4 X-Received: by 10.224.126.137 with SMTP id c9mr9897381qas.2.1380302953530; Fri, 27 Sep 2013 10:29:13 -0700 (PDT) X-Received: by 10.50.108.114 with SMTP id hj18mr139126igb.4.1380302953362; Fri, 27 Sep 2013 10:29:13 -0700 (PDT) Path: border1.nntp.dca3.giganews.com!border2.nntp.dca3.giganews.com!border4.nntp.dca.giganews.com!border2.nntp.dca.giganews.com!nntp.giganews.com!news.snarked.org!news.glorb.com!q9no71531qas.0!news-out.google.com!9ni469qaf.0!nntp.google.com!ek7no125213qab.0!postnews.google.com!glegroupsg2000goo.googlegroups.com!not-for-mail Newsgroups: comp.lang.ada Date: Fri, 27 Sep 2013 10:29:12 -0700 (PDT) In-Reply-To: Complaints-To: groups-abuse@google.com Injection-Info: glegroupsg2000goo.googlegroups.com; posting-host=81.157.175.250; posting-account=pmkN8QoAAAAtIhXRUfydb0SCISnwaeyg NNTP-Posting-Host: 81.157.175.250 References: User-Agent: G2/1.0 MIME-Version: 1.0 Message-ID: Subject: Re: Information Request. From: Austin Obyrne Injection-Date: Fri, 27 Sep 2013 17:29:13 +0000 Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable X-Original-Bytes: 3392 Xref: number.nntp.dca.giganews.com comp.lang.ada:183490 Date: 2013-09-27T10:29:12-07:00 List-Id: On Friday, September 27, 2013 4:53:46 PM UTC+1, Shark8 wrote: > On Friday, September 27, 2013 3:45:27 AM UTC-6, Austin Obyrne wrote: > I= =92m Sure Somebody Knows This. > > Question: > > Unicode is a splendid solu= tion to a huge problem but how does it work in practice. Is it? The problem= of "combining characters" seems to me to defeat that if there is any way t= o display the same character with different combining-sequences/codepoints.= (e.g. "overstriking" w/ combining characters |S and S| to yield $ [just an= example from my 105-key keyboard].) This is to say that there should be on= ly one combination of combining-character to yield a glyph. Another problem= is casing. (e.g. how do you handle '=DF'? Does "ss" ever become '=DF'? Doe= s '=DF' ever become 'S'?) While there has been a lot of work on the Unicode= standard, I'm not sure that (a) the majority of Unicode *users* understand= the vision that the Unicode standard embodies, and (b) the Unicode standar= d has as clear an insight into matters as, say, the ARG has into Ada. (Afte= r all, Unicode is aiming to be a universal character-encoding scheme, and t= hat requires an insight into *all* languages and their writing-systems, whi= le the ARG is focused only on a single artificial/constructed language.) > = I would be very grateful to anybody who can shed light on how it is done in= such cases especially from anybody with hands-on experience. Well, you cou= ld try starting here: http://www.joelonsoftware.com/articles/Unicode.html Thanks for that but the link was far too technical and was aimed at progra= mmers while I am talking about the much simpler person who is a non-special= ist keyboard operator. My question is how is the interface with CJK provided =96 there must be som= ebody some where who converts the CJK into Unicode hexadecimal code points = for understanding by the keyboard person other wise she cannot handle it. Many thanks - Austin