From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=-2.9 required=5.0 tests=BAYES_00,MAILING_LIST_MULTI autolearn=unavailable autolearn_force=no version=3.4.4 X-Google-Thread: 103376,43ab55a75a8b5d1 X-Google-Attributes: gid103376,public X-Google-Language: ENGLISH,ASCII-7-bit Path: g2news2.google.com!news4.google.com!border1.nntp.dca.giganews.com!nntp.giganews.com!nx01.iad01.newshosting.com!newshosting.com!newsfeed.icl.net!newsfeed.fjserv.net!newsfeed.icl.net!proxad.net!cleanfeed3-b.proxad.net!nnrp14-1.free.fr!not-for-mail Return-Path: In-Reply-To: References: Mime-Version: 1.0 (Apple Message framework v752.2) X-Priority: 3 (Normal) Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed Content-Transfer-Encoding: 7bit From: Marius Amado-Alves Subject: Re: System.WCh_Cnv Date: Tue, 25 Jul 2006 11:31:08 +0100 To: comp.lang.ada@ada-france.org X-Mailer: Apple Mail (2.752.2) X-Antivirus-Scanner: Clean mail though you should still use an Antivirus X-AntiAbuse: This header was added to track abuse, please include it with any abuse report X-AntiAbuse: Primary Hostname - zaphen.serversonline.net X-AntiAbuse: Original Domain - ada-france.org X-AntiAbuse: Originator/Caller UID/GID - [0 0] / [47 12] X-AntiAbuse: Sender Address Domain - amado-alves.info X-Source: X-Source-Args: X-Source-Dir: X-Virus-Scanned: amavisd-new at ada-france.org X-BeenThere: comp.lang.ada@ada-france.org X-Mailman-Version: 2.1.8rc1 Precedence: list List-Id: "Gateway to the comp.lang.ada Usenet newsgroup" List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Newsgroups: comp.lang.ada Message-ID: X-Leafnode-NNTP-Posting-Host: 88.191.17.134 Organization: Guest of ProXad - France NNTP-Posting-Date: 25 Jul 2006 12:35:24 MEST NNTP-Posting-Host: 88.191.14.223 X-Trace: 1153823724 nnrp14-1.free.fr 5998 88.191.14.223:52651 X-Complaints-To: abuse@proxad.net Xref: g2news2.google.com comp.lang.ada:5915 Date: 2006-07-25T12:35:24+02:00 >> Actually the Unicode codepoint range is 0 .. 10FFFF and therefore >> fits in 21 bits. > > ... the definition would allow expansion to 31-bits (but no > further). The definition of some particular *encoding* namely UCS-4. Not of the "character set" range. Character = codepoint. And this stops at 10FFFF. And it will not be extended. IIRC both Organizations went on record on this. Silly maybe, but not per se. It has to do with variable length encodings. It facilitates search and verification. Now these encodings may be a bit silly, yes. I have been sketching a highly simplified, short, clear, logical, understandable, usable, no nonsense, package for Unicode. I have not been making much progress for several reasons. If someone wants to join that would be great. The first lines of the spec follow. -- Unico : no nonsense Unicode support for Ada -- (C) 2006 Marius Amado Alves with Ada.Containers.Vectors; with Ada.Streams; package Unico is type Character is range 0 .. 16#10FFFF#; for Character'Size use 24; procedure Write (Stream : access Ada.Streams.Root_Stream_Type'Class; Item : in Character); procedure Read (Stream : access Ada.Streams.Root_Stream_Type'Class; Item : out Character); for Character'Write use Write; for Character'Read use Read; package Strings is new Ada.Containers.Vectors (Index_Type => Positive, Element_Type => Character); subtype String is Strings.Vector; type Fixed_String is array (Positive range <>) of Character; for Fixed_String'Component_Size use 24; pragma Pack (Fixed_String);