From: Marius Amado-Alves
Subject: Re: System.WCh_Cnv
Date: Tue, 25 Jul 2006 14:03:21 +0100
Newsgroups: comp.lang.ada

> places where that might
> be needed, like text rendering, don't work on per
> code point basis anyway....

Exactly. And that is wrong, and I want to fix it.

> So I'm quite happy with UTF-8 and plain strings.

I am more or less happy with this too [1], but I think we can do better. With UTF-8 in strings the two abstractions (codepoints, encodings) are too entangled for my taste. Strictly speaking, you cannot use the standard string operations. I mean you can, but you must fiddle with the encodings, i.e. you are not searching for a codepoint but for a particular encoding of it. Instead I want to be able to write things like

   for I in Str'Range loop
      if Str (I) = Euro_Sign then
         ...
      end if;
   end loop;

I cannot do that with UTF-8 in strings. Note that Wide_Wide_String is of little help here, because of the endianness issue. But it might be a good idea to base Unico on Wide_Wide_String for closeness to the standard.

[1] What makes me happy about UTF-8 is that it seems to have become a de facto default, common-denominator encoding.
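
A minimal sketch of the codepoint-level loop, assuming an Ada 2012 compiler: Ada.Strings.UTF_Encoding.Wide_Wide_Strings.Decode (not part of Ada 2005) turns the UTF-8 bytes into codepoints held in memory as a Wide_Wide_String, so the comparison is made per codepoint rather than per encoded byte. The names Find_Euro and Encoded and the sample text are made up for illustration; this is not the Unico interface.

```ada
with Ada.Text_IO;
with Ada.Strings.UTF_Encoding.Wide_Wide_Strings;

procedure Find_Euro is

   --  U+20AC EURO SIGN, expressed as a codepoint, not as bytes.
   Euro_Sign : constant Wide_Wide_Character :=
     Wide_Wide_Character'Val (16#20AC#);

   --  Hypothetical sample text "5 " followed by the euro sign in UTF-8.
   --  In UTF-8 the euro sign is the three bytes E2 82 AC.
   Encoded : constant Ada.Strings.UTF_Encoding.UTF_8_String :=
     "5 " & Character'Val (16#E2#)
          & Character'Val (16#82#)
          & Character'Val (16#AC#);

   --  Decode once; from here on each element of Str is one codepoint.
   Str : constant Wide_Wide_String :=
     Ada.Strings.UTF_Encoding.Wide_Wide_Strings.Decode (Encoded);

begin
   for I in Str'Range loop
      if Str (I) = Euro_Sign then
         Ada.Text_IO.Put_Line
           ("Euro sign found at codepoint index" & Integer'Image (I));
      end if;
   end loop;
end Find_Euro;
```

The same loop over Encoded would have to look for the byte sequence E2 82 AC instead of a single element, which is the entanglement of codepoints and encodings described above.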