From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=-0.3 required=5.0 tests=BAYES_00, REPLYTO_WITHOUT_TO_CC autolearn=no autolearn_force=no version=3.4.4 X-Google-Thread: 103376,5bcc293dc5642650 X-Google-NewGroupId: yes X-Google-Attributes: gida07f3367d7,domainid0,public,usenet X-Google-Language: ENGLISH,ASCII-7-bit Received: by 10.68.33.161 with SMTP id s1mr16873363pbi.2.1318667905908; Sat, 15 Oct 2011 01:38:25 -0700 (PDT) Path: d5ni10080pbc.0!nntp.google.com!news2.google.com!goblin2!goblin.stu.neva.ru!aioe.org!.POSTED!not-for-mail From: "Dmitry A. Kazakov" Newsgroups: comp.lang.ada Subject: Re: Why no Ada.Wide_Directories? Date: Sat, 15 Oct 2011 10:38:23 +0200 Organization: cbb software GmbH Message-ID: References: <9937871.172.1318575525468.JavaMail.geo-discussion-forums@prib32> <418b8140-fafb-442f-b91c-e22cc47f8adb@y22g2000pri.googlegroups.com> Reply-To: mailbox@dmitry-kazakov.de NNTP-Posting-Host: EMY6V9w2JsuJ/8EEiAFEEw.user.speranza.aioe.org Mime-Version: 1.0 X-Complaints-To: abuse@aioe.org User-Agent: 40tude_Dialog/2.0.15.1 X-Notice: Filtered by postfilter v. 0.8.2 Xref: news2.google.com comp.lang.ada:13958 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Date: 2011-10-15T10:38:23+02:00 List-Id: On Fri, 14 Oct 2011 18:06:05 -0700 (PDT), ytomino wrote: > In RM 3.5.2, Ada's Character/String types are not UTF-8 but Latin-1 > (except Ada.Strings.UTF_Encoding). > I'm afraid that is violation of the standard even if the > implementation accepts UTF-8. The same applies to Wide_String, which is UCS-2 not UTF-16. Implementations pretending otherwise are wrong. For that matter Windows xW calls are UTF-16. Passing Wide_String there is wrong. > Of course, I think that the standard is impractical, too. There are two problems with the standard: 1. It does not define strings and characters in terms of a code point type to be consistent with Unicode; 2. It does not provide automatic conversions between character/string types, because of the problem #1, and because the Ada type system is too weak for that. Clearly file operations, directory operations, character maps should be defined using code points rather than characters. There should be only one instance of each operation/package independent on the encoding and the combinations of encodings. -- Regards, Dmitry A. Kazakov http://www.dmitry-kazakov.de