From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=ham autolearn_force=no version=3.4.4 X-Google-Thread: 103376,a82f86f344c98f79 X-Google-Attributes: gid103376,public X-Google-Language: ENGLISH,ASCII Date: Thu, 14 Sep 2006 10:11:53 +0200 From: Manuel Collado User-Agent: Thunderbird 1.5 (Windows/20051201) MIME-Version: 1.0 Newsgroups: comp.lang.ada Subject: Re: Avatox 1.0: Trouble with encoding in Windows References: <45051d37@news.upm.es> <45053aec$0$5142$9b4e6d93@newsspool1.arcor-online.net> <5ZednRK-0M3K15rYnZ2dnUVZ_o2dnZ2d@megapath.net> <4507de42@news.upm.es> In-Reply-To: Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 8bit NNTP-Posting-Host: 138.100.242.205 Message-ID: <45090ec8@news.upm.es> X-Trace: 14 Sep 2006 10:11:52 +0100, 138.100.242.205 Path: g2news2.google.com!news3.google.com!border1.nntp.dca.giganews.com!nntp.giganews.com!newsfeed00.sul.t-online.de!t-online.de!130.59.10.21.MISMATCH!kanaga.switch.ch!switch.ch!news.rediris.es!news.upm.es!138.100.242.205 Xref: g2news2.google.com comp.lang.ada:6580 Date: 2006-09-14T10:11:53+02:00 List-Id: Bj�rn Persson escribi�: > Manuel Collado wrote: >> Sorry, the use of [Wide_]Wide_Strings doesn't imply anything about >> encoding. > > ARM95 3.5.2(3) says: > "The predefined type Wide_Character is a character type whose values > correspond to the 65536 code positions of the ISO 10646 Basic > Multilingual Plane (BMP)." > > This is essentially unchanged in the draft Ada 2005 standard. Paragraph > 3.5.2(3/2) says: > "The predefined type Wide_Character is a character type whose values > correspond to the 65536 code positions of the ISO/IEC 10646:2003 Basic > Multilingual Plane (BMP)." > > And the next paragraph, 3.1/2, adds: > "The predefined type Wide_Wide_Character is a character type whose > values correspond to the 2147483648 code positions of the ISO/IEC > 10646:2003 character set." > > This means that a Wide_String is UCS-2LE on a little-endian machine and > UCS-2BE on a big-endian machine, and a Wide_Wide_String is UCS-4LE or > UCS-4BE. Thanks for the pointer. I've certainly missed that. -- Manuel Collado