From: "Dmitry A. Kazakov"
Subject: Re: Wide_[Wide_]Character
Newsgroups: comp.lang.ada
Date: Sat, 12 Jul 2008 10:11:38 +0200
Organization: cbb software GmbH
Reply-To: mailbox@dmitry-kazakov.de

On Sat, 12 Jul 2008 07:44:38 GMT, Dale Stanbrough wrote:

> Unicode can be represented using UTF-8, UTF-16 and UTF-32 (amongst
> others).
>
> I gather that Character is simply ISO-8859-1 (Latin-1).
>
> I suspect that Wide_Character is UCS-2 (simple 2 byte values, no escapes
> like UTF-16).
>
> Is Wide_Wide_Character
>
> * UTF-16
> * UTF-32 (i.e. UCS-4)
> * System dependent
> * Something else

RM 3.5.2 talks about "code positions" (= code points, I guess), represented
by Wide_Wide_Character. From this I conclude that it shall be UCS-4 with
some implementation-defined endianness.

-- 
Regards,
Dmitry A. Kazakov
http://www.dmitry-kazakov.de
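
To illustrate the distinction the question draws between UTF-16 and UTF-32 (UCS-4), and the endianness remark in the reply, here is a minimal sketch. It is written in Python rather than Ada and says nothing about how any Ada compiler stores Wide_Wide_Character. The helper name code_units and the two sample characters are chosen only for this illustration.

```python
# Illustration only: how many code units one code point needs in each
# encoding form, and how byte order changes the UTF-32 byte sequence.

def code_units(ch: str) -> dict:
    """Return the number of code units used for a single character."""
    return {
        "utf-8": len(ch.encode("utf-8")),            # 8-bit units
        "utf-16": len(ch.encode("utf-16-be")) // 2,  # 16-bit units
        "utf-32": len(ch.encode("utf-32-be")) // 4,  # 32-bit units
    }

bmp = "\u00e9"          # U+00E9, inside the Basic Multilingual Plane
astral = "\U0001d11e"   # U+1D11E, outside the BMP

print(code_units(bmp))     # one 16-bit unit is enough
print(code_units(astral))  # UTF-16 needs a surrogate pair, UTF-32 does not

# Same code point, two byte orders: the value is identical, the bytes differ.
print(astral.encode("utf-32-be").hex())
print(astral.encode("utf-32-le").hex())
```

A fixed-width 32-bit form always uses exactly one unit per code point, which is the property that makes "UCS-4 with some endianness" a natural reading of a type defined in terms of code positions.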