From mboxrd@z Thu Jan 1 00:00:00 1970
X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me
X-Spam-Level: *
X-Spam-Status: No, score=1.3 required=5.0 tests=BAYES_00,INVALID_MSGID,
	MSGID_RANDY autolearn=no autolearn_force=no version=3.4.4
X-Google-Language: ENGLISH,ASCII-7-bit
X-Google-Thread: 103376,36d7c374b100245d
X-Google-Attributes: gid103376,public
From: Robert Dewar
Subject: Re: Wide_String, Chinese & Japanese text files
Date: 1999/08/22
Message-ID: <7ppecr$jfj$1@nnrp1.deja.com>#1/1
X-Deja-AN: 515849334
References: <7pka2j$lnn$1@front2.grolier.fr> <7pmcir$l62$1@nnrp1.deja.com> <7pmitf$r71$1@front3.grolier.fr>
X-Http-Proxy: 1.0 x34.deja.com:80 (Squid/1.1.22) for client 166.72.71.31
Organization: Deja.com - Share what you know. Learn what you don't.
X-Article-Creation-Date: Sun Aug 22 18:09:37 1999 GMT
X-MyDeja-Info: XMYDJUIDrobert_dewar
Newsgroups: comp.lang.ada
X-Http-User-Agent: Mozilla/4.04 [en] (OS/2; I)
Date: 1999-08-22T00:00:00+00:00
List-Id:

In article <7pmitf$r71$1@front3.grolier.fr>,
  "Thierry Lelegard" wrote:
> Does anyone have some pointers to these standards and to some
> simple free text utilities which can create a few sample
> text files with a US or European keyboard on UNIX (for
> test purpose, not production of course).

One thing to realize here is that, unlike the typical situation
with 8-bit codes, there are two separate things to worry about:

1. The encoding of each character into its 16-bit value

2. The manner in which 16-bit values are encoded, typically
   into a stream of 8-bit bytes.

Ada has a lot to say about 1, but nothing at all to say about 2.
In particular you cannot look at an encoding standard that gives
the 16-bit codes and then ask for a sample text file, because
a sample text file is about 2 rather than 1.
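
To make the distinction between 1 and 2 concrete, here is a small
sketch. It is written in Python purely for illustration (it is not
Ada, and it says nothing about what any Ada compiler does). It
assumes Unicode as the 16-bit character set, which the post does not
specify, and the two sample characters and the name to_byte_stream
are hypothetical choices made for the example.

```python
# Hypothetical sample text: two CJK characters, U+65E5 and U+672C.
# Unicode is assumed here as the 16-bit character set.
text = "\u65e5\u672c"

# Point 1: each character has a 16-bit code value.
code_values = [ord(ch) for ch in text]


def to_byte_stream(values, byteorder):
    """Point 2: write each 16-bit value as two 8-bit bytes.

    byteorder is "big" or "little"; the choice is not determined
    by the 16-bit codes themselves.
    """
    return b"".join(v.to_bytes(2, byteorder) for v in values)


# Three different byte streams for the same two code values.
streams = {
    "16-bit, high byte first": to_byte_stream(code_values, "big"),
    "16-bit, low byte first": to_byte_stream(code_values, "little"),
    "UTF-8": text.encode("utf-8"),
}

print("code values:", " ".join(f"{v:04X}" for v in code_values))
for name, data in streams.items():
    print(f"{name:24}", " ".join(f"{b:02X}" for b in data))
```

All three byte streams carry the same two 16-bit values, yet the
files on disk differ byte for byte, so a "sample text file" only
makes sense once the byte-level scheme has been chosen as well.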