From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=ham autolearn_force=no version=3.4.4 X-Google-Thread: 103376,e4abd14106db0029 X-Google-NewGroupId: yes X-Google-Attributes: gida07f3367d7,domainid0,public,usenet X-Google-Language: ENGLISH,UTF8 Path: g2news1.google.com!news1.google.com!news.glorb.com!eternal-september.org!feeder.eternal-september.org!.POSTED!not-for-mail From: "J-P. Rosen" Newsgroups: comp.lang.ada Subject: Re: Ada 2012 and Unicode package (UTF-nn encodings handling) Date: Sun, 22 Aug 2010 22:40:37 +0200 Organization: Adalog Message-ID: References: <4c717f18$0$7652$9b4e6d93@newsspool1.arcor-online.net> Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Injection-Date: Sun, 22 Aug 2010 20:40:39 +0000 (UTC) Injection-Info: mx03.eternal-september.org; posting-host="z/xN1DBP8RUb+r9ug/i0hg"; logging-data="22965"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19IWVC+yv5JJEwN39iZjqJU" User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; fr; rv:1.9.2.8) Gecko/20100802 Thunderbird/3.1.2 In-Reply-To: <4c717f18$0$7652$9b4e6d93@newsspool1.arcor-online.net> Cancel-Lock: sha1:L+U2qIYdBVBhj1k3dXBh+izHTWQ= Xref: g2news1.google.com comp.lang.ada:13635 Date: 2010-08-22T22:40:37+02:00 List-Id: Le 22/08/2010 21:48, Georg Bauhaus a écrit : > On 8/22/10 8:51 PM, J-P. Rosen wrote: > >> I think you missed the "Encoding" function. The intended usage >> (extracted from the !discussion section) is: >> 1) Read the first line. Call function Encoding on that line with an >> appropriate default to use if the line does not start with a >> BOM. Initialize the encoding scheme to the value returned by the >> function. > > Since Ada is an ISO language, is the name BOM for the non-UTF-8 > thing used by Microsoft actually ISO? (I.e., has it become part of ISO > 10646)? > It's from Unicode. ISO 10646 defines only character encodings (code-points). Unicode uses the same encodings, and in addition defines UTF-8 and siblings. -- --------------------------------------------------------- J-P. Rosen (rosen@adalog.fr) Visit Adalog's web site at http://www.adalog.fr