From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=ham autolearn_force=no version=3.4.4 X-Google-Thread: 103376,e4abd14106db0029 X-Google-NewGroupId: yes X-Google-Attributes: gida07f3367d7,domainid0,public,usenet X-Google-Language: ENGLISH,UTF8 Path: g2news1.google.com!news3.google.com!feeder.news-service.com!85.214.198.2.MISMATCH!eternal-september.org!feeder.eternal-september.org!.POSTED!not-for-mail From: "J-P. Rosen" Newsgroups: comp.lang.ada Subject: Re: Ada 2012 and Unicode package (UTF-nn encodings handling) Date: Sat, 21 Aug 2010 09:01:53 +0200 Organization: Adalog Message-ID: References: Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Injection-Date: Sat, 21 Aug 2010 07:02:05 +0000 (UTC) Injection-Info: mx03.eternal-september.org; posting-host="z/xN1DBP8RUb+r9ug/i0hg"; logging-data="24188"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19jPmrP6oDu2ui3eLwwt5T+" User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; fr; rv:1.9.2.8) Gecko/20100802 Thunderbird/3.1.2 In-Reply-To: Cancel-Lock: sha1:fPg3S1D35s+j0F/begMP4KGohCI= Xref: g2news1.google.com comp.lang.ada:13565 Date: 2010-08-21T09:01:53+02:00 List-Id: Le 20/08/2010 23:38, Yannick DuchĂȘne (Hibou57) a Ă©crit : > Time for my stupid question of the day :) A question is never stupid. Answers sometimes... > I could not avoid two questions: why no UTF-32 ? (this would not be an > implementation nightmare) I still fail to see the benefit of encoding 31 bits values into 32 bits values... And even if implementation is not a nightmare, it always has a cost. Implementers are reluctant to spend money for features that nobody will use. (Wide_Wide_Character was forced on us by ISO). > and why BOM handled for each string while BOM > is to be used at stream/file level ? (see XML or HTML files for > example). A package provides functionnalities. It should not presume how it is used. Since this package is clearly in the "string handling" class, it makes sense to handle this with strings. For files, the usage is to have a BOM on the first line of the file. The way the functions are defined makes it easy to not process the first line specially; see the use case in the AI. -- --------------------------------------------------------- J-P. Rosen (rosen@adalog.fr) Visit Adalog's web site at http://www.adalog.fr