From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=unavailable autolearn_force=no version=3.4.4 Path: eternal-september.org!reader01.eternal-september.org!reader02.eternal-september.org!news.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail From: "J-P. Rosen" Newsgroups: comp.lang.ada Subject: Re: GNAT vs UTF-8 source file names Date: Wed, 5 Jul 2017 13:20:53 +0200 Organization: A noiseless patient Spider Message-ID: References: Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Injection-Date: Wed, 5 Jul 2017 11:17:09 -0000 (UTC) Injection-Info: mx02.eternal-september.org; posting-host="2a6b0de60cdda3f9e5bb4fe5a55cb412"; logging-data="27586"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX188MWsuuukmGic11OZX2qzI" User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.2.1 In-Reply-To: Content-Language: fr Cancel-Lock: sha1:dQUthwEPxf503ChtBFJVlec4p/U= Xref: news.eternal-september.org comp.lang.ada:47299 Date: 2017-07-05T13:20:53+02:00 List-Id: Le 05/07/2017 à 11:47, Simon Wright a écrit : > That reference specifies NFKC which I suppose is near! Not that near when it comes to ligatures and other crazy characters... But you are right, it's NFKC. > GNAT uses this if > either you compile with -gnatW8 or the file begins with a UTF8 BOM. Actually, this has nothing to do with encoding or coded character sets. Even if you use Latin-1, the set of allowed characters is defined as those that belong to NFKC. > The problems I've noted in this thread in the GNAT implementation are > two: > > (1) On Windows and macOS (and possibly on VMS, not sure if that's > relevant any more) the file name corresponding to a unit name is > converted to lower-case assuming it's Latin-1 - > System.Case_Util.To_Lower, I can talk about character issues since I gave that tutorial at AE'17... How operating systems manage that, I don't know. -- J-P. Rosen Adalog 2 rue du Docteur Lombard, 92441 Issy-les-Moulineaux CEDEX Tel: +33 1 45 29 21 52, Fax: +33 1 45 29 25 00 http://www.adalog.fr