From: "J-P. Rosen"
Newsgroups: comp.lang.ada
Subject: Re: GNAT vs UTF-8 source file names
Date: Wed, 5 Jul 2017 07:21:31 +0200

On 04/07/2017 at 15:57, Simon Wright wrote:
> The reason for this apparently-bizarre message is[3] that macOS takes
> the composed form (lowercase a acute) and converts it under the hood
> to what HFS+ insists on, the fully decomposed form (lowercase a, combining
> acute); thus the names are actually different even though they _look_
> the same.

Apparently, they use NFD (Normalization Form D). Normalization forms are
necessary to avoid a whole lot of problems, but Ada requires Normalization
Form C (ARM 2.1 (4.1/3)), not NFD; more precisely, the semantics of a
program are implementation-defined if its text is not in NFC.

-- 
J-P. Rosen
Adalog
2 rue du Docteur Lombard, 92441 Issy-les-Moulineaux CEDEX
Tel: +33 1 45 29 21 52, Fax: +33 1 45 29 25 00
http://www.adalog.fr
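
To illustrate the composed versus decomposed forms discussed in this post, here is a minimal sketch in Python using the standard unicodedata module. It is not taken from the thread: the file name "páck.ads" and the helper same_name are hypothetical, chosen only to show that two names which look the same compare unequal until both are brought to one normalization form.

```python
import unicodedata

# Composed form: U+00E1 LATIN SMALL LETTER A WITH ACUTE (one code point).
composed = "\u00e1"
# Fully decomposed form: "a" followed by U+0301 COMBINING ACUTE ACCENT.
decomposed = "a\u0301"

# The two strings look the same on screen but differ as code point sequences.
names_differ = composed != decomposed

# NFD splits the precomposed character; NFC recombines the pair.
nfd_of_composed = unicodedata.normalize("NFD", composed)
nfc_of_decomposed = unicodedata.normalize("NFC", decomposed)


def same_name(a: str, b: str) -> bool:
    """Hypothetical helper: compare two names after normalizing both to NFC."""
    return unicodedata.normalize("NFC", a) == unicodedata.normalize("NFC", b)


print(names_differ)                     # True
print(nfd_of_composed == decomposed)    # True
print(nfc_of_decomposed == composed)    # True
# Hypothetical file name, written once in NFC and once in NFD.
print(same_name("p\u00e1ck.ads", "pa\u0301ck.ads"))  # True
```

Normalizing both sides before comparing is one possible way to reconcile a name typed in NFC with the NFD name a file system hands back; whether a given compiler or tool does this is not something the sketch claims.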