From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=-0.3 required=5.0 tests=BAYES_00, REPLYTO_WITHOUT_TO_CC autolearn=no autolearn_force=no version=3.4.4 Path: eternal-september.org!reader01.eternal-september.org!reader02.eternal-september.org!news.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail From: "G.B." Newsgroups: comp.lang.ada Subject: Re: Bug in Ada - Latin 1 is not a subset of UTF-8 Date: Tue, 18 Oct 2016 19:35:32 +0200 Organization: A noiseless patient Spider Message-ID: References: <86f0d2fe-d498-4bc4-bb9d-e34629c89bb4@googlegroups.com> Reply-To: nonlegitur@futureapps.de Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Injection-Date: Tue, 18 Oct 2016 17:35:14 -0000 (UTC) Injection-Info: mx02.eternal-september.org; posting-host="f28d4d9e002b08ad13f896787673606f"; logging-data="7643"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+CArXFJ7DngLSn/BvOUjMoPMskd/q6SK0=" User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.9; rv:45.0) Gecko/20100101 Thunderbird/45.4.0 In-Reply-To: Cancel-Lock: sha1:i6EW2qrTT9j3F/n8HmQlywQN/qI= Xref: news.eternal-september.org comp.lang.ada:32125 Date: 2016-10-18T19:35:32+02:00 List-Id: On 18.10.16 18:35, Dmitry A. Kazakov wrote: > No invariant can make Latin-1 A-umlaut UTF-8 A-umlaut. Who would ever want to do that? Before I/O, there is nothing. UTF_8_String is for encoding and decoding subprograms of Ada. For them to be successful, a predicate could be used to express the set of values that can be parsed. It so happens that its members are officially said to be in encoded form. To get a subset U from a set S, you apply a constraint to S. That's not (easily) expressible in Ada in this case. But if it is, with the help of a predicate, the we can say that UTF_8_String is-a "constrained" String because their sets are.