From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=-0.3 required=5.0 tests=BAYES_00, REPLYTO_WITHOUT_TO_CC autolearn=no autolearn_force=no version=3.4.4 Path: eternal-september.org!reader01.eternal-september.org!reader02.eternal-september.org!news.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail From: "G.B." Newsgroups: comp.lang.ada Subject: Re: Bug in Ada - Latin 1 is not a subset of UTF-8 Date: Tue, 18 Oct 2016 01:25:18 +0200 Organization: A noiseless patient Spider Message-ID: References: <86f0d2fe-d498-4bc4-bb9d-e34629c89bb4@googlegroups.com> Reply-To: nonlegitur@futureapps.de Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Injection-Date: Mon, 17 Oct 2016 23:25:01 -0000 (UTC) Injection-Info: mx02.eternal-september.org; posting-host="33c7532313ceae98ec8560b57cd233ab"; logging-data="10768"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/QcrYz5OeO/QhVwkXsftpojJtP6T8wBPs=" User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.9; rv:45.0) Gecko/20100101 Thunderbird/45.4.0 In-Reply-To: <86f0d2fe-d498-4bc4-bb9d-e34629c89bb4@googlegroups.com> Cancel-Lock: sha1:xnYhEo/cefK8FuMx5mowv3j6jnA= Xref: news.eternal-september.org comp.lang.ada:32100 Date: 2016-10-18T01:25:18+02:00 List-Id: On 17.10.16 22:18, Lucretia wrote: > Hi, > > Whilst binding SDL_TTF function, I was going to Overload the TTF_Size* functions, but I couldn't do that because UTF_8_String is a subtype of String; String is Latin 1 and Latin 1 is not a subset of UTF-8, ASCII is. > > UTF_String should be implemented as an array like String and then UTF_8_String should be a subtype of UTF_String or a renaming, if that is the intent. > According to ISO 10646, UTF stands for UCS Transformation Format. So, it's a format, suggesting a representation. On similar grounds, one could define a string subtype for other types of objects, for example subtype Number_String is String; The components represent the bits of the octets of the numbers (base 256) in sequence, of whole numbers assumed to be phone numbers. Each phone number is headed by a plus sign. So, calling a taxi by telephone in Berlin, Dublin, or Ho Chi Minh City might be helped by turning the string "+^@^A%??+^@^H9^K-+^@^S??^C" into the respective numbers. The intent, I guess, of UTF_String and its kin is to facilitate reading and writing items of UCS. -- "HOTDOGS ARE NOT BOOKMARKS" Springfield Elementary teaching staff