From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,FREEMAIL_FROM autolearn=ham autolearn_force=no version=3.4.4 X-Google-Language: ENGLISH,ASCII-7-bit X-Google-Thread: 103376,88ed72d98e6b3457 X-Google-Attributes: gid103376,public X-Google-ArrivalTime: 2003-10-04 18:38:33 PST Path: archiver1.google.com!news2.google.com!newsfeed.stanford.edu!headwall.stanford.edu!newshub.sdsu.edu!elnk-nf2-pas!newsfeed.earthlink.net!wn14feed!worldnet.att.net!204.127.198.203!attbi_feed3!attbi_feed4!attbi.com!rwcrnsc51.ops.asp.att.net.POSTED!not-for-mail Message-ID: <3F7F760E.2020901@comcast.net> From: "Robert I. Eachus" User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.0; en-US; rv:1.0.2) Gecko/20021120 Netscape/7.01 X-Accept-Language: en-us, en MIME-Version: 1.0 Newsgroups: comp.lang.ada Subject: Re: Standard Library Interest? References: Content-Type: text/plain; charset=us-ascii; format=flowed Content-Transfer-Encoding: 7bit NNTP-Posting-Host: 24.34.139.183 X-Complaints-To: abuse@comcast.net X-Trace: rwcrnsc51.ops.asp.att.net 1065317912 24.34.139.183 (Sun, 05 Oct 2003 01:38:32 GMT) NNTP-Posting-Date: Sun, 05 Oct 2003 01:38:32 GMT Organization: Comcast Online Date: Sun, 05 Oct 2003 01:38:32 GMT Xref: archiver1.google.com comp.lang.ada:244 Date: 2003-10-05T01:38:32+00:00 List-Id: chris wrote: > XML and Unicode support don't quite fit within the language standard > because they change more frequently, but IMO it'd be nice to have some > standardised support for both of these things. Note with unicode I am > thinking about things like normalisation, etc. Huh? Read the RM: The predefined type Wide_Character is a character type whose values correspond to the 65536 code positions of the ISO 10646 Basic Multilingual Plane (BMP). Each of the graphic characters of the BMP has a corresponding character_literal in Wide_Character. The first 256 values of Wide_Character have the same character_literal or language-defined name as defined for Character. The last 2 values of Wide_Character correspond to the nongraphic positions FFFE and FFFF of the BMP, and are assigned the language-defined names FFFE and FFFF. As with the other language-defined names for nongraphic characters, the names FFFE and FFFF are usable only with the attributes (Wide_)Image and (Wide_)Value; they are not usable as enumeration literals. All other values of Wide_Character are considered graphic characters, and have a corresponding character_literal. The BMP character set is identical to Unicode. There are additional functions which are part of the Unicode definition. Some of them can be found in A.3.2 and A.4.7. If there is something you think should be added, let me know. (I already have a "little list" for Ada 0Y.) Also, don't get misled by some people (including me! ;-) saying that the Ada standard has a slow revision cycle. Technically, the standard is considered for revision every five years, but in practice if there are problems, the ARG will fix them as soon as possible. Unicode/ISO10646 is a perfect case in point. I could tell you the (highly boring) changes that have been made in ISO 10646 and Unicode since they converged. (If you are really interested, look here: http://www.unicode.org/versions/Unicode4.0.0/ Just to give you a flavor: "1,226 new character assignments were made to the Unicode Standard, Version 4.0 (over and above what was in Unicode 3.2). These additions include currency symbols, additional Latin and Cyrillic characters, the Limbu and Tai Le scripts; Yijing Hexagram symbols, Khmer symbols, Linear B syllables and ideograms, Cypriot, Ugaritic, and a new block of variation selectors (especially for future CJK variants). Double diacritic characters were added for dictionary use." -- Robert I. Eachus "Quality is the Buddha. Quality is scientific reality. Quality is the goal of Art. It remains to work these concepts into a practical, down-to-earth context, and for this there is nothing more practical or down-to-earth than what I have been talking about all along...the repair of an old motorcycle." -- from Zen and the Art of Motorcycle Maintenance by Robert Pirsig