From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,FREEMAIL_FROM autolearn=ham autolearn_force=no version=3.4.4 X-Google-Language: ENGLISH,ASCII-7-bit X-Google-Thread: 103376,bcb6f63419c2a56b X-Google-Attributes: gid103376,public Path: controlnews3.google.com!news2.google.com!news.maxwell.syr.edu!feed1.news.rcn.net!rcn!wn13feed!worldnet.att.net!bgtnsc04-news.ops.worldnet.att.net.POSTED!not-for-mail From: David Starner Subject: RE: Supporting full Unicode User-Agent: Pan/0.14.2.91 (As She Crawled Across the Table (Debian GNU/Linux)) Message-Id: Newsgroups: comp.lang.ada References: <6620921.XHr2C6C0OH@linux1.krischik.com> MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 8bit Date: Wed, 12 May 2004 18:24:44 GMT NNTP-Posting-Host: 12.72.70.249 X-Complaints-To: abuse@worldnet.att.net X-Trace: bgtnsc04-news.ops.worldnet.att.net 1084386284 12.72.70.249 (Wed, 12 May 2004 18:24:44 GMT) NNTP-Posting-Date: Wed, 12 May 2004 18:24:44 GMT Organization: AT&T Worldnet Xref: controlnews3.google.com comp.lang.ada:502 Date: 2004-05-12T18:24:44+00:00 List-Id: On Wed, 12 May 2004 16:34:34 +0200, Martin Krischik wrote: > amado.alves wrote: > >> >> You made me go for the book to see if this had changed. It has not. The >> codespace is 24 bit. Actually it's 21. >> >> "In the Unicode Standard, the codespace consists of the integers from 0 to >> 10FFFF..." >> >> (The Unicode Standard, Version 4.0.1, section 2.4) > > If you mean the currently used code space then yes. But they extend the > codepace from time to time No, they don't. They extended it once, in the Unicode/ISO-10646 merge in 1996, as well as moving characters around. There is no plan to do either ever again. There's no reason to extend the code space, either. There's over a million code points, and in a decade they've managed to fill about a hundred thousand of them. The Unicode roadmaps have space blocked out for every character they think they might want to encode, from Rongorong to Egyptian Hieroglyphics. All of it fits in three planes of 65,536 characters with plenty of room to spare on one of them. It's though that Chinese character might exceed their current plane, and need a new one. Even with two planes for Chinese, there's still only four planes in use, plus two planes permanently dedicated to Private Use characters, leaving 650,000+ characters with no conceivable use. Short of extraterrestrials, there's no point in extending the code space.