From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=-0.3 required=5.0 tests=BAYES_00, REPLYTO_WITHOUT_TO_CC autolearn=no autolearn_force=no version=3.4.4 X-Google-Thread: 103376,347f7cbe27451012 X-Google-Attributes: gid103376,public,usenet X-Google-Language: ENGLISH,ASCII Path: g2news1.google.com!news3.google.com!feeder3.cambrium.nl!feeder5.cambrium.nl!feed.tweaknews.nl!217.73.144.44.MISMATCH!ecngs!feeder.ecngs.de!news.osn.de!diablo2.news.osn.de!news.belwue.de!newsfeed.arcor.de!newsspool1.arcor-online.net!news.arcor.de.POSTED!not-for-mail From: "Dmitry A. Kazakov" Subject: Re: [ranting] Take Command Plugin, Win32Ada and Ada.Directories Newsgroups: comp.lang.ada User-Agent: 40tude_Dialog/2.0.15.1 MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 8bit Reply-To: mailbox@dmitry-kazakov.de Organization: cbb software GmbH References: <3343563.psPA6e9Oqh@linux1.krischik.com> <1pkwbr5s12cf9$.9k2umlcxqzbr.dlg@40tude.net> <473A1241.90702@obry.net> <1vrktz0rlauvu.1nm0xh8n2wl90.dlg@40tude.net> <473ab86d$1@news.post.ch> <19x2od13tuh2a$.1lpcr349f44yp$.dlg@40tude.net> <1220629.5SNnNGaTIf@linux1.krischik.com> Date: Thu, 15 Nov 2007 10:08:17 +0100 Message-ID: <1g63wx1m2rf3a.1bopw7zyeeyy6.dlg@40tude.net> NNTP-Posting-Date: 15 Nov 2007 10:01:14 CET NNTP-Posting-Host: 42eff4cf.newsspool2.arcor-online.net X-Trace: DXC=H`Q:`[;9n_TYI9]OHn9o5^A9EHlD;3YcR4Fo<]lROoRQ4nDHegD_]RUZec;=k@[_]YDNcfSJ;bb[UFCTGGVUmh?TLK[5LiR>kgR^aW@AZm_0Y\ X-Complaints-To: usenet-abuse@arcor.de Xref: g2news1.google.com comp.lang.ada:18412 Date: 2007-11-15T10:01:14+01:00 List-Id: On Wed, 14 Nov 2007 19:38:32 +0100, Martin Krischik wrote: > Dmitry A. Kazakov wrote: > >>> Not just Ada.Directories - any package where a file name is needed. >>> And not just Win32 - any platform which supports UTF-8 file names. >> >> Sure. >> >> One formal problem is that Character is specified as Latin-1. > > Sure - but then there is allways the BOM [1] 16#EFBBBF# would mark the > string UTF-8. But I guess it would be an enormous effort to support two > file name string types - one Latin-1 and another UTF-8 - and to find out > which one you got you need the check the first 3 characters. I would just drop Latin-1 and return back to 7-bit ASCII, requiring that all Strings passed to or received from outside were UTF-8. For legacy programs Latin-1 versions of Ada.Text_IO and Ada.Directories could be provided as Ada.Latin_1.Text_IO etc. (There is an issue with literals like '�'. I would make them all Wide_Character only and provide built-in conversions from Wide to String. Clearly all string types are convertible subtypes of some Universal_String with unlimited-length characters.) -- Regards, Dmitry A. Kazakov http://www.dmitry-kazakov.de