From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,FREEMAIL_FROM autolearn=ham autolearn_force=no version=3.4.4 X-Google-Language: ENGLISH,ASCII-7-bit X-Google-Thread: 103376,5cb36983754f64da X-Google-Attributes: gid103376,public X-Google-ArrivalTime: 2004-04-17 20:57:31 PST Path: archiver1.google.com!news1.google.com!news.glorb.com!wn13feed!worldnet.att.net!bgtnsc04-news.ops.worldnet.att.net.POSTED!not-for-mail From: David Starner Subject: Re: No call for Ada (was Re: Announcing new scripting/prototyping language) User-Agent: Pan/0.14.2.91 (As She Crawled Across the Table (Debian GNU/Linux)) Message-Id: Newsgroups: comp.lang.ada References: <20040206174017.7E84F4C4114@lovelace.ada-france.org> <54759e7e.0402071124.322ea376@posting.google.com> <406EB6D2.8030801@noplace.com> <87d66pyw1g.fsf@insalien.org> <406EEC35.7040109@noplace.com> <874qs0zvy1.fsf@insalien.org> <40714C98.90601@noplace.com> <1073gv22t969q5a@corp.supernews.com> <40729B9D.30906@noplace.com> <1076000ef5oj06f@corp.supernews.com> MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 8bit Date: Sun, 18 Apr 2004 03:57:31 GMT NNTP-Posting-Host: 12.72.68.158 X-Complaints-To: abuse@worldnet.att.net X-Trace: bgtnsc04-news.ops.worldnet.att.net 1082260651 12.72.68.158 (Sun, 18 Apr 2004 03:57:31 GMT) NNTP-Posting-Date: Sun, 18 Apr 2004 03:57:31 GMT Organization: AT&T Worldnet Xref: archiver1.google.com comp.lang.ada:7296 Date: 2004-04-18T03:57:31+00:00 List-Id: On Sat, 17 Apr 2004 20:07:32 -0400, Robert I. Eachus wrote: > David Starner wrote: > >> Why would you think that? HTML and XML does it. Unless you're dealing with >> EBCDIC, you have two encodings to deal with in order to reach a "pragma >> charset ("UTF-8");": ASCII and UTF-16, which are trivial to distinguish. > > But Shift-JIS and EUC can't be distinguished that way. Sure they can. Shift-JIS and EUC just look like ASCII until you start sticking in non-ASCII characters. If you put the pragma charset ("Shift-JIS"); before any Japanese characters, the compiler will have no problem reading it. Again this is how HTML does it; do you really need more variety of character sets in source code than HTML or XML? > Then don't use it. Again, it is the intended audience. GNAT tries to > allow itself to be adapted to any existing environment, including > character sets. In say, a Shift-JIS environment, almost all files will > be written in Shift-JIS. I maintain one Ada program for open distribution. I don't know whether the users are using it in a EUC-JIS environment or ISO-8859-1 environment or KOI8-R environment or a UTF-8 environment. Almost any program suitable for distribution with a Linux distribution is going to have to accept that as reality and react appropriately. (I personally deal with nothing more complex than filenames, and hence escape most of the pain.)