From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on polar.synack.me X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=ham autolearn_force=no version=3.4.4 X-Google-Language: ENGLISH,ASCII-7-bit X-Google-Thread: 103376,5cb36983754f64da X-Google-Attributes: gid103376,public X-Google-ArrivalTime: 2004-04-17 11:17:04 PST Path: archiver1.google.com!news1.google.com!news.glorb.com!newsgate.cistron.nl!transit.news.xs4all.nl!195.241.76.212.MISMATCH!tiscali!transit1.news.tiscali.nl!dreader2.news.tiscali.nl!not-for-mail Sender: lbrenta@deuteronomy Newsgroups: comp.lang.ada Subject: Re: No call for Ada (was Re: Announcing new scripting/prototyping References: <20040206174017.7E84F4C4114@lovelace.ada-france.org> <54759e7e.0402071124.322ea376@posting.google.com> <406EB6D2.8030801@noplace.com> <87d66pyw1g.fsf@insalien.org> <406EEC35.7040109@noplace.com> <874qs0zvy1.fsf@insalien.org> <40714C98.90601@noplace.com> <1073gv22t969q5a@corp.supernews.com> <40729B9D.30906@noplace.com> <1076000ef5oj06f@corp.supernews.com> From: Ludovic Brenta Date: 17 Apr 2004 20:19:16 +0200 Message-ID: <87d666wjq3.fsf@insalien.org> User-Agent: Gnus/5.09 (Gnus v5.9.0) Emacs/21.3 MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Organization: Tiscali bv NNTP-Posting-Date: 17 Apr 2004 20:17:04 CEST NNTP-Posting-Host: 83.134.241.245 X-Trace: 1082225824 dreader2.news.tiscali.nl 41750 83.134.241.245:33602 X-Complaints-To: abuse@tiscali.nl Xref: archiver1.google.com comp.lang.ada:7269 Date: 2004-04-17T20:17:04+02:00 List-Id: Wes Groleau writes: > Robert I. Eachus wrote: > > You CAN imbed use of UTF-8 for Ada.Text_IO files in the form > > string. But -gnatWa (for various values of a) tells the compiler > > how to interpret the character set in the SOURCE file. That is a > > little difficult to do in the source file itself. > > Ada (in ARM-strict mode) cannot use byte order markers (BOM) to > determine (or partially determine) encodings? > > Would it be worthwhile for an AI to allow a file to start with a BOM > and then a comment whose contents are functionally similar to the > things an XML first line using to specify encoding? I don't think so. We are talking about source files here; Ada does not require any characters outside of ASCII. So, the only reason why you would want to encode source files as UTF-8 would be because you have comments or hardcoded strings that use non-ASCII characters. You can avoid comments with non-ASCII characters, and you should avoid hardcoded strings with non-ASCII characters. If you have strings meant for display to the end user, you should place them in a separate file and use gettext. So, even though it might sound like a good idea, I suspect that the benefits do not warrant a language change. -- Ludovic Brenta.