From: Nicolas Paul Colin de Glocester <Spamassassin@irrt.De>
Subject: Re: Ada 202x; 2022; and 2012 and Unicode package (UTF-nn encodings handling)
Date: Tue, 2 Sep 2025 19:40:07 +0200 [thread overview]
Message-ID: <4248d82b-d759-b5d2-aa45-e4c0e26c81e5@insomnia247.nl> (raw)
In-Reply-To: <10974d1$jn0e$1@dont-email.me>
[-- Attachment #1: Type: text/plain, Size: 5113 bytes --]
Alex // nytpu wrote during this decade, specifically today:
|---------------------------------------------------------------------------|
|"I can't find any of my old writing on it so I've rewritten it |
|here lol." |
|---------------------------------------------------------------------------|
Dear Alex:
A teammate had once solved a problem but he had forgotten how he
solved it. So he has queried a search engine. So it showed him a
webpage with a perfect solution --- a webpage written by him!
I recommend searching for that old writing about Unicode: perhaps it
has more details than this comp.lang.ada thread, or perhaps a
perspective has been changed in an interesting way. Even if there is
no difference, perhaps it is in a directory with other missing files
which need to be backed up!
|---------------------------------------------------------------------------|
|"If you use Latin-1 or Windows-1252 or some weird |
|regional encoding everyone will hate you, and if you restrict inputs to |
|7-bit ASCII everyone will hate you too lol. And people will get annoyed |
|if you use UTF-16 or UTF-32 instead of UTF-8 as the interchange/storage |
|format in a new program." |
|---------------------------------------------------------------------------|
I quote Usenet articles in a way which does not endear me to
persons. Not everyone reacts in the same way. OC Systems asked me how
do I draw those boxes.
I advocate Ada which also does not endear me to persons.
|---------------------------------------------------------------------------|
|"[. . .] |
| |
|I personally use Wide_Wide_<> for everything just because it's more |
|convenient to have more useful built-in string functions, and it makes |
|dealing with input/output encoding much easier later (detailed below). |
| |
|[. . .] |
| |
|I'm unfortunate enough to know most of the nuances of Unicode but I |
|won't subject you to it, but a lot of the statements in your collection |
|are a bit oversimplified (UCS-4 has a number of additional differences |
|from UTF-32 regarding "valid encodings", [. . .] |
|[. . .]" |
|---------------------------------------------------------------------------|
Thanks for this feedback and more will be as welcome as can be. I
quoted examples of what I found in this newsgroup. This newsgroup used
not have many statements with explicit references to "UTF-32" or
"UTF32" or "UCS-4" which differ overwhelmingly from what I quoted
during the previous week.
|---------------------------------------------------------------------------|
|"Also, I just stumbled across Ada.Strings.Text_Buffers which seems to be |
|new to Ada 2022, makes "string builder" stuff much more convenient |
|because you can write text using any of Ada's string types and then get |
|a string in whatever encoding you want [. . .] |
|[. . .]" |
|---------------------------------------------------------------------------|
Package Ada.Strings.Text_Buffers does not support UCS-4.
|---------------------------------------------------------------------------|
|"Note that there is zero chance in hell that UTF-32 will ever be adopted as|
|an interchange or storage encoding (except in isolated singular corporate |
|apps *maybe*), so UTF-32 being used should purely be an internal |
|implementation detail: incoming text in whatever encoding gets converted to|
|it and outgoing text will always get converted from it." |
|---------------------------------------------------------------------------|
One can know but what one can too optimistically know can be
false. Character sets or encodings used to be subjects of unfulfilled
expectations.
I can say that for now, UTF-8 is enough for a particular application.
Deadly Head did not have the same luck.
|---------------------------------------------------------------------------|
|"The encodings used by |
|Text_IO are mostly (but not entirely) based off of the `-gnatW` flag, which|
|is configuring the encoding of THE PROGRAM'S SOURCE CODE." |
|---------------------------------------------------------------------------|
GNAT has many switches. It could easily gain more switches.
Sincères salutations.
Nicolas Paul Colin de Glocester
next prev parent reply other threads:[~2025-09-02 17:40 UTC|newest]
Thread overview: 27+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-08-20 21:38 Ada 2012 and Unicode package (UTF-nn encodings handling) Yannick Duchêne (Hibou57)
2010-08-20 21:41 ` Yannick Duchêne (Hibou57)
2010-08-21 6:21 ` Dmitry A. Kazakov
2010-08-21 7:01 ` J-P. Rosen
2010-08-21 8:12 ` Yannick Duchêne (Hibou57)
2010-08-22 18:51 ` J-P. Rosen
2010-08-22 19:48 ` Georg Bauhaus
2010-08-22 20:40 ` J-P. Rosen
2010-08-23 10:32 ` Georg Bauhaus
2010-08-23 22:28 ` Randy Brukardt
2025-08-31 17:39 ` Ada 202x; 2022; and " Nicolas Paul Colin de Glocester
2025-08-31 21:23 ` Kevin Chadwick
2025-08-31 21:27 ` Nicolas Paul Colin de Glocester
2025-09-02 16:01 ` Alex // nytpu
2025-09-02 17:40 ` Nicolas Paul Colin de Glocester [this message]
2025-09-02 18:49 ` Keith Thompson
2025-09-02 19:27 ` Nicolas Paul Colin de Glocester
2025-09-02 20:02 ` Keith Thompson
2025-09-02 17:42 ` Nicolas Paul Colin de Glocester
2025-09-02 19:15 ` Alex // nytpu
2025-09-02 19:50 ` Nicolas Paul Colin de Glocester
2025-09-02 18:08 ` Dmitry A. Kazakov
2025-09-02 19:13 ` Alex // nytpu
2025-09-02 22:56 ` Lawrence D’Oliveiro
2025-09-03 0:20 ` Alex // nytpu
2025-09-03 4:10 ` Lawrence D’Oliveiro
2025-09-03 17:25 ` Alex // nytpu
replies disabled
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox