comp.lang.ada
From: Nicolas Paul Colin de Glocester <Spamassassin@irrt.De>
Subject: Re: Ada 202x; 2022; and 2012 and Unicode package (UTF-nn encodings handling)
Date: Tue, 2 Sep 2025 19:40:07 +0200
Message-ID: <4248d82b-d759-b5d2-aa45-e4c0e26c81e5@insomnia247.nl>
In-Reply-To: <10974d1$jn0e$1@dont-email.me>

Alex // nytpu wrote during this decade, specifically today:
|---------------------------------------------------------------------------|
|"I can't find any of my old writing on it so I've rewritten it             |
|here lol."                                                                 |
|---------------------------------------------------------------------------|

Dear Alex:

A teammate once solved a problem but later forgot how he had solved
it. So he queried a search engine, which showed him a webpage with a
perfect solution --- a webpage written by him!

I recommend searching for that old writing about Unicode: perhaps it
has more details than this comp.lang.ada thread, or perhaps your
perspective has changed in an interesting way. Even if there is no
difference, perhaps it is in a directory with other missing files
which need to be backed up!

|---------------------------------------------------------------------------|
|"If you use Latin-1 or Windows-1252 or some weird                          |
|regional encoding everyone will hate you, and if you restrict inputs to    |
|7-bit ASCII everyone will hate you too lol.  And people will get annoyed   |
|if you use UTF-16 or UTF-32 instead of UTF-8 as the interchange/storage    |
|format in a new program."                                                  |
|---------------------------------------------------------------------------|

I quote Usenet articles in a way which does not endear me to
persons. Not everyone reacts in the same way. OC Systems asked me how
I draw those boxes.

I advocate Ada, which also does not endear me to persons.

|---------------------------------------------------------------------------|
|"[. . .]                                                                   |
|                                                                           |
|I personally use Wide_Wide_<> for everything just because it's more        |
|convenient to have more useful built-in string functions, and it makes     |
|dealing with input/output encoding much easier later (detailed below).     |
|                                                                           |
|[. . .]                                                                    |
|                                                                           |
|I'm unfortunate enough to know most of the nuances of Unicode but I        |
|won't subject you to it, but a lot of the statements in your collection    |
|are a bit oversimplified (UCS-4 has a number of additional differences     |
|from UTF-32 regarding "valid encodings", [. . .]                           |
|[. . .]"                                                                   |
|---------------------------------------------------------------------------|

Thanks for this feedback; more will be most welcome. I quoted
examples of what I found in this newsgroup. This newsgroup used not to
have many statements with explicit references to "UTF-32" or "UTF32"
or "UCS-4" which differ greatly from what I quoted during the
previous week.

|---------------------------------------------------------------------------|
|"Also, I just stumbled across Ada.Strings.Text_Buffers which seems to be   |
|new to Ada 2022, makes "string builder" stuff much more convenient         |
|because you can write text using any of Ada's string types and then get    |
|a string in whatever encoding you want [. . .]                             |
|[. . .]"                                                                   |
|---------------------------------------------------------------------------|

Package Ada.Strings.Text_Buffers does not support UCS-4.

|---------------------------------------------------------------------------|
|"Note that there is zero chance in hell that UTF-32 will ever be adopted as|
|an interchange or storage encoding (except in isolated singular corporate  |
|apps *maybe*), so UTF-32 being used should purely be an internal           |
|implementation detail: incoming text in whatever encoding gets converted to|
|it and outgoing text will always get converted from it."                   |
|---------------------------------------------------------------------------|

One can know things, but what one too optimistically believes one
knows can be false. Character sets and encodings have been subjects of
unfulfilled expectations before.

I can say that for now, UTF-8 is enough for a particular application.

Deadly Head did not have the same luck.

|---------------------------------------------------------------------------|
|"The encodings used by                                                     |
|Text_IO are mostly (but not entirely) based off of the `-gnatW` flag, which|
|is configuring the encoding of THE PROGRAM'S SOURCE CODE."                 |
|---------------------------------------------------------------------------|

GNAT has many switches. It could easily gain more switches.

Sincere regards.



Nicolas Paul Colin de Glocester
