From: "Dmitry A. Kazakov" <mailbox@dmitry-kazakov.de>
Subject: Re: unicode and wide_text_io
Date: Sat, 30 Dec 2017 16:56:24 +0100
Date: 2017-12-30T16:56:24+01:00 [thread overview]
Message-ID: <p28cv7$lo2$1@gioia.aioe.org> (raw)
In-Reply-To: 19cf4dhtoec32ti6nnnduqrgatdj27phvm@4ax.com
On 2017-12-30 16:33, Dennis Lee Bieber wrote:
> Isn't that 0..2^7... Any byte with the MSB set is a multibyte code (and
> number of MSB bits set before a 0 bit indicates how many bytes).
Yes. Furthermore, the subsequent octets have MSB set. The reason for
this "waste" is to allow bidirectional scanning of UTF-8 strings.
--
Regards,
Dmitry A. Kazakov
http://www.dmitry-kazakov.de
next prev parent reply other threads:[~2017-12-30 15:56 UTC|newest]
Thread overview: 26+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-12-27 18:08 unicode and wide_text_io Mehdi Saada
2017-12-27 20:04 ` Dmitry A. Kazakov
2017-12-27 21:47 ` Dennis Lee Bieber
2017-12-27 22:32 ` Mehdi Saada
2017-12-27 22:33 ` Mehdi Saada
2017-12-27 22:48 ` Mehdi Saada
2017-12-27 23:32 ` Mehdi Saada
2017-12-27 23:57 ` Randy Brukardt
2017-12-28 5:20 ` Robert Eachus
2017-12-31 21:41 ` Keith Thompson
2017-12-28 9:04 ` Dmitry A. Kazakov
2017-12-28 11:06 ` Niklas Holsti
2017-12-28 11:50 ` Dmitry A. Kazakov
2017-12-28 13:15 ` Mehdi Saada
2017-12-28 14:25 ` Dmitry A. Kazakov
2017-12-28 14:32 ` Simon Wright
2017-12-28 15:28 ` Niklas Holsti
2017-12-28 15:47 ` 00120260b
2017-12-28 22:35 ` G.B.
2017-12-28 18:15 ` Simon Wright
2017-12-28 22:36 ` Mehdi Saada
2017-12-29 0:51 ` Randy Brukardt
2017-12-30 12:50 ` Björn Lundin
2017-12-30 15:33 ` Dennis Lee Bieber
2017-12-30 15:56 ` Dmitry A. Kazakov [this message]
2017-12-30 23:20 ` Björn Lundin
replies disabled
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox