|
From: | Markus Mützel |
Subject: | [Octave-bug-tracker] [bug #49348] Treat multi-byte characters as one character for char array |
Date: | Thu, 29 Oct 2020 11:00:40 -0400 (EDT) |
User-agent: | Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/86.0.4240.111 Safari/537.36 Edg/86.0.622.56 |
Follow-up Comment #17, bug #49348 (project octave): Glad, we agree. Just a minor addition: 32 bits are enough to encode all Unicode characters. The encoding that stores the Unicode code point in 32bit units (4 bytes) is called UTF-32. (Plus endianness, …) Also note that one code point does not necessarily equal one glyph. One glyph could be made up of one or multiple Unicode code points. _______________________________________________________ Reply to this item at: <https://savannah.gnu.org/bugs/?49348> _______________________________________________ Message sent via Savannah https://savannah.gnu.org/
[Prev in Thread] | Current Thread | [Next in Thread] |