bug-texinfo
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Displaying characters in user's locale


From: Per Bothner
Subject: Re: Displaying characters in user's locale
Date: Sat, 01 Feb 2014 13:33:23 -0800
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:24.0) Gecko/20100101 Thunderbird/24.2.0

On 02/01/2014 09:32 AM, Eli Zaretskii wrote:
What if a sequence of bytes intended to encode ISO-8859-1 characters
happens to correctly represent UTF-8 characters?

This cannot happen, by virtue of the UTF-8 definition and the fact
that ISO-8859-1 is a single-byte encoding.

Except for ASCII characters, that is.

I don't believe that is correct.

Imagine the 2-byte sequence 110xxxxx 10yyyyyy.  In UTF-8 that represents
the character xxxxxyyyyyy, while in ISO-8859-1 that can be a valid
2-character sequence.

--
        --Per Bothner
address@hidden   http://per.bothner.com/



reply via email to

[Prev in Thread] Current Thread [Next in Thread]