[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: letters like é
From: |
Werner LEMBERG |
Subject: |
Re: letters like é |
Date: |
Fri, 30 Dec 2011 13:00:51 +0100 (CET) |
>> If we get an invalid UTF-8 sequence, I'm all for it. But it is not
>> too difficult to not get invalid sequences but still have wrong
>> output.
>
> Theoretically. But it is impossible to write just a single
> non-ASCII byte without hitting an invalid sequence since all
> non-ASCII bytes must be part of multi-byte sequences. Only
> combinations of non-ASCII bytes can form valid utf-8 sequences, and
> the probability of several of them being "just right" is not all
> that high.
For single-byte encodings, you are correct. However, the probability
is *much* higher if you consider legacy two-byte encodings for CJK
scripts.
Werner
- letters like é, Amund, 2011/12/29
- Re: letters like é, Francisco Vila, 2011/12/29
- Re: letters like é, David Kastrup, 2011/12/29
- Re: letters like é, Janek Warchoł, 2011/12/29
- Re: letters like é, Werner LEMBERG, 2011/12/29
- Re: letters like é, David Kastrup, 2011/12/29
- Re: letters like é, Werner LEMBERG, 2011/12/29
- Re: letters like é, Janek Warchoł, 2011/12/29
- Re: letters like é, Werner LEMBERG, 2011/12/29
- Re: letters like é, David Kastrup, 2011/12/30
- Re: letters like é,
Werner LEMBERG <=
- Re: letters like é, David Kastrup, 2011/12/30
- Re: letters like é, Werner LEMBERG, 2011/12/30
- Re: letters like é, David Kastrup, 2011/12/30
- Re: letters like é, Werner LEMBERG, 2011/12/30
- Re: letters like é, David Kastrup, 2011/12/30
- Re: letters like é, Werner LEMBERG, 2011/12/30
- Re: letters like é, David Kastrup, 2011/12/31
- Re: letters like é, David Kastrup, 2011/12/29
- Re: letters like é, Pavel Roskin, 2011/12/29