bug-lilypond

Re: letters like é


From: Werner LEMBERG
Subject: Re: letters like é
Date: Fri, 30 Dec 2011 13:00:51 +0100 (CET)

>> If we get an invalid UTF-8 sequence, I'm all for it.  But it is not
>> too difficult to not get invalid sequences but still have wrong
>> output.
> 
> Theoretically.  But it is impossible to write just a single
> non-ASCII byte without hitting an invalid sequence since all
> non-ASCII bytes must be part of multi-byte sequences.  Only
> combinations of non-ASCII bytes can form valid utf-8 sequences, and
> the probability of several of them being "just right" is not all
> that high.

For single-byte encodings, you are correct.  However, the probability
of accidentally hitting a valid UTF-8 sequence is *much* higher if you
consider legacy two-byte encodings for CJK scripts.
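
To put a rough number on this, here is a minimal sketch in Python.  It
assumes an EUC-style two-byte layout (both bytes in 0xA1..0xFE, as used
by EUC-JP, EUC-KR, and GB2312) and counts every cell of that grid
equally, which real text does not do, so read the result as an
illustration only.  The helper name `is_valid_utf8` is made up for this
example.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if `data` decodes as strict UTF-8."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


NON_ASCII = range(0x80, 0x100)   # every non-ASCII byte value
EUC_RANGE = range(0xA1, 0xFF)    # assumed EUC-style range, 0xA1..0xFE

# Single-byte case: one non-ASCII byte surrounded by ASCII text.
lone_valid = sum(
    is_valid_utf8(b"a" + bytes([b]) + b"b") for b in NON_ASCII
)

# Two-byte case: every byte pair in the assumed EUC-style grid.
pair_total = len(EUC_RANGE) ** 2
pair_valid = sum(
    is_valid_utf8(bytes([hi, lo]))
    for hi in EUC_RANGE
    for lo in EUC_RANGE
)

print(f"lone non-ASCII bytes accepted as UTF-8: {lone_valid}")
print(f"two-byte pairs accepted as UTF-8: {pair_valid} of {pair_total}")
```

Under these assumptions no lone non-ASCII byte passes, while roughly
one in ten of the two-byte pairs does (lead byte 0xC2..0xDF, trail byte
0xA1..0xBF).  Such pairs decode silently to unrelated characters, so
validity checking alone would not catch the wrong encoding.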


    Werner


