Re: accents
From: Chet Ramey
Subject: Re: accents
Date: Tue, 10 May 2011 10:33:54 -0400
User-agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.5; en-US; rv:1.9.2.17) Gecko/20110414 Thunderbird/3.1.10
On 5/10/11 9:17 AM, Greg Wooledge wrote:
> In yours, however, it is 0x65 0xcc 0x81 which is U+0065 LATIN SMALL
> LETTER E followed by U+0301 COMBINING ACUTE ACCENT.
>
> Perhaps Bash does not know how to interpret COMBINING ACUTE ACCENT which
> follows a letter...?
>
> I'm not intimately familiar with this stuff myself, but it looks like
> a real bastard to me... I thought the point of UTF-8 was that you could
> read it a byte at a time, and know when you encountered a byte that
> signified the start of a multi-byte character. But apparently not!
> If I'm interpreting this COMBINING ACUTE ACCENT thing properly, the
> only indicator that you are in a multi-byte character comes with the
> *second* byte, so you have to backtrack. What idiot thought this up?
http://lists.gnu.org/archive/html/bug-readline/2011-01/msg00011.html
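
For readers following the byte-level question in the quoted text, here is a
minimal Python sketch (not part of the original message) that looks at the
three bytes 0x65 0xcc 0x81. It suggests that UTF-8 itself needs no
backtracking: 0x65 is a complete one-byte character, and 0xcc is the lead byte
of a separate two-byte code point. The combining happens one level up, between
code points. The helper name byte_role is hypothetical, and the classification
is simplified (it does not reject overlong or otherwise invalid sequences).

```python
import unicodedata

# The three bytes from the quoted message: 0x65 0xcc 0x81.
raw = bytes([0x65, 0xCC, 0x81])


def byte_role(b):
    """Classify one byte by its high bits (simplified, hypothetical helper)."""
    if b < 0x80:
        return "single-byte (ASCII)"
    if b & 0xC0 == 0x80:
        return "continuation"
    if b & 0xE0 == 0xC0:
        return "lead of 2-byte sequence"
    if b & 0xF0 == 0xE0:
        return "lead of 3-byte sequence"
    if b & 0xF8 == 0xF0:
        return "lead of 4-byte sequence"
    return "invalid"


# Each byte's role is known from that byte alone.
roles = [byte_role(b) for b in raw]

# Decoding yields two code points, not one.
text = raw.decode("utf-8")
names = [unicodedata.name(ch) for ch in text]

# NFC normalization composes the pair into the precomposed character.
composed = unicodedata.normalize("NFC", text)

for b, role in zip(raw, roles):
    print(f"0x{b:02x}: {role}")
print(names)
print(composed.encode("utf-8").hex())
```

The decomposed and precomposed forms differ as byte strings, so anything that
counts characters or display columns has to look at code points (and their
combining class), not just at UTF-8 lead bytes.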
--
``The lyf so short, the craft so long to lerne.'' - Chaucer
``Ars longa, vita brevis'' - Hippocrates
Chet Ramey, ITS, CWRU chet@case.edu http://cnswww.cns.cwru.edu/~chet/