bug-gnulib
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: bug#24975: Matching issues with characters whose encoding ends in so


From: Norihiro Tanaka
Subject: Re: bug#24975: Matching issues with characters whose encoding ends in some other character
Date: Mon, 28 Nov 2016 23:47:57 +0900

Jim Meyering <address@hidden> wrote:

> I suspect this won't be the last word in this area, because it feels
> like we should be able to adjust DFA's tables so that people using
> such locales can retain DFA's efficiency without the bug in the
> current implementation.

Hi Jim,

It is a bug in dfa for period expression in non-UTF8 locales.  dfa
calculates transition for single byte characters and a multibyte
character separately and merge both results.  However, if backs to
an initial state in transition for single byte characters, we should
stop matching single byte characters.

Thanks,
Norihiro

Attachment: 0001-dfa-avoid-match-middle-in-multibyte-character.patch
Description: Text document


reply via email to

[Prev in Thread] Current Thread [Next in Thread]