[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
bug#540: 23.0.60; Unicode search bug
From: |
Juri Linkov |
Subject: |
bug#540: 23.0.60; Unicode search bug |
Date: |
Sun, 06 Jul 2008 21:43:23 +0300 |
User-agent: |
Gnus/5.13 (Gnus v5.13) Emacs/23.0.60 (x86_64-pc-linux-gnu) |
There is a weird bug in searching Unicode text. The search function
fails on Cyrillic letters between codepoints #x0400 and #x041f, but
successfully finds a Cyrillic letter between #x0420 and #x042f.
I tried to debug this and see that in case of failure
it calls `boyer_moore', and in case of successful search
it calls `simple_search'. I checked the Unicode properties,
but everything seems correct.
This bug didn't exist before the Unicode merge.
The easiest way to reproduce it: run `emacs -Q',
put in the *scratch* buffer the following 4 lines
(note the leading space):
(search-forward " П" nil t)
(search-forward " Р" nil t)
П
Р
and type `C-x C-e' after each of first two lines.
In GNU Emacs 23.0.60 (x86_64-pc-linux-gnu)
Important settings:
value of $LC_ALL: nil
value of $LC_COLLATE: nil
value of $LC_CTYPE: nil
value of $LC_MESSAGES: nil
value of $LC_MONETARY: nil
value of $LC_NUMERIC: nil
value of $LC_TIME: nil
value of $LANG: en_US.UTF-8
value of $XMODIFIERS: nil
locale-coding-system: utf-8-unix
default-enable-multibyte-characters: t
--
Juri Linkov
http://www.jurta.org/emacs/
[Prev in Thread] |
Current Thread |
[Next in Thread] |
- bug#540: 23.0.60; Unicode search bug,
Juri Linkov <=