emacs-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Word boundary (was: find-composition still depends on the compositio


From: Kenichi Handa
Subject: Re: Word boundary (was: find-composition still depends on the composition property)
Date: Sun, 26 Oct 2008 22:36:05 +0900
User-agent: SEMI/1.14.3 (Ushinoya) FLIM/1.14.2 (Yagi-Nishiguchi) APEL/10.2 Emacs/23.0.60 (i686-pc-linux-gnu) MULE/6.0 (HANACHIRUSATO)

In article <address@hidden>, Eli Zaretskii <address@hidden> writes:

> Unless I'm missing something important, my reading of th UAX #29
> (http://www.unicode.org/reports/tr29/tr29-13.html) is that almost all
> scripts should _not_ have word breaks between letters and digits.  And
> neither should we define a word break on script boundaries, in most
> cases.

Although it says "Do not break between most letters. ALetter
x ALetter", ALetter doesn't include Han, Katakana, and
Hiragana.

And, it also has this note:

Normally word breaking does not require breaking between
different scripts. However, adding that capability may be
useful in combination with other extensions of word
segmentation. For example, ...

---
Kenichi Handa
address@hidden






reply via email to

[Prev in Thread] Current Thread [Next in Thread]