emacs-devel

Re: ucs-normalize and diacritics


From: K. Handa
Subject: Re: ucs-normalize and diacritics
Date: Tue, 31 Jul 2018 21:37:40 +0900

In article <address@hidden>, Eli Zaretskii <address@hidden> writes:

> > > - It doesn't show this for the whole range 0200 to 0377, only for 0240
> > >   to 0377, eg for 0200:
> > 
> > That's a bug, somewhere.  I will look into it when I have time.

> Not a bug, an unintended consequence of how we set up charsets.  Now
> fixed on the emacs-26 branch.

> I wonder whether we can mark tis620-2533 a :supplementary-p charset.
> Kenichi, do you envision any problems that could be caused by that?

If we set :supplementary-p t on tis620-2533, M-x list-character-sets
will list that charset under the "Supplementary Character Sets" section,
which will surprise tis620-2533 users (I think there are still many).
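
For anyone who wants to check how a charset is currently marked, here is
a minimal sketch. It assumes the flag, when set, is stored in the
charset's property list as returned by charset-plist; the helper name
my/charset-supplementary-p is hypothetical and not part of Emacs.

```elisp
;; Hypothetical helper (not part of Emacs): report whether CHARSET
;; carries a non-nil :supplementary-p entry in its property list.
;; Assumption: the flag lives in the plist returned by `charset-plist'.
(defun my/charset-supplementary-p (charset)
  "Return non-nil if CHARSET's plist has a non-nil :supplementary-p."
  (plist-get (charset-plist charset) :supplementary-p))

;; Inspect the charset discussed above.
(my/charset-supplementary-p 'tis620-2533)
```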

And, anyway, that change won't affect the behavior of char-charset, which
describe-char uses.  I have not yet read all the mails in this thread,
but if the problem is that describe-char does not show eight-bit as the
charset of eight-bit characters, perhaps the best way is to modify
describe-char like this:

diff --git a/lisp/descr-text.el b/lisp/descr-text.el
index d8f8188eb1..aea090b907 100644
--- a/lisp/descr-text.el
+++ b/lisp/descr-text.el
@@ -403,6 +403,7 @@ describe-char
            (eight-bit-p (and (not enable-multibyte-characters) (>= char 128)))
            (charset (if eight-bit-p 'eight-bit
                       (or (get-text-property pos 'charset)
+                          (and (encode-char char 'eight-bit) 'eight-bit)
                           (char-charset char))))
            (composition (find-composition pos nil nil t))
            (component-chars nil)
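
To try the idea outside describe-char, the following is a rough sketch
of the same selection order: the charset text property first, then
eight-bit if the character is encodable in that charset, then
char-charset.  The function name my/charset-for-describe is hypothetical,
and the sketch leaves out the unibyte-buffer branch (eight-bit-p) that
the real code handles separately.

```elisp
;; Hypothetical helper (not part of Emacs) mirroring the `or' form in
;; the patch.  `encode-char' returns nil when CHARSET cannot encode
;; CHAR, so the second branch only fires for raw eight-bit characters.
(defun my/charset-for-describe (char &optional pos)
  "Return the charset the patched selection would pick for CHAR.
If POS is non-nil, a `charset' text property at POS takes priority."
  (or (and pos (get-text-property pos 'charset))
      (and (encode-char char 'eight-bit) 'eight-bit)
      (char-charset char)))

;; A raw byte (0240 octal = #xA0) as a multibyte character:
;; the helper returns the symbol `eight-bit'.
(my/charset-for-describe (unibyte-char-to-multibyte #xA0))

;; An ordinary character is not encodable in eight-bit, so the
;; helper falls through to `char-charset'.
(my/charset-for-describe ?a)
```

Putting the encode-char test before char-charset means the result for
raw bytes no longer depends on charset priority.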

---
K. Handa
address@hidden
