bug-gnu-emacs
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

bug#37036: [PATCH] Inconsistent ASCII and Latin char categories


From: Mattias Engdegård
Subject: bug#37036: [PATCH] Inconsistent ASCII and Latin char categories
Date: Thu, 15 Aug 2019 14:17:15 +0200

The ASCII (a) and Latin (l) character categories are inconsistent in what 
characters they contain.

It should be clear what the ASCII category means, but it omits 00-1f (contrary 
to a comment in the code).

The Latin category isn't exactly defined anywhere but should reasonably 
comprise letters from Latin-based scripts. Currently, it also includes many 
control characters and symbols from the ASCII and Latin-1 Supplement blocks, 
which seems hard to justify.

Other changes to Latin could be argued: what modifiers/combining chars should 
be included? What about IPA and non-IPA phonetics? Ligatures? What about 
Latin-derived forms such as circled letters? &c. The attached patch does not go 
there but only fixes the glaring errors in the 00-ff range.

Attachment: 0001-Fix-ASCII-and-Latin-character-categories.patch
Description: Binary data


reply via email to

[Prev in Thread] Current Thread [Next in Thread]