[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
bug#37036: [PATCH] Inconsistent ASCII and Latin char categories
From: |
Mattias Engdegård |
Subject: |
bug#37036: [PATCH] Inconsistent ASCII and Latin char categories |
Date: |
Thu, 15 Aug 2019 14:17:15 +0200 |
The ASCII (a) and Latin (l) character categories are inconsistent in what
characters they contain.
It should be clear what the ASCII category means, but it omits 00-1f (contrary
to a comment in the code).
The Latin category isn't exactly defined anywhere but should reasonably
comprise letters from Latin-based scripts. Currently, it also includes many
control characters and symbols from the ASCII and Latin-1 Supplement blocks,
which seems hard to justify.
Other changes to Latin could be argued: what modifiers/combining chars should
be included? What about IPA and non-IPA phonetics? Ligatures? What about
Latin-derived forms such as circled letters? &c. The attached patch does not go
there but only fixes the glaring errors in the 00-ff range.
0001-Fix-ASCII-and-Latin-character-categories.patch
Description: Binary data
- bug#37036: [PATCH] Inconsistent ASCII and Latin char categories,
Mattias Engdegård <=
- bug#37036: [PATCH] Inconsistent ASCII and Latin char categories, Eli Zaretskii, 2019/08/15
- bug#37036: [PATCH] Inconsistent ASCII and Latin char categories, Mattias Engdegård, 2019/08/15
- bug#37036: [PATCH] Inconsistent ASCII and Latin char categories, Eli Zaretskii, 2019/08/15
- bug#37036: [PATCH] Inconsistent ASCII and Latin char categories, Mattias Engdegård, 2019/08/15
- bug#37036: [PATCH] Inconsistent ASCII and Latin char categories, Eli Zaretskii, 2019/08/15
- bug#37036: [PATCH] Inconsistent ASCII and Latin char categories, Mattias Engdegård, 2019/08/15
- bug#37036: [PATCH] Inconsistent ASCII and Latin char categories, Eli Zaretskii, 2019/08/15
- bug#37036: [PATCH] Inconsistent ASCII and Latin char categories, Eli Zaretskii, 2019/08/15
- bug#37036: [PATCH] Inconsistent ASCII and Latin char categories, Mattias Engdegård, 2019/08/15
- bug#37036: [PATCH] Inconsistent ASCII and Latin char categories, Eli Zaretskii, 2019/08/16