[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
UAX #29 changes
From: |
Daiki Ueno |
Subject: |
UAX #29 changes |
Date: |
Wed, 19 Oct 2016 16:11:17 +0200 |
User-agent: |
Gnus/5.13 (Gnus v5.13) Emacs/25.1.50 (gnu/linux) |
Hello,
I have been trying to update libunistring to Unicode 9.0.0. Initially I
planned it for the end of this month, but now I'm almost giving up,
because of the recent additions to the UAX #29 algorithms:
- The 3 rules added to the Grapheme Cluster Boundary Rules, namely
(GB10, GB12, GB13), involve 3 consequent characters, while the current
API uc_is_grapheme_break() only takes 2 characters
- The similar rules are also added to the Word Boundary Rules. Though
it wouldn't be a problem as uniwbrk.h doesn't expose such API, the
implementation of WB15 and WB16 could be complicated because it
requires lookahead of a next character
I have pushed the current version to my github repository:
https://github.com/ueno/gnulib/tree/wip/dueno/unicode-9.0.0
but I will be unlikely to have time to complete it any time soon.
Sorry. If anyone could finish it or rewrite it, it would be
appreciated.
By the way, I would like to change the "Maintainer:" field of the Gnulib
modules to "all", if nobody opposes.
Regards,
--
Daiki Ueno
- UAX #29 changes,
Daiki Ueno <=