From: Linda Walsh
Subject: Re: locale specific ordering in EN_US vs. characterset collation rules for UTF-8
Date: Thu, 27 Jun 2013 22:04:21 -0700
User-agent: Thunderbird
Chet Ramey wrote:
The world is larger than glibc and the glibc locale definitions. We need a solution that encompasses all of it. That solution should, and maybe will, include glibc, but that is not sufficient by itself.
----
I don't suppose it is possible to use the Unicode collation order when using Unicode?

Algorithm reference: http://www.unicode.org/reports/tr10/tr10-24.html
Collation order chart: http://www.unicode.org/Public/UCA/latest/allkeys.txt

How does one get UTF-8 collation order?

I would think that a character-specific ordering specified in LC_COLLATE would take precedence over a less specific regional ordering. I.e., with LC_COLLATE="XXX.UTF-8", it seems like it should use the UTF-8 rules over the XXX rules for collation. If they wanted regional rules, then "XXX" alone, without specifying an international standard like Unicode, would allow regional rules to take precedence. But if they specify a specific character encoding for the characters, under collation, why wouldn't the character set's collation order be used?

So how does one get UTF-8's Unicode collation ordering?
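
To make the question concrete, here is a rough Python sketch (not bash, and not anything from the thread) of the orderings involved. It relies on one fact: UTF-8 by itself only defines an encoding, and comparing UTF-8 bytes gives plain code point order, which is a different thing from the weighted order in the allkeys.txt table. The word list, the helper name `sort_with_locale`, and the assumption that a locale called "en_US.UTF-8" may be installed are all illustrative.

```python
import locale

# Illustrative sample data, not taken from the thread.
words = ["banana", "Zebra", "apple", "Apple", "éclair"]

# Code point order: uppercase ASCII sorts before lowercase, and
# accented letters sort after all of ASCII.
by_codepoint = sorted(words)

# Bytewise comparison of the UTF-8 encodings gives the same order,
# because UTF-8 preserves code point order.
by_utf8_bytes = sorted(words, key=lambda s: s.encode("utf-8"))


def sort_with_locale(items, name):
    """Sort using the named locale's LC_COLLATE rules.

    Returns None if that locale is not available on this system.
    (Hypothetical helper, for illustration only.)
    """
    old = locale.setlocale(locale.LC_COLLATE)  # remember current setting
    try:
        locale.setlocale(locale.LC_COLLATE, name)
        return sorted(items, key=locale.strxfrm)
    except locale.Error:
        return None
    finally:
        locale.setlocale(locale.LC_COLLATE, old)


print("code point:", by_codepoint)
print("UTF-8 bytes:", by_utf8_bytes)
# Result depends on the system's locale definitions; may print None.
print("en_US.UTF-8:", sort_with_locale(words, "en_US.UTF-8"))
```

The first two lines of output always match. The third depends entirely on what the system's locale definition for that name says, which is the regional-versus-encoding question raised above.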