bug-bash
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: locale specific ordering in EN_US -- why is a<A<b<B<y<Y<z<Z?


From: Linda Walsh
Subject: Re: locale specific ordering in EN_US -- why is a<A<b<B<y<Y<z<Z?
Date: Mon, 21 May 2012 12:56:34 -0700
User-agent: Mozilla/5.0 (Windows; U; Windows NT 6.0; en-US; rv:1.8.1.24) Gecko/20100228 Lightning/0.9 Thunderbird/2.0.0.24 Mnenhy/0.7.6.666



Aharon Robbins wrote:


You'd think unicode would have something to say about collation
order that wouldn't allow such randomness, but maybe not.

It actually makes sense that it doesn't, since Unicode is more or less
a mapping of code points to glyphs, which is language independant. The
rules for collating depend upon the language.

----
        Actually it makes sense that it would... and it DOES.

Bash and POSIX would appear to violate Unicode ordering.

Algorithm reference: http://www.unicode.org/reports/tr10/tr10-24.html

Collation order chart:
http://www.unicode.org/Public/UCA/latest/allkeys.txt

----
It shows that cap A-Z come before a-z.

If a sort order is set (as mine used to be) to
en_US.UTF-8

then the Unicode sort order should have been used.

In bash, it was not.

It sounds like this is a bug rooted in the C libraries?






reply via email to

[Prev in Thread] Current Thread [Next in Thread]