Re: Rational Ranges [was Re: gnulib regex lib]

bug-gnulib

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Rational Ranges [was Re: gnulib regex lib]

From:	arnold
Subject:	Re: Rational Ranges [was Re: gnulib regex lib]
Date:	Mon, 13 Aug 2018 00:36:05 -0600
User-agent:	Heirloom mailx 12.4 7/29/08

Paul Eggert <address@hidden> wrote:

> address@hidden wrote:
> > The only FIXMEs I see are both in the _LIBC part of the code, and
> > there's only two: one in regexec.c and one in regcomp.c.
>
> In that case I guess there isn't a problem.
>
> I am a little concerned that unibyte locales use bytes whereas multibyte 
> locales 
> use characters for range expressions. As I understand it, this means Turkish 
> range expressions are interpreted differently depending on whether the locale 
> uses UTF-8 or ISO/IEC 8859-9. Is that really what Turkish-speakers want?

It's a sad fact of life that users have to be aware of their character set /
locale and understand the consequences of what they choose to use (or
what their OS has chosen for them upon installation).  This is just
another aspect of that.

> That being said, it doesn't matter all that much nowadays now that UTF-8 has 
> taken over, so it's probably not worth much of our time to worry about this 
> discrepancy. For what it's worth, 
> https://w3techs.com/technologies/details/en-iso885909/all/all says that only 
> 0.06% of websites still use ISO/IEC 8859-9, down from 0.09% a year ago (and 
> down 
> from 0.7% in 2010, so this is a factor-of-10 decline in 8 years).

I totally agree that it's not worth worrying about. It's a too small
tail to be wagging such a big dog.

Thanks,

Arnold

[Prev in Thread]

Current Thread

[Next in Thread]

Re: [gawk-devel] changing regex lib, Paul Eggert, 2018/08/10
- Re: [gawk-devel] changing regex lib, Bruno Haible, 2018/08/10
  - Re: [gawk-devel] changing regex lib, Paul Eggert, 2018/08/12
    - Rational Ranges [was Re: gnulib regex lib], arnold, 2018/08/12
    - Re: Rational Ranges [was Re: gnulib regex lib], Paul Eggert, 2018/08/12
    - Re: Rational Ranges [was Re: gnulib regex lib], arnold, 2018/08/12
    - Re: Rational Ranges [was Re: gnulib regex lib], arnold, 2018/08/12
    - Re: Rational Ranges [was Re: gnulib regex lib], Paul Eggert, 2018/08/12
    - Re: Rational Ranges [was Re: gnulib regex lib], arnold <=
- Re: [gawk-devel] changing regex lib, arnold, 2018/08/12

Prev by Date: Re: Adding strfmon(3) to Gnulib?
Next by Date: Recent change to fnmatch breaks my build
Previous by thread: Re: Rational Ranges [was Re: gnulib regex lib]
Next by thread: Re: [gawk-devel] changing regex lib
Index(es):
- Date
- Thread