bug-texinfo
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: `texindex` output depends on locale settings


From: Eli Zaretskii
Subject: Re: `texindex` output depends on locale settings
Date: Sun, 06 Nov 2022 17:13:19 +0200

> Date: Sun, 06 Nov 2022 17:05:00 +0200
> From: Eli Zaretskii <eliz@gnu.org>
> Cc: wl@gnu.org, bug-texinfo@gnu.org
> 
> > It could instead be
> > 
> > function islower(c)
> > {
> >     return c ~ /[[:lower:]]/
> > }
> > 
> > And similar for the others.  That would work for any unicode character.
> 
> Sure, but is the issue only with lower-case letters?  What about
> collation order or even determining what is and isn't a character (as
> opposed to incomplete byte sequence)?

I mean: what if the document is encoded in Latin-1, but the locale's
codeset is UTF-8?  Then the 0xE0 byte (à in Latin-1 encoding) will
probably not be identified as lower-case.



reply via email to

[Prev in Thread] Current Thread [Next in Thread]