bug-texinfo
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: `texindex` output depends on locale settings


From: Eli Zaretskii
Subject: Re: `texindex` output depends on locale settings
Date: Sun, 06 Nov 2022 17:05:00 +0200

> From: arnold@skeeve.com
> Date: Sun, 06 Nov 2022 07:57:03 -0700
> Cc: wl@gnu.org, bug-texinfo@gnu.org
> 
> Eli Zaretskii <eliz@gnu.org> wrote:
> 
> > Are you sure this Werner's request can be fulfilled:
> >
> > > >   I consider it very bad that `texindex` is locale-dependent.  IMHO
> > > >   the proper solution is to make `texinfo.tex` emit a document
> > > >   encoding statement to the (unsorted) index file that in turn gets
> > > >   acknowledged by `texindex`.
> 
> Sure? No. But I have some thoughts.
> 
> > FWIW, I don't even understand how can this be accomplished, unless the
> > program reinvents all the library functions that deal with characters
> > from scratch, instead of using libc functions (which are
> > locale-dependent).  And Gawk does use libc functions for that.
> 
> The current islower() is
> 
> function islower(c)
> {
>       return index("abcdefghijklmnopqrstuvwxyz", c) > 0
> }
> 
> It could instead be
> 
> function islower(c)
> {
>       return c ~ /[[:lower:]]/
> }
> 
> And similar for the others.  That would work for any unicode character.

Sure, but is the issue only with lower-case letters?  What about
collation order or even determining what is and isn't a character (as
opposed to incomplete byte sequence)?



reply via email to

[Prev in Thread] Current Thread [Next in Thread]