bug-texinfo
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: texi2any is too slow because of Unicode::Collate


From: Eli Zaretskii
Subject: Re: texi2any is too slow because of Unicode::Collate
Date: Sat, 11 Feb 2023 22:02:55 +0200

> From: Gavin Smith <gavinsmith0123@gmail.com>
> Date: Sat, 11 Feb 2023 19:46:12 +0000
> 
> On Sat, Feb 11, 2023 at 08:04:15PM +0100, Patrice Dumas wrote:
> > Other than that I do not have much other idea than disabling it, for
> > instance if documentlanguage is en.  The result with Unicode::Collate is
> > better for accented letters, but not so useful in english.  There could
> > even be a customization variable to use Unicode::Collate even in
> > english.
> 
> I think it's a good idea to disable it for "en" at least, along with
> a customization variable.

How many manuals set documentlanguage?  With the proliferation of
documentencoding set to UTF-8, I think disabling the collation for
"en" will be next to futile.

How come format_printindex takes such a large proportion of the
processing?  Isn't that strange?  Index entries are usually a small
proportion of the overall manual's text, so processing the manual
should take the lion's share.  The index in the manual you were timing
has about 8K entries, but the entire manual is 100K lines, so the
index is less than 10% of the total volume.  How come its processing
is so expensive?



reply via email to

[Prev in Thread] Current Thread [Next in Thread]