help-gawk
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: How to represent NBSP in gawk regex?


From: Wolfgang Laun
Subject: Re: How to represent NBSP in gawk regex?
Date: Mon, 21 Feb 2022 19:46:10 +0100

For a similar issue I pnce enhanced the environment of some programming
language with a module where Unicode characters (up to U+FFFF) could be
referenced by their names, defined to be int constants. Likewise, it would
be possible to have a string-to-int map.
-W


On Mon, 21 Feb 2022 at 19:35, Neil R. Ormos <ormos-gnulists17@ormos.org>
wrote:

> Eli Zaretskii wrote:
> > ["Neil R. Ormos" wrote:]
>
> >> Although both David's and Wolfgang's solutions
> >> work, I wonder if there is a more portable way
> >> to represent the character that is not
> >> nailed-up for a specific character set.
>
> > What do you mean by "character set" in this
> > context?  That is an overloaded terminology, and
> > it's easy to become confused if we don't define
> > precisely what we mean.
>
> I think the now-elided example made that clear, but I will restate:
>
> The question is, if someone finds themselves in an environment where the
> character encoding is not UTF-8, but does support a non-breaking-space
> character, how do they cause their Gawk program to produce it without
> having /a priori/ knowledge of the sequence of bits that encode that
> character?
>
>

-- 
Wolfgang Laun


reply via email to

[Prev in Thread] Current Thread [Next in Thread]