emacs-devel

From: Eli Zaretskii
Subject: Re: Unicode confusables and reordering characters considered harmful, a simple solution
Date: Sat, 06 Nov 2021 17:34:47 +0200

> From: Benjamin Riefenstahl <b.riefenstahl@turtle-trading.net>
> Date: Sat, 06 Nov 2021 14:58:31 +0100
> 
> Eli Zaretskii writes:
> > The Unicode Bidirectional Algorithm (UBA) mandates
> > (https://unicode.org/reports/tr9/#X8):
> >
> >   X8. All explicit directional embeddings, overrides and isolates are
> >   completely terminated at the end of each paragraph.
> >
> > [...]
> >
> > So when the UBA says "at the end of each paragraph", it means in
> > practice at EOL, since all the other paragraph separators are rarely
> > if ever used in human-readable text.  (And Emacs, of course,
> > implements that rule.)
> 
> Should the end of a comment or string in source code then also qualify
> as the end of a paragraph in this sense?

It could be, but the way the UBA is implemented in Emacs makes that
very hard to do, if not impossible.  And that's even before you
consider comment styles which make that hard even in principle.  For
example:

  /* This is the beginning of a comment, */
  /* and this is its continuation.      */
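
To make the two points above concrete, here is a minimal Python sketch. It is
not how Emacs implements the UBA; it only tracks which explicit directional
formatting characters are still open when a line ends, which is where rule X8
terminates them. The function name `unterminated_at_eol` and the simplified
stack model (no depth limits, no overflow handling) are assumptions made for
illustration.

```python
# Explicit directional formatting characters defined by UAX #9.
EMBEDDING_OPENERS = {"\u202A", "\u202B", "\u202D", "\u202E"}  # LRE, RLE, LRO, RLO
PDF = "\u202C"                                                # POP DIRECTIONAL FORMATTING
ISOLATE_OPENERS = {"\u2066", "\u2067", "\u2068"}              # LRI, RLI, FSI
PDI = "\u2069"                                                # POP DIRECTIONAL ISOLATE


def unterminated_at_eol(line):
    """Return the openers still open at the end of LINE.

    Simplified model: PDF closes the innermost embedding/override unless an
    isolate is on top; PDI closes the innermost isolate together with any
    embeddings opened inside it.  Whatever remains is what rule X8 would
    terminate at the end of the paragraph (here: the end of the line).
    """
    stack = []
    for ch in line:
        if ch in EMBEDDING_OPENERS or ch in ISOLATE_OPENERS:
            stack.append(ch)
        elif ch == PDF:
            if stack and stack[-1] in EMBEDDING_OPENERS:
                stack.pop()
        elif ch == PDI:
            # Find the innermost open isolate, if any.
            for i in range(len(stack) - 1, -1, -1):
                if stack[i] in ISOLATE_OPENERS:
                    del stack[i:]
                    break
    return stack


# A two-line comment in the style shown above, with an RLO (U+202E)
# opened inside the first comment and never closed.
text = "/* first \u202E part */ code();\n/* and its continuation */"

# Treat each newline-separated line as its own paragraph.
for number, line in enumerate(text.split("\n"), start=1):
    still_open = [f"U+{ord(ch):04X}" for ch in unterminated_at_eol(line)]
    print(number, still_open)
```

In this sketch the override opened inside the first comment is still open
after that comment's closing `*/` and stays in effect until the end of the
line, while the second line starts clean even though, logically, the comment
continues there. Neither boundary coincides with the end of the comment.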


