emacs-orgmode
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Orgmode] Re: Exclamation mark not a postmatch ?


From: Carsten Dominik
Subject: Re: [Orgmode] Re: Exclamation mark not a postmatch ?
Date: Mon, 23 Mar 2009 20:50:45 +0100


On Mar 23, 2009, at 6:59 PM, Baoqiu Cui wrote:

Daniel Clemente <address@hidden> writes:

El dl, mar 23 2009, Baoqiu Cui va escriure:

Am I missing anything (or do I have to customize variable
org-emphasis-regexp-components to support this)? I tried to search the
mailing list archive about this, but could not find anything.

 I asked a similar thing but for quotations:
 http://www.mail-archive.com/address@hidden/msg04267.html

Yes, I read about that thread.  The problem you reported has something
to do with Unicode support in Org mode, and Carsten was trying to limit
Org-mode in plain ASCII characters to avoid things becoming messier.


The current way of explicitly including the supported characters is
safer and cleaner, at least before Org-mode is enhanced to cover more
international languages.


Let me clarify:

Org-mode should work fine with files containing text in any
encoding, including Asian languages and whatnot.  At least
this is what I aim for.

What you are referring to was a post where I said that I want the basic
syntax like todo keywords, tags, list bullets etc to be ASCII as much
as possible.

I would not mind to include a general class of open-quote and
close-quote characters into the character class in the emphasis
regular expression.  However, as far as I know, this is not
possible.  Syntax classes can do what Daniel wants, but only
partially.  However, syntax classes cannot be used in regular
expressions.  There are a few exceptions like [:digit:], but
quotes and parenthesis are not supported, and none of these
special [:classes:] are available in XEmacs.

I have read mastering regular expressions, and yes, in Perl, syntax
classes can be used in regular expression character classes.
Not so in Emacs - unless something has changed here.

- Carsten




Baoqiu

 I think there are regular expressions in Emacs for „initial
quotation sign“ and „final quotation sign“ which are generic and
 work for all possible signs. Since Unicode characters are classified
 and described, this generic detection should be possible without
 having to list all possible characters explicitly. There are many,
 for instance: « „ 「 ‘ “ ¿ ¡ etc.

 \s. in a regular expression already detects some punctuation
 characters (but not all). This or a better regexp could be part of
 org-emphasis-regexp-components

I should in the book „Mastering regular expressions“ at which level
 are „Unicode properties“ implemented in Emacs, and how to access
 them.


-- Daniel



_______________________________________________
Emacs-orgmode mailing list
Remember: use `Reply All' to send replies to the list.
address@hidden
http://lists.gnu.org/mailman/listinfo/emacs-orgmode





reply via email to

[Prev in Thread] Current Thread [Next in Thread]