[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[bug-gettext] [bug #32505] xgettext on C# file crashes when input file c
From: |
Daiki Ueno |
Subject: |
[bug-gettext] [bug #32505] xgettext on C# file crashes when input file contains Unicode literal with surrogates |
Date: |
Fri, 28 Nov 2014 08:57:09 +0000 |
User-agent: |
Mozilla/5.0 (X11; Linux x86_64; rv:33.0) Gecko/20100101 Firefox/33.0 |
Update of bug #32505 (project gettext):
Status: Confirmed => In Progress
Assigned to: m1gu3l => ueno
_______________________________________________________
Follow-up Comment #2:
According to ECMA-334 9.4.1, a Unicode code point in the range U+10000 to
U+10FFFF is represented using two Unicode surrogate code units. I couldn't
find any example in the specification, but a quick web search showed that the
following expressions represent the same string:
string s1 = "\U00020213"; // Codepoint U+20213
string s2 = "\uD840\uDE13"; // Encoded form of U+20213
This indeed works with Mono, but aborts xgettext. I'm attaching a tentative
patch to fix this.
(file #32566)
_______________________________________________________
Additional Item Attachment:
File name: 0001-csharp-Recognize-Unicode-surrogate-character-pair.patch Size:3
KB
_______________________________________________________
Reply to this item at:
<http://savannah.gnu.org/bugs/?32505>
_______________________________________________
Message sent via/by Savannah
http://savannah.gnu.org/
[Prev in Thread] |
Current Thread |
[Next in Thread] |
- [bug-gettext] [bug #32505] xgettext on C# file crashes when input file contains Unicode literal with surrogates,
Daiki Ueno <=