|
From: | Markus Mützel |
Subject: | [Octave-bug-tracker] [bug #57107] regexp functions fail on ISO-8859-1 input |
Date: | Sun, 3 Nov 2019 08:31:28 -0500 (EST) |
User-agent: | Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:71.0) Gecko/20100101 Firefox/71.0 |
Update of bug #57107 (project octave): Summary: regexp functions fail on ISO-8859 input => regexp functions fail on ISO-8859-1 input _______________________________________________________ Follow-up Comment #20: The attached patch wires in the validation of UTF-8 at a pretty low level. It applies on top of "bug57107_validate_u8.patch". It breaks "strread" and "textread" (and possibly also other things). But I'm still waiting for feedback on the maintainers mailing list to see if it is worth looking into why. Nevertheless, it demonstrates what could happen: octave:1> char (181) ans = µ octave:2> double (ans) ans = 194 181 octave:3> char ([181 228]) ans = µä octave:4> double (ans) ans = 194 181 195 164 (file #47784) _______________________________________________________ Additional Item Attachment: File name: bug57107_char.patch Size:9 KB <https://savannah.gnu.org/file/bug57107_char.patch?file_id=47784> _______________________________________________________ Reply to this item at: <https://savannah.gnu.org/bugs/?57107> _______________________________________________ Message sent via Savannah https://savannah.gnu.org/
[Prev in Thread] | Current Thread | [Next in Thread] |