octave-bug-tracker
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Octave-bug-tracker] [bug #57107] regexp functions fail on ISO-8859-1 in


From: Markus Mützel
Subject: [Octave-bug-tracker] [bug #57107] regexp functions fail on ISO-8859-1 input
Date: Sun, 3 Nov 2019 08:31:28 -0500 (EST)
User-agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:71.0) Gecko/20100101 Firefox/71.0

Update of bug #57107 (project octave):

                 Summary: regexp functions fail on ISO-8859 input => regexp
functions fail on ISO-8859-1 input

    _______________________________________________________

Follow-up Comment #20:

The attached patch wires in the validation of UTF-8 at a pretty low level. It
applies on top of "bug57107_validate_u8.patch".
It breaks "strread" and "textread" (and possibly also other things). But I'm
still waiting for feedback on the maintainers mailing list to see if it is
worth looking into why.
Nevertheless, it demonstrates what could happen:

octave:1> char (181)
ans = µ
octave:2> double (ans)
ans =

   194   181

octave:3> char ([181 228])
ans = µä
octave:4> double (ans)
ans =

   194   181   195   164



(file #47784)
    _______________________________________________________

Additional Item Attachment:

File name: bug57107_char.patch            Size:9 KB
    <https://savannah.gnu.org/file/bug57107_char.patch?file_id=47784>



    _______________________________________________________

Reply to this item at:

  <https://savannah.gnu.org/bugs/?57107>

_______________________________________________
  Message sent via Savannah
  https://savannah.gnu.org/




reply via email to

[Prev in Thread] Current Thread [Next in Thread]