You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The filter checks whether: n_ascii_chars < n_chars / n_utf_blocks
In the case of the string, there are 2 UTF blocks (due to \xa1), and all characters but one is ASCII, so it fails, and the string is processed again accepting only ASCII characters (thus cutting the string up to \xa1).
I tried old commits (even older than the UTF refactoring), and it seems that the string was already misinterpreted.
To fix this, I think we should modify the false-positive filter.
Yes, in the test it was using the Cs commands that didn't share the code with the izz (the code you refactored into librz/util/str_search.c), it had no such checks and FP filters. But I checked the string and it's absolutely valid, thus it's a bug in the FP filter.
I didn't see any work on this during this time, so I doubt it will be done for 0.4.0.
I'm removing the 0.4.0 milestone for now until someone starts to actively looking at this.
See the test called
Csa, Cs. and Cs.l
in `test/db/cmd/metadata.Regression was introduced in 39613b5 (#1752)
Fix is likely required in these files:
librz/util/str_search.c
libz/util/utf8.c
librz/util/str.c
The text was updated successfully, but these errors were encountered: