-
Notifications
You must be signed in to change notification settings - Fork 315
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
About some web content encoding issues #197
Comments
you can modify the code and base64encode it before output. |
@imfht do you have, by any chance, a copy of the content? The link is dead now and I could also use a base64 encoding for the output. |
@l3m0n I could be wrong but I believe this is the correct way to encode these characters in JSON. If you load using a standards compliant JSON loader (e.g. Python3 Are you processing the JSON data without actually de-serializing theJSON? if so, that is expected behavior |
@mzpqnxow I have issues with binary content: many (different) non-printable chars are encoded as |
@p-l- here is a capture from 010apartment.com - though I'm not having any issues with any apps or libraries in loading it, despite the presence of the escaped characters. Maybe it's helpful to you, maybe not:
|
@mzpqnxow I have no issues with that particular site. My use case is, for example, making a GET request to |
Ahhh... that makes much more sense, thanks for clarifying |
Conversion of binary responses to UTF8 occasionally yields U+FFFD [replacement characters](https://en.wikipedia.org/wiki/Specials_(Unicode_block)) (see zmap#197, zmap#263). As a result it is not possible to restore the original response. This introduces the `--hex` option to the `banner` module. When enabled, the `banner` value will contain server response in hex. Refs zmap#197, zmap#263
Conversion of binary responses to UTF8 occasionally yields U+FFFD [replacement characters](https://en.wikipedia.org/wiki/Specials_(Unicode_block)) (see #197, #263). As a result it is not possible to restore the original response. This introduces the `--hex` option to the `banner` module. When enabled, the `banner` value will contain server response in hex. Refs #197, #263 #325
I found that some web content will be encoded as \ufffd
E.g:
010-4200-4200.com
010apartment.com
The data stored in the database is garbled.
The text was updated successfully, but these errors were encountered: