Skip to content

v1.2.0 Large expansion in binary data scouring, visualizing, etc

Compare
Choose a tag to compare
@michelcrypt4d4mus michelcrypt4d4mus released this 22 Sep 00:02
· 212 commits to master since this release

1.2.0

  • Dramatic expansion in the pdfalyzer's binary data scouring capabilities:
    • Add chardet library guesses as to the encoding of all unknown byte sequences and ranks them from most to least likely
    • Add attempted decodes of all backtick, frontslash, single, double, and guillemet quoted strings in font binaries
    • Add decode attempts with Windows-1252, UTF-7, and UTF-16 encodings
    • Add --suppress-decodes to suppress attempted decodes of quoted strings in font binaries
    • Cool art gets generated when you swarm a binaries quoted strings, which are mostly but not totally random
  • The --font option takes an optional argument to limit the output to a single font ID
  • Add --limit-decodes to suppress attempted decodes of quoted strings in font binaries over a certain length
  • Add --surrounding option to specify number of bytes to print/decode before and after suspicious bytes; decrease default number of surrounding bytes
  • Add --version option
  • extract_guillemet_quoted_bytes() and extract_backtick_quoted_bytes() are now iterators
  • Fix scanning for UTF-16 BOM in font binary