Add automatic crib identification #48

unicornsasfuel · 2016-11-02T18:47:12Z

It would be amazingly useful to allow people to identify common substrings in some corpus of plaintexts. We have common words in English in the frequency section of Cryptanalib already, but this is pulled from publicly available data, in contrast to our character and multigraph frequency data, which we have calculated from Charles Dickens' A Tale of Two Cities.

It should be possible to automatically recognize cribs in some provided data, and this should boil down to the Longest repeated substring problem.

The text was updated successfully, but these errors were encountered:

unicornsasfuel · 2016-11-02T19:12:42Z

Okay, yeah, it looks like this is a complete pain in the ass, actually.

unicornsasfuel added enhancement help wanted labels Nov 2, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add automatic crib identification #48

Add automatic crib identification #48

unicornsasfuel commented Nov 2, 2016

unicornsasfuel commented Nov 2, 2016

Add automatic crib identification #48

Add automatic crib identification #48

Comments

unicornsasfuel commented Nov 2, 2016

unicornsasfuel commented Nov 2, 2016