Given the goal of covering the entire alphabet efficiently (finding correct letters without redundancy), which words are best to play, and in what order? This repo answers that question by weighting each candidate word by the ubiquity of its letters, as well as the local ubiquity of each letter in its position. This sorts words into four general, rank-discrete categories:
- (highest) Common letters in the right place
- (higher) Common letters in the wrong place
- (moderate) Uncommon letters in the right place
- (low) Uncommon letters in the wrong place
This ranking arises naturally from double-counting ubiquity within the local ubiquity of each letter (more common letters are, on average, more common in a given position than uncommon letters). To penalize redundancy, once the highest-ranked word is chosen, its letters have their weight set to zero and the ranking is recalculated. This has the nice feature of penalizing local redundancy while still allowing nonlocal redundancy when it occurs in an otherwise high-ranking word.
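The idea can be sketched roughly as follows. This is a minimal illustration, not the repo's actual code: the function names (`build_weights`, `score`, `pick_sequence`), the additive scoring, and the choice to zero a used letter's weight in every position are all assumptions made for the example.

```python
from collections import Counter


def build_weights(words):
    """Count how many words contain each letter (overall ubiquity) and how
    often each letter appears in each position (local ubiquity)."""
    letter_weight = Counter()
    position_weight = Counter()
    for word in words:
        letter_weight.update(set(word))          # count each letter once per word
        position_weight.update(enumerate(word))  # keys are (position, letter)
    return letter_weight, position_weight


def score(word, letter_weight, position_weight):
    """Assumed scoring: overall ubiquity plus positional ubiquity, so common
    letters in common positions are effectively counted twice."""
    overall = sum(letter_weight[ch] for ch in set(word))
    local = sum(position_weight[(i, ch)] for i, ch in enumerate(word))
    return overall + local


def pick_sequence(words, count, forbidden=()):
    """Greedily pick `count` words, zeroing the weights of used letters
    after each pick and re-ranking the remaining candidates."""
    letter_weight, position_weight = build_weights(words)
    banned = set(forbidden)
    candidates = [w for w in words if w not in banned]
    chosen = []
    for _ in range(min(count, len(candidates))):
        best = max(candidates, key=lambda w: score(w, letter_weight, position_weight))
        chosen.append(best)
        candidates.remove(best)
        # Assumption: a used letter is worth nothing afterwards, in any position.
        for ch in set(best):
            letter_weight[ch] = 0
            for i in range(len(best)):
                position_weight[(i, ch)] = 0
    return chosen
```

Calling `pick_sequence(word_list, 5, forbidden=("bekah", "frike"))` on a full five-letter word list would mirror the setup described here, though this sketch may not reproduce the exact ranking, since the real weighting may differ. Note that a word reusing already-covered letters can still be picked if its remaining letters score well enough.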
This routine finds the following set to be absolutely best:
- Tares
- Colin
- Dumpy
- Bight
- Flake
given that the words bekah and frike are forbidden. This constitutes what I believe to be the best sequence to use, particularly in games like Squabble and Sedecordle. Do what you will with this massively overpowered knowledge.
See LICENSE.md
Cheers,