-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Generating dictionary wordforms/unmunch #22
Comments
There is an examples/unmunch.py |
Works superbly compared to running wordforms over all roots (took me three days), unmuch is not supported for a while. This took like 30 seconds. But I have some differences. I am missing 2851 words and I have 319493 new words. |
Running Czech hunspell: http://www.translatoblog.cz/wp-content/uploads/2021/03/hunspell_cs.zip
|
All words missed by Spylls: This most likely means that they will not be assumed as correct during spellchecking. |
The new words created by spylls seems to be deficiency in original wordforms. |
Basically I see missing words of two kinds. Spylls:
Hunspell:
The surnames are maybe correct with Spylls, but wrong in Hunspell? But the |
Running:
Produces:
So this looks more like an unmnuch bug and not a general Spylls bug. |
vs
|
Found obviously missing code: #23 Suffix crossproduct is not analysed for prefixes. Btw, maybe secondary suffix crossproduct needs to be analysed as well? |
Btw, I am not even sure if the code in unmuch is right approach, sound't it be recursive check? After each prefix or suffix is added, check if new prefixes or suffixes cannot be added on top of that? I can image |
@exander77 ATM I, unfortunately, don't have much resource to discuss/debug it (I am in Kharkiv, Ukraine, splitting my days between volunteering, my dayjob, and doomscrolling). I'm thankful for your PR and I'll merge it if it works for you :) |
With that PR unmuch pretty much works for I don't want to get political on Github, but I am sending my:
from the Czech Republic. We had Soviet occupation here in 1968... I hope Czech Republic and whole European Union and NATO by extension will send enough support including weapons, so Ukraine can put Russia in its place. I think the hearts and minds of most Czech people are with Ukraine. |
The original Hunspell had two important utilities:
How to achieve this in
spylls.hunspell
?I use Hunspell to generate Scrabble dictionaries, and I am looking into replacing it with
spylls.hunspell
.The text was updated successfully, but these errors were encountered: