train_lm.py usage #65
Comments
Also: cleanuped -> cleaned up.
How about adding a final, optional 4th argument called lm_dir (the 3rd argument being 'work_dir'), so the user can specify where they want the final LM to be written? This will make life easier for callers, as they won't have to figure out where pocolm would put its output.
OK, I will add the work_dir and lm_dir arguments.
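For illustration, the proposal above (a required work_dir as the 3rd positional argument and an optional lm_dir as the 4th) could be sketched with argparse roughly as follows. This is not pocolm's actual code: the first two positional names (`text_dir`, `order`) and the fallback subdirectory name `lm` are hypothetical placeholders, since the thread does not state them.

```python
import argparse
import os


def build_parser():
    """Sketch of a parser with an optional trailing lm_dir argument."""
    parser = argparse.ArgumentParser(prog="train_lm.py")
    # Hypothetical first two positionals; the thread only names the 3rd and 4th.
    parser.add_argument("text_dir", help="directory with training text")
    parser.add_argument("order", type=int, help="n-gram order")
    parser.add_argument("work_dir", help="directory for intermediate files")
    # nargs="?" makes the final positional optional; it is None when omitted.
    parser.add_argument("lm_dir", nargs="?", default=None,
                        help="where to write the final LM (optional)")
    return parser


def resolve_lm_dir(args):
    """Return the user-supplied lm_dir, or a default location under work_dir."""
    if args.lm_dir is not None:
        return args.lm_dir
    # "lm" is a placeholder; the real default name is decided by the script.
    return os.path.join(args.work_dir, "lm")
```

With this shape, a caller who passes lm_dir knows exactly where the output lands, and a caller who omits it gets whatever default the script derives.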
OK, don't remove the suffix then. If people want control they can add the
On Sat, Sep 3, 2016 at 10:13 PM, Wang Jian wrote:
Yes, of course.
The usage message of train_lm.py (see below) does not agree with what the program actually does. The usage message suggests the output goes to lm_dir, but it goes to a subdirectory.
I think you should rename lm_dir in the args to work_dir. The usage message should also explain where the actual lm_dir output will be written. There should be an "epilog" provided to the usage message with example usages, preferably a couple: one with a vocab and one with num-words specified.
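As a rough sketch of what such an epilog could look like with argparse: the flag spellings `--wordlist` and `--num-words`, the positional names, and every path and value in the examples are assumptions made for illustration, not taken from the script itself.

```python
import argparse

# Example invocations shown at the end of --help; all values are hypothetical.
EPILOG = """\
examples:
  train_lm.py --wordlist=data/wordlist data/text 3 data/work
  train_lm.py --num-words=20000 data/text 3 data/work
"""


def build_parser_with_epilog():
    """Sketch of a parser whose help text ends with usage examples."""
    parser = argparse.ArgumentParser(
        prog="train_lm.py",
        description="Train a language model (illustrative description).",
        epilog=EPILOG,
        # Keep the line breaks in the epilog instead of re-wrapping them.
        formatter_class=argparse.RawDescriptionHelpFormatter,
    )
    parser.add_argument("--wordlist", help="path to a vocabulary file")
    parser.add_argument("--num-words", type=int,
                        help="vocabulary size to select automatically")
    parser.add_argument("text_dir")
    parser.add_argument("order", type=int)
    parser.add_argument("work_dir")
    return parser


if __name__ == "__main__":
    build_parser_with_epilog().print_help()
```

`RawDescriptionHelpFormatter` is used because the default formatter re-wraps the epilog into a single paragraph, which would run the example commands together.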
Also, you are using the 'basename' of the wordlist as part of the name of the lm_dir. What if the wordlist has a suffix, like foo.txt? Then foo.txt will become part of that name, which seems less than ideal to me. Maybe strip any final suffix.
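To make the suffix question concrete, here is a minimal sketch of stripping only the final suffix from a wordlist's basename using `os.path.splitext`. The helper name `wordlist_stem` is hypothetical and is not part of train_lm.py.

```python
import os


def wordlist_stem(wordlist_path):
    """Return the wordlist's basename with its final suffix removed."""
    base = os.path.basename(wordlist_path)
    # splitext drops only the last extension: "foo.tar.gz" -> "foo.tar".
    stem, _suffix = os.path.splitext(base)
    return stem


print(wordlist_stem("data/foo.txt"))  # -> foo
print(wordlist_stem("data/foo"))      # -> foo
```

Note that two different wordlists such as foo.txt and foo.lst would map to the same stem, which is one reason a caller might prefer the suffix to be kept.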