Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Include names of large cities #3

Open
1ec5 opened this issue Nov 10, 2014 · 3 comments
Open

Include names of large cities #3

1ec5 opened this issue Nov 10, 2014 · 3 comments

Comments

@1ec5
Copy link
Owner

1ec5 commented Nov 10, 2014

From [email protected] on 2008-06-06T02:26:56Z

The English spell checking dictionaries for Firefox include the names of
large cities, such as Boston. We should include names of large cities in
the Vietnamese dictionaries, too. There's no need to add cities for which
there is a common Vietnamese translation (such as "Luân Đôn" for London),
but many large cities don't have common translations (San Francisco,
Sydney, Tokyo, etc.).

This would be a low-risk addition, because the city names generally aren't
even close to Vietnamese words.

Original issue: http://code.google.com/p/hunspell-spellcheck-vi/issues/detail?id=3

@1ec5
Copy link
Owner Author

1ec5 commented Nov 10, 2014

From [email protected] on 2008-06-06T22:04:23Z

I've tried to look for words like Japan= Nhật Bản or Spain=Tây Ban Nha in the updated
dictionary list but it cannot find that country names.
However the spellchecking doesn't fail because there is Tây + Ban + Nha words
separated into the dictionary, is this correct? Do these words mean something isolated?
Otherwise we shouls just include the full "meaning" word "Tây Ban Nha" or in case we
make a typo like "Tây Ban Nhật", the dictionary will think is also correct.

@1ec5
Copy link
Owner Author

1ec5 commented Nov 10, 2014

From [email protected] on 2008-06-07T00:19:26Z

Words like "Tây Ban Nha" are transliterated from the Chinese name, in turn
transliterated from "España". The parts of these names all have meanings on their
own, some more obvious than others. For instance, "tây" means "west".

I took a look at a few of the spell checking dictionaries available for Firefox, and
none of them include compound words. Vietnamese makes much greater use of compound
words than most Western languages, but I think it'd be pretty difficult and
resource-intensive for a spell checker to verify that pairs of words are also
correct. I'd be surprised if Hunspell supports it.

For this bug, I was thinking more about non-Vietnamese place names that Vietnamese
speakers would tend to type in as is, like Sydney. But Vietnamese place names are
also important. Most Vietnamese place names probably aren't covered by the
dictionaries right now, because so many of them use very obscure component words, or
even ones that completely break Vietnamese spelling rules. So we'd have to figure out
our inclusion criteria for these proper names.

@1ec5
Copy link
Owner Author

1ec5 commented Nov 10, 2014

From [email protected] on 2008-06-07T01:09:10Z

Labels: Component-Wordlist

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant