Closest Match for Punjabi (Pakistan) Not Resolving Match #59
Comments
I believe the issue here is that the maximize() language function resolves pa and pa-PK to different maximized languages. I'm not a linguistics expert, so I don't know whether this is correct.
Similar issue here.
In [4]: langcodes.get("ko").language_name()
Out[4]: 'Korean'
In [5]: langcodes.get("kor_Hang").language_name()
Out[5]: 'Korean'
In [6]: langcodes.closest_match("ko", ["kor_Hang"])
Out[6]: ('und', 1000)
@BrightXiaoHan @joe-sciame-wm Thank you for the input! There is likely something to improve here. If I had to guess, this commit was made to address exactly the problem you are describing: georgkrause@59326f8. On a formal note: I have taken over the package and am working on updating it here: https://github.com/georgkrause/langcodes
I think the script tag is unnecessary when matching spoken languages.
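To make the suggestion above concrete, here is a minimal sketch of dropping the script subtag from a tag before passing it to a matcher. The helper name strip_script is hypothetical and is not part of langcodes. It assumes the script is the first four-letter alphabetic subtag after the language, and it only edits the string; it does not map kor to ko or validate the tag.

```python
import re


def strip_script(tag: str) -> str:
    """Return `tag` without its script subtag (hypothetical helper).

    Assumption: the script is the first four-letter alphabetic subtag
    that follows the language subtag, e.g. 'Hang' in 'kor_Hang'.
    """
    parts = re.split(r"[-_]", tag)
    for i, part in enumerate(parts[1:], start=1):
        if len(part) == 1:
            # A one-character subtag starts an extension or private-use
            # section, so no script subtag can follow it.
            break
        if len(part) == 4 and part.isalpha():
            del parts[i]
            break
    return "-".join(parts)


print(strip_script("kor_Hang"))
print(strip_script("pa-Arab-PK"))
```

This only makes sense for spoken-language matching. For written text the script can be the deciding factor, so it should not be stripped blindly.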
I'm attempting to match a language code 'pa' with another language code 'pa-PK'.
This returns
I would expect this to return a match and not None. When I debug the library, I see the following which returns 54 from the tuple_distance_cached function.
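A rough sketch of why a distance of 54 would end in "no match": if the matcher discards every candidate whose distance exceeds a cutoff, a single candidate at 54 leaves nothing to return. The code below is not the library's implementation. pick_closest and stub_distance are hypothetical names, the cutoff of 25 is an assumption about the default, and the only real number is the 54 reported above. The ('und', 1000) fallback mirrors the output shown in the comments.

```python
from typing import Callable, Iterable, Tuple

# Only the 54 comes from this issue; everything else is a placeholder.
STUB_DISTANCES = {("pa", "pa"): 0, ("pa", "pa-PK"): 54}


def stub_distance(desired: str, supported: str) -> int:
    """Stand-in for the real distance function (hypothetical)."""
    return STUB_DISTANCES.get((desired, supported), 1000)


def pick_closest(
    desired: str,
    supported: Iterable[str],
    max_distance: int = 25,  # assumed default cutoff, not confirmed
    distance: Callable[[str, str], int] = stub_distance,
) -> Tuple[str, int]:
    """Return the closest supported tag, or ('und', 1000) if none is within the cutoff."""
    best = ("und", 1000)
    for tag in supported:
        d = distance(desired, tag)
        if d <= max_distance and d < best[1]:
            best = (tag, d)
    return best


print(pick_closest("pa", ["pa-PK"]))                   # candidate is beyond the cutoff
print(pick_closest("pa", ["pa-PK"], max_distance=60))  # candidate is within the cutoff
```

If the real matching functions accept a similar cutoff argument, raising it may serve as a workaround, though that would not answer whether 54 is the right distance for pa and pa-PK in the first place.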