Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

oM, o~, and Unicode Character 'DEVANAGARI OM' #7

Open
funderburkjim opened this issue May 20, 2022 · 1 comment
Open

oM, o~, and Unicode Character 'DEVANAGARI OM' #7

funderburkjim opened this issue May 20, 2022 · 1 comment
Labels
bug Something isn't working

Comments

@funderburkjim
Copy link
Contributor

@drdhaval2785
In #6, a preliminary step was to convert the Devanagari version to slp1.
In checking the invertibility, I noticed what may be a bug in the Devanagari version.
The Devanagari had the \u0950 Devanagari ॐ character in two cases, L=516 and L=3814.

The scan for 516 is
image
and this is represented in slp1 of ben.txt as anoMkfta.
and in BEN_main_L2a.txt as अन्ॐकृत
whereas the Devanagari should be अनोंकृत

I suspect this to be a bug in converting from 'oM' to devanagari unicode.
In slp1, 'o~' converts to devanagari ॐ.

The L=3814 case is similar.

@funderburkjim funderburkjim added the bug Something isn't working label May 20, 2022
@funderburkjim
Copy link
Contributor Author

I suspect that BEN_main_L2a.txt derives from the version https://raw.githubusercontent.com/sanskrit-lexicon/csl-devanagari/main/v02/ben/ben.txt, so, if the above is indeed the result of a bug, the csl-devanagari respository would be the locus.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant