oM, o~, and Unicode Character 'DEVANAGARI OM' #7

funderburkjim · 2022-05-20T18:23:49Z

@drdhaval2785
In #6, a preliminary step was to convert the Devanagari version to slp1.
In checking the invertibility, I noticed what may be a bug in the Devanagari version.
The Devanagari had the \u0950 Devanagari ॐ character in two cases, L=516 and L=3814.

The scan for 516 is

and this is represented in slp1 of ben.txt as anoMkfta.
and in BEN_main_L2a.txt as अन्ॐकृत
whereas the Devanagari should be अनोंकृत

I suspect this to be a bug in converting from 'oM' to devanagari unicode.
In slp1, 'o~' converts to devanagari ॐ.

The L=3814 case is similar.

funderburkjim · 2022-05-20T18:28:05Z

I suspect that BEN_main_L2a.txt derives from the version https://raw.githubusercontent.com/sanskrit-lexicon/csl-devanagari/main/v02/ben/ben.txt, so, if the above is indeed the result of a bug, the csl-devanagari respository would be the locus.

funderburkjim added the bug Something isn't working label May 20, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

oM, o~, and Unicode Character 'DEVANAGARI OM' #7

oM, o~, and Unicode Character 'DEVANAGARI OM' #7

funderburkjim commented May 20, 2022

funderburkjim commented May 20, 2022

oM, o~, and Unicode Character 'DEVANAGARI OM' #7

oM, o~, and Unicode Character 'DEVANAGARI OM' #7

Comments

funderburkjim commented May 20, 2022

funderburkjim commented May 20, 2022