Add kannada wiktionary as stardict dictionary #1

Open
vvasuki opened this issue Apr 10, 2016 · 7 comments

Comments

@vvasuki
Member

vvasuki commented Apr 10, 2016

@damooo would you be interested in this? It would help us kannada speakers greatly.

@vvasuki
Member Author

vvasuki commented Apr 11, 2016

Thanks! Please mark purANa encyclopedia issue as fixed once you're done
pushing the changes.

I don't think that there is any significant deviation between (modern)
kannaDa and telugu. Please see
http://www.virtualvinodh.com/wp/character-matrix/ .

Here is an example entry:
https://kn.wiktionary.org/wiki/%E0%B2%AC%E0%B2%82%E0%B2%A1%E0%B3%81 Just
visit
https://kn.wiktionary.org/wiki/%E0%B2%B5%E0%B2%BF%E0%B2%B6%E0%B3%87%E0%B2%B7:Random
as many times as you like to get a broader sample. As I have not used
kn.wiktionary a lot, I can't yet find you the best model entries.

2016-04-11 9:45 GMT-07:00 श्रीराम [email protected]:

viswas ji, can you give some brief information about any special differences,
such as special letters, and whether there are any hrasva (short) letters
that differ from Devanagari (or Telugu)? And, if possible, a list of
some 10 search words in Wiktionary whose entries contain all types of headings,
such as grammatical details, usage, translations, etc.? Then I can preserve
all of them by checking those 10 words every time I make a modification, and
be sure I didn't mess anything up.



Vishvas /विश्वासः

@damooo

damooo commented Apr 17, 2016

Done.

update

More than two and a half lakh pages of wiktionary-kn were mirrored
and edited, with text formatting preserved. Headwords of all languages
were extracted and added, and all types of transliterations, including
Devanagari, were added to the headwords, among many other things.
Because of the language barrier, the processing was driven entirely by HTML tags,
so if there are any mistakes, please mention them.
Once it is sufficiently error-free, I will work on the
items in my todo list. Some languages like Mandarin ( :D ) and
Urdu are also there as main words, with explanations in Kannada.
No translation has been added for them, so to check them you will have to
type them yourself for now.

And some search words
ಮುೞುಂಕು
ಅಮೃತ
ಹಾಲು
ನೆರೆ
ಬಗೆ
ಬಿಡು
ಎಲ್ಲ ಭಾಷೆಗಳು
ಒಲವು
ಕೊರೆ
.......etc. : )

@vvasuki
Member Author

vvasuki commented Apr 17, 2016

This is wonderful work, and a gift to all Indian language lovers, thanks! (I moved your files to kn-head for now.)

A few requests:

  • In kn-head/wiktionary_kn, retain only kannaDa headword entries.
  • Include devanAgarI transliteration of kannada headwords.
  • Create files in en-head/wiktionary_en_kn/wiktionary_en_kn.* which have English headwords.
  • [Optional] Create files in nonenkn-head/wiktionary_nonenkn/wiktionary_nonenkn.* for all other headwords.

This splitting makes sense for the following reasons:

  • It makes the file sizes easily manageable.
  • It lets users pick the particular small dicts they want while ignoring others. (Especially important for people with old phones with limited memory.)
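
The split requested above could be sketched as a small headword classifier. This is only an illustration, assuming headwords can be bucketed by Unicode script; the function name `classify_headword` and the returned labels are hypothetical and are not taken from the actual conversion scripts.

```python
# Hypothetical sketch: bucket a headword by script so that each bucket
# can be written to its own StarDict file set.
KANNADA_BLOCK = range(0x0C80, 0x0D00)  # Unicode Kannada block


def classify_headword(headword: str) -> str:
    """Return a bucket label (assumed names) for a dictionary headword."""
    # Any Kannada code point marks the entry as a Kannada headword.
    if any(ord(ch) in KANNADA_BLOCK for ch in headword):
        return "kn-head"
    # Headwords whose letters are all ASCII are treated as English.
    letters = [ch for ch in headword if ch.isalpha()]
    if letters and all(ch.isascii() for ch in letters):
        return "en-head"
    # Everything else (Devanagari, Urdu, Mandarin, ...) goes to the rest.
    return "nonenkn-head"
```

Treating every ASCII-only headword as English is a simplification; other Latin-script languages would need an extra check.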

The addition of devanAgarI headwords of course makes sense for the same reason it makes sense in the case of telugu (the most important reason, from my perspective, is that it helps look up sanskrit-root words in other languages).

Also, interesting to know about your interest in other languages :-D. Urdu is a good idea, but I would argue that Indian (hindu) regional languages are a much higher priority (from the perspective of their commonalities with each other through sanskrit).

@vvasuki
Member Author

vvasuki commented Apr 17, 2016

Very good, thanks shrIrAma! Can you also add optitrans for kannada?

@vvasuki
Member Author

vvasuki commented Feb 24, 2017

I see optitrans and devanAgarI headwords as well! Thanks, @damooo !

Only, the right devanAgarI mAtrA-s are not being used for o and e. Example:
Compare ಕೊಕ್ಕರೆ|kokkare|कोक्करे in wiktionary-kn with కొక్కరాయి|kokkaraayi|कॊक्करायि in wiktionary-te . Can you fix, @damooo ?
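
For illustration, here is a minimal sketch of one way to keep the short vowels distinct. It is not the script used for these dictionaries; it assumes the headword is plain Unicode text and relies on the Kannada and Devanagari blocks sharing the same layout, so that the short-e and short-o signs (U+0CC6, U+0CCA) land on their Devanagari counterparts (U+0946, U+094A) rather than on the long-vowel signs. The function name and the chosen ranges are assumptions.

```python
import unicodedata

# Distance between the Kannada (U+0C80) and Devanagari (U+0900) blocks.
KN_TO_DEVA_OFFSET = 0x0C80 - 0x0900

# Code point ranges assumed to line up one-to-one between the two blocks:
# anusvara/visarga, letters and vowel signs up to the virama, and digits.
ALIGNED_RANGES = ((0x0C82, 0x0C83), (0x0C85, 0x0CCD), (0x0CE6, 0x0CEF))


def kannada_to_devanagari(text: str) -> str:
    """Map Kannada code points to Devanagari by block offset (sketch)."""
    # NFC composes two-part vowel signs (e.g. U+0CC6 U+0CC2 -> U+0CCA)
    # so that each vowel sign is a single code point before shifting.
    text = unicodedata.normalize("NFC", text)
    out = []
    for ch in text:
        cp = ord(ch)
        if any(lo <= cp <= hi for lo, hi in ALIGNED_RANGES):
            out.append(chr(cp - KN_TO_DEVA_OFFSET))
        else:
            out.append(ch)  # leave everything else untouched
    return "".join(out)


print(kannada_to_devanagari("ಕೊಕ್ಕರೆ"))
```

With this mapping ಕೊಕ್ಕರೆ comes out as कॊक्करॆ, matching the short-vowel convention seen in the wiktionary-te example. Characters outside the listed ranges, such as ೞ in ಮುೞುಂಕು, have no aligned Devanagari counterpart and are passed through unchanged.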

@damooo

damooo commented Feb 24, 2017

Will soon add an entirely new HTML version, with a script that can be rerun for periodic updates. It is already created, but still under testing.

@damooo

damooo commented Feb 28, 2017

This update may take a week.
