Skip to content

Convert non-ascii characters to "good enough" ascii in julia.

License

Notifications You must be signed in to change notification settings

altre/TextUnidecode.jl

Repository files navigation

TextUnidecode

Build Status Coveralls

Convert non-ascii characters to "good enough" ascii.

julia> unidecode("南无阿弥陀佛")
"Nan Wu A Mi Tuo Fo"

julia> unidecode("あみだにょらい")
amidaniyorai

References

This package is a more or less direct port of the java package unidecode which in turn is probably one of many ports of the Perl Package Text::Unidecode by Sean M. Burke.

The similarly named julia package Unidecode solves a different problem: Re-converting autocompleted Latex or Emoji back to the original UTF-8 string.