We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
在一些输入法里不支持带分隔符的拼音词库,只支持用户短语,比如微软的用户自定义短语,手机 Gboard 的个人词典, 这些编码为连续的拼音串 pinyinfengesuanfa。 在以这种格式为源格式时,需要将其转换为带分隔符的编码 pin'yin'fen'ge'suan'fa。 现在的方式是忽略原编码,而由程序自动注音,可能导致注音不准,而且效率低下。 我们需要一个拼音分割算法,由 连续的拼音串(pinyinfengesuanfa) 和 词组(拼音分割算法) 进行分割。
pinyinfengesuanfa
pin'yin'fen'ge'suan'fa
对于有歧义的拆分,可以通过以下信息解决:
The text was updated successfully, but these errors were encountered:
另一种思路,将每个字可能的读音做笛卡尔积,对比拼音串。
Sorry, something went wrong.
No branches or pull requests
在一些输入法里不支持带分隔符的拼音词库,只支持用户短语,比如微软的用户自定义短语,手机 Gboard 的个人词典,
这些编码为连续的拼音串
pinyinfengesuanfa
。在以这种格式为源格式时,需要将其转换为带分隔符的编码
pin'yin'fen'ge'suan'fa
。现在的方式是忽略原编码,而由程序自动注音,可能导致注音不准,而且效率低下。
我们需要一个拼音分割算法,由 连续的拼音串(pinyinfengesuanfa) 和 词组(拼音分割算法) 进行分割。
对于有歧义的拆分,可以通过以下信息解决:
The text was updated successfully, but these errors were encountered: