chewing-cli should also have a bopomofo (zhuyin) correction feature #655

Open
kanru opened this issue Nov 7, 2024 Discussed in #654 · 1 comment

Comments

kanru (Member) commented Nov 7, 2024

Discussed in #654

Originally posted by llc0930 November 5, 2024
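chewing-cli should also have a bopomofo correction feature.
Correct 一 (the Chinese character) to ㄧ (the bopomofo symbol); correct 丫 to ㄚ.
Especially the former... the Ministry of Education's xls and xlsx files are a real nightmare.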
[Screenshot: 2024-11-05-10:53:20-r]
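
The correction requested above could be sketched as a small preprocessing step run before the file is handed to chewing-cli. This is only an illustration, not how chewing-cli works: `fix_bopomofo` and `fix_line` are hypothetical names, and the line layout (phrase, frequency, then whitespace-separated syllables) is an assumption about the input file. Only the reading fields are rewritten, because 一 and 丫 are legitimate characters inside the phrase itself.

```python
# Hypothetical preprocessing helpers; not part of chewing-cli.

# Chinese characters that look like bopomofo symbols, mapped to the real symbols.
LOOKALIKES = str.maketrans({
    "\u4e00": "\u3127",  # 一 (CJK ideograph) -> ㄧ (bopomofo letter I)
    "\u4e2b": "\u311a",  # 丫 (CJK ideograph) -> ㄚ (bopomofo letter A)
})


def fix_bopomofo(reading: str) -> str:
    """Replace look-alike Chinese characters in a reading with bopomofo symbols."""
    return reading.translate(LOOKALIKES)


def fix_line(line: str) -> str:
    """Fix only the syllable fields of one source line.

    Assumed layout (an assumption, check it against your file):
    phrase, frequency, then one or more syllables, separated by whitespace.
    """
    parts = line.split()
    if len(parts) < 3:
        # Not enough fields to contain a reading; leave the line untouched.
        return line
    phrase, freq, *syllables = parts
    return " ".join([phrase, freq, *(fix_bopomofo(s) for s in syllables)])
```

Applying `fix_line` to every line of the source text would leave the phrase column alone while normalizing the readings, so a phrase such as 一下 keeps its 一 but its reading no longer contains it.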

chewing-cli evidently removes duplicate phrases when it builds the dictionary, so I did not deduplicate the input...
《成語典》dict_idioms_2020_20240926.txt

```
chewing-cli init-database -n "《成語典》" -c "中華民國教育部" -l "CC BY-ND 3.0 臺灣" -r "2020_20240926" ./《成語典》dict_idioms_2020_20240926.txt dict_idioms_20240926.dat

== Trie Dictionary Statistics ==
Node count           : 19226
Leaf count           : 5186
Phrase count         : 5456
Max height           : 9
Average height       : 1
Root branch count    : 705
Max branch count     : 80
Average branch count : 0
```
kanru added this to the v0.10.0 milestone Nov 7, 2024
kanru (Member, Author) commented Nov 11, 2024

Other simple corrections: #656 (comment)
