Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Phonetic guides are added to column values #104

Open
woylie opened this issue Apr 12, 2021 · 0 comments
Open

Phonetic guides are added to column values #104

woylie opened this issue Apr 12, 2021 · 0 comments

Comments

@woylie
Copy link

woylie commented Apr 12, 2021

Excel has a feature called phonetic guides for Japanese characters. When kanji are entered in a column, Excel adds phonetic guides as katakana. In Excel, these guides can be hidden or shown.

When opening an xlsx file like that with LibreOffice, Apple Numbers or Google Sheets, you can only see the actual column content (without the guides). However, when parsing the data with xlsxir, the phonetic guides are appended to the column content. For example, instead of 國學院大学, a column will contain 國學院大学ダイガク.

While googling for a solution, I found the same issue description in other tools: SDL, ACS, MicroStrategy.

According to this post,

For storing each unique text from a cell, Excel uses something called a "shared string table" and the content of each cell is the index of the text from that table. When we implemented the filter we erroneously thought that every "shared string" item contains only the text of the cell and some formatting belonging to that text. However, after this post, we found out that the phonetic translations are also found there.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant