Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug]: Missing Chinese character and part of number; #19089

Open
atlenovo opened this issue Nov 22, 2024 · 5 comments
Open

[Bug]: Missing Chinese character and part of number; #19089

atlenovo opened this issue Nov 22, 2024 · 5 comments

Comments

@atlenovo
Copy link

Attach (recommended) or Link to PDF file

011028 (5).pdf

Web browser and its version

Microsoft Edge 131.0.2903.51

Operating system and its version

Microsoft Window 11 23H2 22631.4460

PDF.js version

PDF.js: 4.3.8 [c419c83]

Is the bug present in the latest PDF.js version?

Yes

Is a browser extension

No

Steps to reproduce the problem

open pdf file in issue above with pdfjs

What is the expected behavior?

image

What went wrong?

企业微信截图_1732267086437 its a file with font subset embedded, shown as missing all Chinese character and part of number character

Link to a viewer

No response

Additional context

No response

@SprASum

This comment was marked as off-topic.

@kphuanghk
Copy link

It seems the sample PDF does not properly embed required Chinese glyphs, some viewers may work, but technically it does not embed the required glyphs, I think pdf.js just reflects the truth.

You can test with Ghostscript command

gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4  -dNOPAUSE -dQUIET -dBATCH -sOutputFile=improvedchinese.pdf 011028.pdf

Below output hints the missing glyphs

GPL Ghostscript 9.26: Missing glyph CID=28007, glyph=6d67 in the font AAAAAA+SourceHanSerifCN-Bold . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=25812, glyph=64d4 in the font AAAAAA+SourceHanSerifCN-Bold . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=10237, glyph=27fd in the font AAAAAA+SourceHanSerifCN-Bold . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=14726, glyph=3986 in the font AAAAAA+SourceHanSerifCN-Bold . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=14726, glyph=3986 in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=20328, glyph=4f68 in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=10072, glyph=2758 in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=13962, glyph=368a in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=14049, glyph=36e1 in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=28007, glyph=6d67 in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=25812, glyph=64d4 in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=10981, glyph=2ae5 in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=10237, glyph=27fd in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=15413, glyph=3c35 in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=25791, glyph=64bf in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=15364, glyph=3c04 in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=30523, glyph=773b in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=30524, glyph=773c in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=13951, glyph=367f in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=14645, glyph=3935 in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=24476, glyph=5f9c in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=24466, glyph=5f92 in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=30528, glyph=7740 in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=30530, glyph=7742 in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=30520, glyph=7738 in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.
GPL Ghostscript 9.26: Missing glyph CID=14288, glyph=37d0 in the font BAAAAA+SourceHanSerifCN-Regular . The output PDF may fail with some viewers.

Hope this helps.

@SprASum

This comment was marked as off-topic.

@calixteman
Copy link
Contributor

From Acrobat on WIndows 11:
image
and it's fine in Chrome.

@SprASum

This comment was marked as off-topic.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

5 participants