-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
整理资料库时出现Wide character导致失败的问题 #4
Comments
谢谢!也请发个pull request,留email,我把说好的脚本寄给您。 |
@bandiaozimu 谢谢,我的邮箱: [email protected] |
你单独运行The Unarchiver,可以调整设定,你要勾选以下项目: Extraction Advanced |
在将某些资源导入资料库时,终端如下所示报错:
----- move to /Users/XXXXX/voice1808251341
----- unfoldDLSiteFile
----- grapDLCount
Wide character at /Library/Perl/5.18/darwin-thread-multi-2level/Encode.pm line 296.
----- buildDLSite
看了下err.log中显示:
malformed JSON string, neither array, object, number, string or atom, at character offset 0 (before "(end of string)") at ./buildDLSite line 41.
发现是new_works.json中没有任何信息输出。
于是定位到grapDLCount文件,尝试注释掉第71行
后能够工作,故发现是因为从dlsite爬到的work_text信息没有被成功解码。
通过分析这些失败资源的异同,发现原因出于work_text中的
…
字符,也即HTML中的省略符…
。可以通过在前面增加:
$result{'text'} =~ s/…/.../;
将
…
替换为三个.
解决。The text was updated successfully, but these errors were encountered: