Some missing domains #15
Comments
Could you provide more information on where these URLs are referenced? |
I used this script to find these URLs: https://pastebin.com/8CgQLz9N. The URLs look like: https://samurai-wup.cdn.nintendo.net/d/man_JCXJ_00.pdf |
Can you tell me which specific metadata files (from the samurai folder) these are referenced in, so this can be verified independently? The links by themselves aren't too useful unless I understand how they are referenced in the metadata. |
For 3DS, only PDFs from samurai servers are referenced, and only in the JP region for some titles. Sadly, all of them return status 404 with the message "Sorry, not found" when fetched with curl. Other than JP, only the ES region has exactly one title (50010000044217) that references a PDF on a Capcom server. This server is likely unaffected by the impending eShop shutdown, but since it's only one file, it's out of scope for saveShop scraping. Interested people should download this file manually. |
The PDFs are referenced in samurai/{}/{}/title/{titleid} |
Also, try https://www.nintendo.co.jp/data/software/manual/{filename} as a backup option. |
Very interesting, that works! I won't be able to integrate this quickly enough into saveShop, but interested people can run the following UNIX command chain to download the files manually:
mkdir manuals && cd manuals
ack "\.pdf" ../samurai/JP/ja/ | tr '>' '\n' | grep 'https.*pdf' | cut -d'<' -f1 | sort | sed 's/samurai-ctr.cdn.nintendo.net\/d/www.nintendo.co.jp\/data\/software\/manual/g' | xargs -I{} wget '{}' |
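For anyone who would rather inspect the rewritten URLs before downloading anything, here is a rough Python sketch of the extract-and-rewrite part of the command chain above. It is not part of saveShop. The function names are made up for illustration, and it assumes the PDF links appear as plain `https://...pdf` text inside the metadata files. Unlike the shell version, it also removes duplicate URLs and does not download anything.

```python
import re
from pathlib import Path

# Assumption: PDF links appear as plain text in the metadata, ending in ".pdf".
PDF_URL = re.compile(r"https://[^<>\s\"']+\.pdf")

# Same substitution as the sed step in the command chain.
OLD_PREFIX = "samurai-ctr.cdn.nintendo.net/d"
NEW_PREFIX = "www.nintendo.co.jp/data/software/manual"


def extract_pdf_urls(text):
    """Return every https URL ending in .pdf found in the given text."""
    return PDF_URL.findall(text)


def to_backup_url(url):
    """Point a samurai-ctr PDF URL at the www.nintendo.co.jp manual path."""
    return url.replace(OLD_PREFIX, NEW_PREFIX)


def collect_backup_urls(root):
    """Scan all files under root and return sorted, de-duplicated backup URLs."""
    urls = set()
    for path in Path(root).rglob("*"):
        if path.is_file():
            text = path.read_text(encoding="utf-8", errors="ignore")
            urls.update(to_backup_url(u) for u in extract_pdf_urls(text))
    return sorted(urls)
```

Calling `collect_backup_urls("samurai/JP/ja")` from the directory that contains the samurai folder should give a list that can be written to a file and passed to `wget -i`.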
Hi, I have done some data mining and found some more URLs that are missing:
img-eshop.cdn.nintendo.net # firth image cdn
m1.nintendo.net # pdf
www.nintendo.co.jp # pdf
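A list like the one above can be cross-checked against the metadata with a few lines of Python. This is only a sketch: `count_hosts` and `missing_hosts` are hypothetical helpers, and the set of already-covered hosts has to be supplied by whoever runs it, since the thread does not list which domains saveShop currently scrapes.

```python
import re
from collections import Counter
from urllib.parse import urlsplit

# Assumption: URLs appear as plain http(s) text in the metadata files.
URL = re.compile(r"https?://[^<>\s\"']+")


def count_hosts(text):
    """Count how often each hostname appears among the URLs in the text."""
    hosts = (urlsplit(u).hostname for u in URL.findall(text))
    return Counter(h for h in hosts if h is not None)


def missing_hosts(text, known_hosts):
    """Return the sorted hostnames found in the text but not in known_hosts."""
    return sorted(set(count_hosts(text)) - set(known_hosts))
```

Feeding it the concatenated metadata text and the set of domains already handled gives the domains that still need attention.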