Rework notebooks to use the static self-hosted fake job board #350
indeed.com has tightened its bot protection against web scraping, so requests to the site as described in this course now return 403 Forbidden status codes.
I tried to get around this by sending fake request headers (something that could still be explained in an intro course), but without success: the responses are still 403.
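For reference, the header attempt can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the exact code I tried: the header values, the target URL, and the helper names `build_request` and `fetch_status` are all assumptions made up for this example.

```python
import urllib.error
import urllib.request

# Hypothetical browser-like headers; the exact values are an assumption.
FAKE_HEADERS = {
    "User-Agent": (
        "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 "
        "(KHTML, like Gecko) Chrome/120.0 Safari/537.36"
    ),
    "Accept-Language": "en-US,en;q=0.9",
}


def build_request(url, headers=None):
    """Create a GET request that carries browser-like headers."""
    return urllib.request.Request(url, headers=headers or FAKE_HEADERS)


def fetch_status(url, headers=None, timeout=10):
    """Return the HTTP status code for url, including error codes such as 403."""
    request = build_request(url, headers)
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        # urlopen raises on 4xx/5xx responses; the code is still informative.
        return error.code
```

Calling `fetch_status("https://www.indeed.com/")` performs a live request, so the result depends on the site's current protection; a 403 here is what motivated this change.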
I've previously reworked the written tutorial to use a self-hosted fake job board that I set up just for the purpose of the tutorial.
As a quick fix for the video course, I added an explanatory lesson to the video course and reworked the Jupyter notebooks.
The information and processes that I explain in the rest of the course are still valid and remain a good introduction to scraping a static website.
Where to put new files:
my-awesome-article
How to merge your changes: