- Read the guideline before start
In this task you will parse https://quotes.toscrape.com/ website to get list of
all quotes from it. This site uses pagination to show data, but you still have to
get all quotes - so to scrape all pages. Each quote should consist of text
,
author
and tags
- the classes structure already implemented in app/parse.py
.
You need to write main
function, which takes output_csv_path
- it's a file, that should
be created, and in which all data about quotes should be written.
Hints:
- keep your code as simple as possible;
- write reusable functions;
- be gentle to website resources.
Also collect author's biography for each quote (or even separate it to another csv file). Of course here you should not load the same page for the same author several times (cache it).