Scraping a paginated webpage

In this project, I utilized web scraping techniques to gather data from a paginated webpage. Specifically, I extracted Hockey Team data from ScrapeThisSite, which is an example of a paginated webpage. A paginated webpage is one that is organized into multiple pages, often to accommodate a large amount of data. In the case of the specified page, the Hockey Team data spanned 582 rows and was distributed across multiple pages for a more organized presentation.

The scraping process was facilitated by the BeautifulSoup library -- a powerful Python library for pulling data out of HTML and XML files.

Exploratory Data Analysis

Following the data retrieval, I conducted Exploratory Data Analysis (EDA) using the pandas, matplotlib, and seaborn libraries. This analysis aims to gain insights, visualize patterns, and uncover trends within the scraped Hockey Team data. The combination of these libraries allowed for comprehensive data exploration and visualization, enhancing the understanding of the dataset.

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
Hockey-Teams.csv		Hockey-Teams.csv
README.md		README.md
main.ipynb		main.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Scraping a paginated webpage

Exploratory Data Analysis

About

Releases

Packages

Languages

ni-lie/web-scraping-plus-eda

Folders and files

Latest commit

History

Repository files navigation

Scraping a paginated webpage

Exploratory Data Analysis

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages