news_autoscraper

A daily scraper for the top 3 news headlines from:

Each [news_org]_scraper.py script scrapes its news organisation's home page with BeautifulSoup every day at 05:30 UTC and appends the headlines to updated_headlines.csv.
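As a rough illustration of that scrape-and-append flow, here is a minimal sketch. It is not the repository's actual code: the `h2` tag name, the CSV column layout (`date`, `source`, `rank`, `headline`), the User-Agent string, and all function names are assumptions made for the example, and each real site will need its own selector.

```python
import csv
from datetime import datetime, timezone
from pathlib import Path
from urllib.request import Request, urlopen

from bs4 import BeautifulSoup

CSV_PATH = Path("updated_headlines.csv")
FIELDS = ["date", "source", "rank", "headline"]  # assumed column layout


def fetch_html(url: str) -> str:
    """Download a page. The User-Agent value is a placeholder."""
    request = Request(url, headers={"User-Agent": "news_autoscraper-example"})
    with urlopen(request, timeout=30) as response:
        return response.read().decode("utf-8", errors="replace")


def top_headlines(html: str, tag_name: str = "h2", limit: int = 3) -> list[str]:
    """Return the first `limit` distinct, non-empty texts of `tag_name` tags."""
    soup = BeautifulSoup(html, "html.parser")
    headlines: list[str] = []
    for tag in soup.find_all(tag_name):
        text = tag.get_text(" ", strip=True)
        if text and text not in headlines:
            headlines.append(text)
        if len(headlines) == limit:
            break
    return headlines


def append_headlines(source: str, headlines: list[str], path: Path = CSV_PATH) -> None:
    """Append one row per headline, writing the header only for a new file."""
    write_header = not path.exists() or path.stat().st_size == 0
    today = datetime.now(timezone.utc).date().isoformat()
    with path.open("a", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=FIELDS)
        if write_header:
            writer.writeheader()
        for rank, headline in enumerate(headlines, start=1):
            writer.writerow(
                {"date": today, "source": source, "rank": rank, "headline": headline}
            )
```

A scraper script built this way would call `append_headlines("some_source", top_headlines(fetch_html(url)))` once per run, leaving the daily scheduling to whatever runs the script.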

Resources: https://jonathansoma.com/

Analysis

Take a look at pie_charts.ipynb for basic counts of the most frequently occurring words between two dates, after removing stopwords:

2024_count_fig

top_10_count_fig
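
For readers who want the gist of the counting step without opening the notebook, the sketch below shows one way to do it with only the standard library. It is not taken from pie_charts.ipynb: the `date` and `headline` column names, the tiny stopword set, and the `word_counts` function are assumptions for illustration, and the notebook may well use a different stopword list or library.

```python
import re
from collections import Counter
from datetime import date

# Deliberately small, hypothetical stopword list; a real analysis would use a fuller one.
STOPWORDS = {
    "a", "an", "and", "are", "as", "at", "for", "in",
    "is", "of", "on", "the", "to", "with",
}


def word_counts(rows, start: date, end: date, top_n: int = 10):
    """Count non-stopword words in headlines dated between start and end (inclusive)."""
    counts = Counter()
    for row in rows:
        # Dates are assumed to be stored as ISO strings such as "2024-01-31".
        if start <= date.fromisoformat(row["date"]) <= end:
            words = re.findall(r"[a-z']+", row["headline"].lower())
            counts.update(word for word in words if word not in STOPWORDS)
    return counts.most_common(top_n)
```

The `rows` argument can be any iterable of dictionaries, for example the output of `csv.DictReader` over updated_headlines.csv, and the returned list of (word, count) pairs can be passed straight to a plotting library.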