Scanning various event websites to compile a list of what's on, where, and when in the Newbury area. This uses ChromeDriver to navigate the pages, Beautiful Soup to parse the HTML, then the iCalendar Python library to form an RFC5545 compliant calendar (.ics) file that can be imported or linked to from MS-Outlook, Google Calendar etc.
Ultimately I intend to:
- Cover all event venues in/around the area
- Host the ICS files for linking or import to calendars (GitHub, or S3)
- Build a pipeline to periodically build, validate and release the ICS files
Venues and their event websites that I have in mind include:
Venue | Website | First Attempt |
---|---|---|
Corn Exchange | https://cornexchangenew.com/events/ | cornExchange.ics |
Arlington Arts Centre | https://arlingtonarts.ticketsolve.com/ticketbooth/shows?i=64 | arlington.ics |
Ace Space (self-published) |
https://acespace.org.uk/events/ | ace-space-newbury-041df7d619b.ics |
The WaterMill | https://www.watermill.org.uk/ | |
Shaw House | https://booking.westberks.gov.uk/heritage_events.html | |
The Mount, Wasing | https://www.wasing.co.uk/events/ | |
Welford Park | https://www.welfordpark.co.uk/events/ | |
Highclere Castle | https://www.highclerecastle.co.uk/events/ |
Requirements for the software and other tools to build, test and push
- Download the latest ChromeDriver executable for your platform
- Download/Fork this repo
- Pip Install the requirements (sorry - no requirements.txt yet)
- At the moment I am just writing bespoke place.py files that individually generate a place.ics file. Example:
python arlington.py