
acbraith/crossfit_scraper

Crossfit Web Scraper and Data Analysis

crossfit_api.py provides an API for scraping data from the 2017 Crossfit leaderboard. Make sure there's a /cache folder in the same directory as crossfit_scaper.py; the results of big queries are stored there, including any full leaderboards you fetch. To refresh your data, delete the cached files manually.
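The caching behaviour described above can be sketched roughly as follows. This is not the code from crossfit_api.py: the `cached_query` helper, the JSON file format, and the hashed file names are all assumptions made for illustration. Unlike the real script, this sketch also creates the cache folder if it is missing.

```python
import hashlib
import json
from pathlib import Path

CACHE_DIR = Path("cache")  # assumed location, mirroring the /cache folder


def cached_query(key, fetch, cache_dir=CACHE_DIR):
    """Return the stored result for `key`, calling `fetch()` only on a miss.

    `key` is any string identifying the query; `fetch` is a zero-argument
    callable that does the expensive scraping and returns JSON-serialisable data.
    Both names are hypothetical and not taken from the repository.
    """
    cache_dir = Path(cache_dir)
    cache_dir.mkdir(parents=True, exist_ok=True)

    # Hash the key so arbitrary query strings map to safe file names.
    name = hashlib.sha256(key.encode("utf-8")).hexdigest()[:16] + ".json"
    path = cache_dir / name

    if path.exists():
        # Cache hit: reuse the stored result instead of scraping again.
        return json.loads(path.read_text(encoding="utf-8"))

    # Cache miss: run the query and store the result for next time.
    result = fetch()
    path.write_text(json.dumps(result), encoding="utf-8")
    return result
```

With a scheme like this, deleting the files in the cache folder is enough to force the next call to fetch fresh data.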

data_analysis.py produces some pretty box plots and lets you interact with them: left click to set 'your' benchmarks, right click to remove them. (One initial click is needed to fix the GUI, which is messed up until then for some reason...) The predicted open placement is then calculated using an XGB model trained on open leaderboard data. This code needs some pretty serious cleaning up though; I'll get around to that sometime...
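The left-click / right-click behaviour could look something like the sketch below. It assumes the plots are drawn with matplotlib, which the README does not state. The `make_click_handler` helper, the `axes_names` mapping, and the benchmark name in the usage note are hypothetical and not taken from data_analysis.py.

```python
LEFT_BUTTON = 1   # value of matplotlib's MouseButton.LEFT
RIGHT_BUTTON = 3  # value of matplotlib's MouseButton.RIGHT


def make_click_handler(benchmarks, axes_names):
    """Build a click callback that edits `benchmarks` in place.

    `benchmarks` maps a benchmark name to the user's chosen value.
    `axes_names` maps each plot's axes object to its benchmark name.
    """
    def on_click(event):
        name = axes_names.get(event.inaxes)
        if name is None or event.ydata is None:
            return  # the click landed outside every known plot
        if event.button == LEFT_BUTTON:
            benchmarks[name] = event.ydata  # set 'your' benchmark
        elif event.button == RIGHT_BUTTON:
            benchmarks.pop(name, None)      # remove it if present

    return on_click
```

In a matplotlib figure, a handler like this would be registered with `fig.canvas.mpl_connect("button_press_event", on_click)`, where `axes_names` might be something like `{ax: "back_squat"}`. The updated `benchmarks` dict could then be passed to the trained model to recompute the predicted placement.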

Here are some nice box plots of the scraped data.
