A browser for education sites built by... students with ♥.
This is a project for a data mining course in which we webscrapped various .edu
websites and their corresponding newsletter sites.
The browser in theory should rank the original source higher than their newsletter sites as they're more likely to link to the main
university page.
- Install NPM
- Open a terminal in the client folder and install dependencies with
npm i
- Using the same terminal, then run
npm start
- Your browser should automatically open a window at localhost:3000. If not, open your browser and visit that link
- Ensure NPM is installed from previous step
- Open a terminal in the server folder and run
npm i
- Using the same terminal, then run
npm start
WARNING: This will override current results so you may need to run the webscraper and then each individual method, we encourage that you do NOT re-run the webscraper
- Install python 3.8
- pip install bs4, networksx, numpy
- To run,
python3 webscrape.py
- Similar to the assignment,
python3 <method> <input_graph> <output>