This is a website that aggregates Ukulele Tutorials for Indian Songs from different channels on YouTube.
The aggregation is done using tools like youtube-dl and pandas and a lot of manual cleaning of the data.
The website is built using Next.js and React Instant Search.
See the about page for more information!
To setup this repository for updating the metadata, you'd need to install Python requirements for scraping the data from YouTube and the js requirements for building the website.
-
Install the requirements in
requirements.txt
-
Also setup the Next.js site
cd site/
yarn
yarn dev
-
To scrape data run
$ ./scripts/update-data.py -d
This creates a CSV with parsed data, after crawling through a bunch of channels on YouTube.
-
To view the site, run
yarn dev
inside thesite/
directory. Make sure that the Python venv where the requirements are installed has been sourced. The publishing script uses youtube-dl for some features. -
You can manually start editing the metadata in the CSV, if you like. But, the Next.js site also provides an editor interface when running the site locally.
-
You can start editing the data from UI and publish when you are happy with the metadata for the song.
-
To ensure data quality, we have a tiny script that checks that all the metadata has been populated for a "published" song. You can run the check script in a separate terminal to see if data is getting saved correctly.
ls data/* |entr ./scripts/check-data.py