Github Metrics Dashboard Apache Spark Ingestion

Objective

This project aims to collect commit KLOC of open source projects and store the data in MongoDB with the help of Apache Spark. The Inital goal is to have the code working with a smaller open source project and then move on to larger projects.

Ingestion Service works as follows
1. Get the username, repository name and branch name from the frontend.
2. Clone the repository in a temporary location, which will be deleted once the data is received
3. Create folders for storing the respective commit object - the folder name is the commit sha for that commit
4. Extract cloc, commit date, commit SHA and other required info
5. Calculate the Range*LOC for the commits per file
6. calculate Defect Density, Spoilage and Productivity

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
project		project
src/main/scala		src/main/scala
.gitignore		.gitignore
ReadME.md		ReadME.md
build.sbt		build.sbt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Github Metrics Dashboard Apache Spark Ingestion

Objective

About

Releases

Packages

Languages

LoyolaChicagoMetrics/metrics-dashboard-bash-scala

Folders and files

Latest commit

History

Repository files navigation

Github Metrics Dashboard Apache Spark Ingestion

Objective

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages