An automated plagiarism detector that handles unzipping, generates plagiarism report and scraps the reports for minimum threshold plagiarism.
-
Staging Files: All the zip files are unzipped and the files that are present in unzipped folder are segreggated based on a particular format which is
{rollno.}_{questionno.}
. Makes multiple directories based on the number of questions. -
Plagiarism Script: Checks for plagiarism and provides the percentage of plagiarism for each file against all the other files that are present in a directory. Generates a HTML file containing plagiarism report.
- run script:python3 plag.py {Assignment file location} {Report generation location}
-
Scrapping Plagiarism Report: Scrapes the percentage of plagiarism from each report generated by Plagiarism Script and returns a
.txt
file that contains the Roll No. of all the students who have plagiarism above the minimum allowed threshold.