Skip to content

coryknuth/school_district_analysis

Repository files navigation

School District Analysis

We were tasked with analyzing a collection of data from the local school district, comparing test scores against school characteristics like size and spending to help the district leadership make budgetary decisions about next school year. One issue we ran into before completing our analysis was evidence of tampering with test score numbers relating to one grade, at one school. In order to prevent this from polluting the dataset, we removed these fraudulent scores from the data, and re-ran the analysis. We then compared the first dataset, which included the fraudulent scores, against the new data which had been cleaned.

How is the district summary affected?

We found that, due to the very small amount of students scores that were affected by the tampering, there was hardly any impact felt at the district summary level. Here is the original data.

district_summary_original

And here is the same table, scrubbed for the tampered scores.

district_summary_cleaned

As shown, there was a 0.1 decrease in the percent passing overall, and the average math score. There was a 0.2 decrease in the percent passing math, and a 0.3 decrease in percent passing reading. Again, the number of scores compared to the entire district count that were fraudulent was a very small fraction.

How is the school summary affected?

As we compare the school summaries before and after data cleaning, you'll see the impact was tiny, but measurable to the overall scores for Thomas High School.

Original Data

per_school_summary_original

Cleaned Data

per_school_summary_cleaned

How does replacing the ninth graders’ math and reading scores affect Thomas High School’s performance relative to the other schools?

In the original data, Thomas High School ranked #2 in overall passing percent for the district. After the data cleaning, while Thomas High School's overall passing percent was reduced by roughly 0.3%, Thomas maintained 2nd overall ranking for the district.

Original Data

top_5_original

Cleaned Data

top_5_cleaned

How does replacing the ninth-grade scores affect the following:

Math and reading scores by grade

Our solution to the tampered scores was to drop them altogether, replacing all Thomas 9th grade scores with N/A values. Because of this the only grade and school affected were the Thomas High 9th graders as shown.

Original Data

math_by_grade_original

Cleaned Data

math_by_grade_cleaned

Scores by school spending

We found very slight changes to scores for the $630-$644 category that Thomas High School is in after correcting for the tampered data. However, with our final data for presentation being precise only to one tenth, these very slight changes didn't impact the final results.

Original

Cleaned

Scores by school size

Much the same as with spending, as we look at the scores broken down by school size into categories, we saw fractional changes from cleaning the fraudulent scores. The changes were less than a tenth so they didn't impact our final results here since our final chart is only precise to one tenth.

Original

Cleaned

Scores by school type

We found similiar results when analyzing by school type. The differences in scores for both charter schools and district schools don't register in our data with one tenth precision.

Original

average_by_type_original

Cleaned

average_by_type_cleaned

Summary

Thankfully, the amount of data that was compromised relative to the entire dataset was extremely small. We saw the most impact to the average scores for Thomas High School itself. we saw Math and Reading averages, as well as percent passing both subjects and overall were impacted with 0.3% from the original. These changes became even smaller when you analyze at the district level between all schools and students. We found changes measured only in hundreths to average scores regardless of how the data was categorized. We broke it down by spending, size, and school type, and saw little to no impact to the final data to be presented.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published