Giter VIP home page Giter VIP logo

school_district_analysis's Introduction

School_District_Analysis

This analysis consists of two technical deliverables and written report to present the results on the ninth grade reading and math scores and perform school district analysis.

Background:

Maria, the chief data scientist for a city school district is responsible for analyzing information from a variety of resources and in a variety of formats. In this role she has a task of preparing all standardized test data for analysis, reportings and presesntation to provide insights about performance trends and patterns. These insights are used for informed discussions and strategic decisions at the school and district level.

Overview of the school district analysis:

I will be helping Maria analyze data on student funding and students standardized test scores. I am given access to every student's math and reading scores, as well as various information on the schools they attend. My task is to aggregate the data and showcase trends in school performance. This analysis will assist the school board and the superintendent in making decisions regarding the school budget and priorities. For this task, the Family Educational Rights and Privacy Act(FERPA) of 1974 is taken into consideration as it protects the privacy of student education records. This law applies to all school that receive funds under an applicable program of the US Department of Education. The data is to be treated with utmost confidentiality to protect the students I am reporting on.

Tools used:

Maria advises to download and install a software calle Anaconda, a free open source distribution software for over 1500 packages, to perform the analysis. She has also provide some instructions and video tutorials. The analysis of school data will be done using the package 'Jupyter Notebook'.Jupyter Notebook is to Anaconda what Microsoft Word is to Microsoft Office. Anaconda packages support the Python and R programming languages for datatscience, machine learning and data processing and so on. Here we will be using Python too. Data source - election_results.csv file is an Excel file with given data.

  • Software - Python 3.7.8, Visual Studio Code, Anaconda, Jupyter Notebook
  • Data Source - PyCitySchools.ipynb
  • Libraries - Pandas
  • Add,commit,push - GitBash and GitHub

Purpose:

The school board has notified Maria and her supervisor that the students_complete.csv file shows evidence of academic dishonesty. It has been noted that the reading and math grades for Thomas High School ninth graders appear to have been altered. Although the school board does not know the full extent of the academic dishonesty, they want to uphold state-testing standards and have turned to Maria for help. Maria in turn has asked me to replace the math and reading scores for Thomas High School with NaNs while keeping the rest of the data intact. Once replaced, Maria would like me to repeat the school district analysis and write up a report to describe how these changes affected the overall analysis. For this purpose, the following steps were taken to work on the given starter data:

  • In order to select all the math and reading scores from Thomas High School, the loc method was used.
  • In order to retrieve all the rows with 'Thomas High School' and '9th grade' entries, the comparison operator was used.
  • The reading and math scores were also retrieved using logical and comparison operators.
  • The reading and math scores were then replaced by 'NaN' so that such entries will not be considered for calculation in the school district summary.

15

Code Snapshots of the Deliverables:

Deliverable 1:

22 23

Deliverable 2:

24 25 26

School District Analysis

  • The top 5 and bottom 5 performing schools, based on the overall passing rate 27 28

  • The average math score for each grade level from each school

37

  • The average reading score for each grade level from each school

36

  • The scores by school spending per student, by school size, and by school type

By Spending

29

30 31

By school size

32 33

By school type

34 35

Results:

1) How is the district summary affected?

On performaing calculations for Thomas High School, it can be noted that there was'nt much difference in the average scores and percentage results for 9th graders. It can be noted that originally there were 39170 students and after removing the THS students the new student count was 38709.

21

20 The passing math percentage score also changed from 74.8 to 75%. Passing reading percentage changed from 85.7 to 86%. The overall passing percentage changed from 64.9 to 65%. Hence there was not a lot of major changes.

2) How is the school summary affected?

For Thomas High School, the average math and reading scores were recalculated and school summary updated. It can be noted for Thomas High School that before data cleanup the overall passing percentage was 65% and after deliverable 2 it moved up to 90%.

40

Before and after the cleanup, the overall percentage is 65% and 90%. There was achange in the %passing math, %Passing Reading and %Overall passing where all the values were over 90%.

41

3) How does replacing the ninth graders’ math and reading scores affect Thomas High School’s performance relative to the other schools?

By replacing Thomas High Schools' ninth graders' math and reading scores, Rest of the schools did not see much change in their rankings as much as Thomas High School. Thomas High school's initial overall percentage was 65% and then it changed to 90%.

13

4) How does replacing the ninth-grade scores affect the following:

For comparative study, we replaced the ninth grade scores. The various aspects of the analysis is discussed below in detail.

- Math and reading scores by grade

We wanted to get more in depth knowledge on the performance of the students at the different schools.Therefore, we tabulated the data according to the grades 9th, 10th, 11th and 12th. This helped us greatly in the comparative study. Math and reading scores for Thomas High School reduced after the cleanup hence supporting the fact that the 9th graders' score were equivalent to negligible. The average math and reading scores for 10th,11th and 12th grades did not change much after the analysis.

Math scores by grade:

  • Math Score was set to 'NaN' that is equivalent to 0.
  • Student count before the cleanup was 1635.
  • Studet count after the cleanup was 1174.

39

Reading scores by grade:

  • Reading Score was set to 'NaN' that is equivalent to 0.
  • Student count before the cleanup was 1635.
  • Studnet count after the cleanup was 1174.

38

As per the tabulated results we can see that there was hardly any change in the results.

- Scores by school spending

Various bins were created to analyze the scores based on spending. As per the tabulated results it can be noted that some schools that had the lowest spending per student, still could achieve overall passing percentage of 90%.

7

On the other hand, some other schools with per student spending of over $645 were able to achieve an overall passing percentage of 53% only. Hence the district needs to understand as to how they can support the schools so that they can deliver better results in the future.

3

  • Thomas High School is in the spending bucket of '$630-644,
  • Student count before the cleanup was 1635.
  • Studnet count after the cleanup was 1174.

- Scores by school size

Bins were created and various groups were defined like small, medium and large. The size data was then categorized based on the bins and the results can be seen below.

6

2

- Scores by school type

In order to understand which types of schools were performing well, we performed an analysis and found out that the charter schools performed much better than the district schools. The overall percentage for charter schools was found to be 90% as compared to the disctrict schools having just 54%.

1

  • Thomas High School is a 'charter' type of school.
  • Removing Thomas High School 9th Grade scores reduces the "% Passing Math" by 0.1%, "% Passing Reading" by 0.3% and "% Overall Passing" by over 0.3%.

Summary:

Summarizing the four changes in the updated school district analysis after reading and math scores for the ninth grade at Thomas High School have been replaced with NaNs:

    1. After nullifying the scores of 9th graders at Thomas High School, there was not much difference in the school district summary.
    1. when we nullified the scores of 9th graders at Thomas High school, we could observe that the school lost its rank in the top 5 ranking schools.
    1. After revaluation of 10th and 12th scores at Thomas High School, the average scores and percentages changed drastically, resulting in it levelling up.
    1. For 10th and 12th grades, after we recalculated the scores, the ranking of Thomas High School moved to the top 5 school rankings in the district.

We can also conclude from the overall analysis that the schools excelling academically were mostly charter schools. As a result, the district needs to find various ways to address this issue for the overall improvement in academics. It was also noted that the schools with more students had a direct impact on their academic success.

school_district_analysis's People

Contributors

aakriiti2504 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.