saad2016x / driving-licenses-traffic-accidents-and-casualties-analysis


A review of some figures on traffic accidents in the Kingdom of Saudi Arabia during 2016 and 2017. It is an attempt to use the Python programming language to produce useful graphs while learning the language.



Traffic Accidents & Casualties in Relation to Driving Licences Issuance

Executive Summary

After cleaning and munging the available datasets and normalizing for population, we find a weak relation between driving licence issuance and traffic accidents. The correlation between driving licence issuance and injury casualties is -0.5, which might suggest that traffic jams play a role, since more drivers are on the road.

The following data, if made available and linked with what we have, would yield further insights and help answer the questions raised by our analysis of the available datasets: accident metadata, demographics of the dead and injured, and 'Saher' system and traffic violations data.

Introduction

Sound policy reform is best introduced using data, so that decisions are evidence-based and capitalize on available resources. In this project, we used datasets on driving licences, traffic accidents, and population to explore relations between traffic accidents and casualties and driving licence issuance in Saudi Arabia for the years 2016-2017.

The objective is to find whether there is a relation between traffic accidents and casualties and driving licence issuance, and whether administrative areas share any commonalities in the ratio of accidents per issued driving licence.

Key words: #traffic #accidents #licences #Saudi #Arabia #casualties #deaths #injuries #2016 #2017

Analysis

After cleaning and munging the datasets, we created a combined dataset, available at the following link:

Combined dataset

The following data dictionary describes each variable:

| Feature | Type | Dataset | Description |
|---|---|---|---|
| year | date | Driving_Licences | Year in which the driving licences were issued |
| administrative_area | object | Driving_Licences | Administrative area where the driving licences were issued |
| num_issued_driving_licences | int | Driving_Licences | Number of issued driving licences |
| num_of_accidents | float | Traffic_Accidents | Number of traffic accidents |
| num_of_casualties_dead | float | Traffic_Accidents | Number of traffic accidents that had death casualties |
| num_of_casualties_injured | float | Traffic_Accidents | Number of traffic accidents that had injury casualties |
| pop_2016 | float | Population | 2016 population of Saudi Arabia by administrative area |
| pop_2017 | float | Population | 2017 population of Saudi Arabia by administrative area |
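As a rough illustration of how a combined table with these columns could be assembled, here is a minimal pandas sketch. The rows are made-up placeholder values ("Area A", "Area B"), not figures from the project, and the choice of join keys is an assumption; the notebook may build the combined dataset differently.

```python
import pandas as pd

# Made-up placeholder rows standing in for the three source datasets.
licences = pd.DataFrame({
    "year": [2016, 2017, 2016, 2017],
    "administrative_area": ["Area A", "Area A", "Area B", "Area B"],
    "num_issued_driving_licences": [1200, 1500, 300, 350],
})
accidents = pd.DataFrame({
    "year": [2016, 2017, 2016, 2017],
    "administrative_area": ["Area A", "Area A", "Area B", "Area B"],
    "num_of_accidents": [900.0, 850.0, 400.0, 420.0],
    "num_of_casualties_dead": [30.0, 28.0, 12.0, 15.0],
    "num_of_casualties_injured": [110.0, 100.0, 60.0, 66.0],
})
population = pd.DataFrame({
    "administrative_area": ["Area A", "Area B"],
    "pop_2016": [80000.0, 20000.0],
    "pop_2017": [82000.0, 20500.0],
})

# Assumed join keys: (year, area) for the yearly tables, area alone for population.
# `validate` raises an error if the keys are not unique where expected.
combined = (
    licences
    .merge(accidents, on=["year", "administrative_area"], how="inner", validate="one_to_one")
    .merge(population, on="administrative_area", how="left", validate="many_to_one")
)

print(combined.dtypes)
```

Using `validate` is a design choice rather than a requirement: it catches duplicated area names or years early, which is a common issue when area names are spelled inconsistently across sources.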

First Glance

The correlation between the variables we analyzed is shown below:

It is clear that most of the variables are highly correlated. Does this hold true if we normalize by population size? We examine that next.



Results

After normalizing for population, there is a weak relation between driving licence issuance and traffic accidents (corr = 0.026). At first the two appear highly positively correlated, but the relation does not hold up once the correlation is calculated on normalized values. Moreover, after normalizing, the correlation between driving licence issuance and injury casualties flipped to negative (corr = -0.510). We suspect this might suggest that traffic jams play a role, since more drivers are on the road.
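The effect described above, where a strong correlation between raw counts weakens or changes sign once counts are divided by population, can be reproduced on a toy example. The sketch below uses invented numbers for four hypothetical areas, chosen only so that larger areas have more of everything; it does not reproduce the project's values of 0.026 or -0.510.

```python
import pandas as pd

# Invented counts for four hypothetical areas of very different sizes.
toy = pd.DataFrame({
    "population": [1_000_000, 2_000_000, 4_000_000, 8_000_000],
    "num_issued_driving_licences": [25_000, 30_000, 80_000, 144_000],
    "num_of_accidents": [15_000, 32_000, 56_000, 124_000],
})

# Raw counts: both columns grow with population, so they move together.
raw_corr = toy["num_issued_driving_licences"].corr(toy["num_of_accidents"])

# Normalize both columns to rates per 10k people before correlating.
toy["licences_per_10k"] = toy["num_issued_driving_licences"] / toy["population"] * 10_000
toy["accidents_per_10k"] = toy["num_of_accidents"] / toy["population"] * 10_000
rate_corr = toy["licences_per_10k"].corr(toy["accidents_per_10k"])

print(f"raw counts:   {raw_corr:.3f}")
print(f"per-10k rate: {rate_corr:.3f}")
```

In this toy data the raw correlation is driven almost entirely by area size, which is why it disappears after normalization. `Series.corr` computes the Pearson coefficient by default.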


The following chart shows how many driving licences were issued per 10k of population, aggregated by region:

Hail and Riyadh appear to issue more driving licences on average per 10k people; the overall average is 198 licences per 10k people.
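A per-10k rate aggregated by area, like the one charted above, might be computed along the following lines. This is a sketch on made-up rows using the column names from the data dictionary; the helper column `population` and the choice to average the two yearly rates per area are assumptions, not necessarily what the notebook does.

```python
import numpy as np
import pandas as pd

# Made-up placeholder rows, two years for each of two hypothetical areas.
df = pd.DataFrame({
    "year": [2016, 2017, 2016, 2017],
    "administrative_area": ["Area A", "Area A", "Area B", "Area B"],
    "num_issued_driving_licences": [1200, 1500, 300, 350],
    "pop_2016": [80000.0, 80000.0, 20000.0, 20000.0],
    "pop_2017": [82000.0, 82000.0, 20500.0, 20500.0],
})

# Pick the population column that matches each row's year (assumed helper column).
df["population"] = np.where(df["year"] == 2016, df["pop_2016"], df["pop_2017"])

# Licences issued per 10k people, per area and year.
df["licences_per_10k"] = df["num_issued_driving_licences"] / df["population"] * 10_000

# Average the yearly rates within each area, highest first.
by_area = (
    df.groupby("administrative_area")["licences_per_10k"]
    .mean()
    .sort_values(ascending=False)
)
overall_mean = by_area.mean()

print(by_area)
print(f"overall mean: {overall_mean:.1f}")
```

The same pattern applies to deaths and injuries by swapping the numerator column and multiplying by 100,000 instead of 10,000.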


The following chart shows how many accidents happened per 10k of population, aggregated by region:

The north east of Saudi Arabia appears to have more accidents on average per 10k people; the overall average is 151 accidents per 10k people.


The following chart shows how many deaths happened per 100k of population, aggregated by region:

4 out of the 6 areas that have more deaths on average are in the northern part of Saudi Arabia; the overall average is 33 deaths per 100k people.


The following chart shows how many injuries happened per 100k of population, aggregated by region:

Al-Baha had 235 injuries per 100k of population, well above the average, while Riyadh is moderately low in injuries per 100k people; the overall average is 136 injuries per 100k people.


The following chart shows traffic accidents per issued driving licence, aggregated by region:

Al-Qaseem had 3 accidents per issued driving licence on average, which could be because Al-Qaseem is among the areas that issued the fewest driving licences in 2016-2017. The Northern Borders area again stands out as abnormal. On average, 8 out of 13 areas have less than 1 accident per issued driving licence.
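The accidents-per-licence ratio is a plain division, which is why an area that issues few licences can show a large value. A minimal sketch on invented numbers for three hypothetical areas, assuming the ratio is computed per year and then averaged per area:

```python
import pandas as pd

# Invented counts; "Area B" issues few licences relative to its accidents.
df = pd.DataFrame({
    "administrative_area": ["Area A", "Area A", "Area B", "Area B", "Area C", "Area C"],
    "year": [2016, 2017] * 3,
    "num_issued_driving_licences": [1000, 1200, 200, 250, 500, 400],
    "num_of_accidents": [800.0, 900.0, 600.0, 700.0, 450.0, 500.0],
})

# Accidents per issued licence for each area and year.
df["accidents_per_licence"] = df["num_of_accidents"] / df["num_issued_driving_licences"]

# Average over the two years, then list the areas below one accident per licence.
ratio_by_area = df.groupby("administrative_area")["accidents_per_licence"].mean()
below_one = ratio_by_area[ratio_by_area < 1]

print(ratio_by_area.sort_values(ascending=False))
print(f"{len(below_one)} of {len(ratio_by_area)} areas are below 1 accident per licence")
```

Because the denominator is licences issued in a given year rather than licences in circulation, a high ratio does not by itself mean drivers in that area are more accident-prone.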



Conclusion

There is a weak relation between the number of driving licences issued and traffic accidents, while the correlation between the number of driving licences issued and accidents that had injuries is moderately negative. This might suggest that more drivers on the road lead to fewer severe accidents, which could be explained by traffic jams.

The following questions can be addressed in further research, one for each area that shows a trend and stands out among the others:

  • Northern Borders: Why are death rates the highest?

  • Al-Jouf: Why are casualties high in general?

  • Al-Baha: Why are injury rates the highest?

  • Hail: Are there any reasons for the high rates of driving licence issuance?

  • Riyadh: Are there any reasons for the high rates of driving licence issuance? And why are casualties the lowest among all areas?


The following data, if made available and linked with what we have, would yield further insights:

  1. Accidents meta-data.
  2. Demographics of dead and injured.
  3. 'Saher' system and traffic violations data.
