
explore_flights_data's Introduction

Explore flights Data

Dataset

The data consists of 73546 records of US flights over 5 years, from 1987 to 1992. As the original files were too big, I created random samples for this learning study. There are time variables (mainly expected times and delays), numeric variables and text variables such as origins, destinations and carriers' names.
Those data files have been selected from here.
A detailed presentation of the data is available here.
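
As an illustration of how a random sample can be drawn from a file too large to load at once, here is a minimal sketch using reservoir sampling with the Python standard library. It is not necessarily the method used for this study; the function name `sample_rows`, the column names and the in-memory demo file are all hypothetical.

```python
import csv
import io
import random


def sample_rows(lines, k, seed=0):
    """Return the header and a uniform random sample of k data rows.

    Reads the input once, row by row, so the whole file never has to
    fit in memory (reservoir sampling, "Algorithm R").
    """
    reader = csv.reader(lines)
    header = next(reader)
    rng = random.Random(seed)
    reservoir = []
    for i, row in enumerate(reader):
        if i < k:
            # Fill the reservoir with the first k rows.
            reservoir.append(row)
        else:
            # Replace an existing entry with probability k / (i + 1).
            j = rng.randint(0, i)  # both ends inclusive
            if j < k:
                reservoir[j] = row
    return header, reservoir


# Tiny in-memory stand-in for one of the yearly files (hypothetical columns).
demo = io.StringIO(
    "Year,Origin,Dest\n" + "".join(f"1987,ORD,A{n}\n" for n in range(1000))
)
header, rows = sample_rows(demo, k=50)
print(header, len(rows))
```

With a real file, the same function could be called on an open file handle instead of the `io.StringIO` stand-in.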

Summary of Findings

Starting with the exploration, I found that 1.1% of the flights were cancelled.
Chicago was the city with the most cancellations, and US Airways appeared to be the carrier with the most cancelled flights. While looking for other features related to cancellations or delays, I found that arrival delays follow a right-skewed distribution. I zoomed in and set axis limits to focus on the bulk of the data, which showed arrival delays ranging from -40 minutes (early arrivals) to 80 minutes. Next, for cancelled flights, I looked into the relationship between destinations and carrier codes.
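
The steps above can be sketched on a handful of made-up records. The tuples, airport codes and carrier codes below are hypothetical and do not come from the dataset. The list filter stands in for the axis limits of the plot: axis limits only hide the tails, whereas the filter drops them from the list.

```python
from collections import Counter

# Hypothetical records:
# (destination, carrier, cancelled flag, arrival delay in minutes or None)
flights = [
    ("ORD", "AA", 0, 12),
    ("ORD", "UA", 1, None),
    ("ATL", "DL", 0, -5),
    ("BOS", "US", 0, 250),
    ("ATL", "EA", 0, 45),
    ("ORD", "AA", 1, None),
    ("BOS", "US", 0, -60),
    ("ATL", "DL", 0, 3),
]

# Share of cancelled flights.
cancelled = [f for f in flights if f[2] == 1]
cancel_share = len(cancelled) / len(flights)

# Destination with the most cancellations.
by_city = Counter(f[0] for f in cancelled)
top_city = by_city.most_common(1)[0][0]

# Keep only the bulk of the arrival delays, mimicking axis limits of
# -40 to 80 minutes on a histogram of a right-skewed variable.
delays = [f[3] for f in flights if f[3] is not None]
in_view = [d for d in delays if -40 <= d <= 80]

print(f"cancelled: {cancel_share:.1%}, top city: {top_city}")
print(f"{len(in_view)} of {len(delays)} delays fall inside the zoomed range")
```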


Indeed, with Chicago as the arrival city, American Airlines and United Airlines are the carriers with the most cancelled flights. For Boston, US Airways has the most cancelled flights. For Atlanta, Eastern Airlines and Delta Airlines are the carriers with the most cancelled flights. I explored the same relationship for delayed flights, examining destinations versus carrier codes.
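
A destination-by-carrier comparison of this kind amounts to a two-way count table. The sketch below builds one with the standard library. The records are invented for illustration and the helper `top_carriers` is hypothetical, not part of the study.

```python
from collections import Counter, defaultdict

# Hypothetical (destination, carrier) pairs, one per cancelled flight.
cancelled = [
    ("ORD", "AA"), ("ORD", "UA"), ("ORD", "AA"), ("ORD", "UA"), ("ORD", "DL"),
    ("BOS", "US"),
    ("ATL", "EA"), ("ATL", "DL"), ("ATL", "DL"),
]

# table[destination][carrier] -> number of cancelled flights
table = defaultdict(Counter)
for dest, carrier in cancelled:
    table[dest][carrier] += 1


def top_carriers(dest, n=2):
    """Return the n carriers with the most cancellations for one destination."""
    return [carrier for carrier, _ in table[dest].most_common(n)]


for dest in sorted(table):
    print(dest, dict(table[dest]), "top:", top_carriers(dest))
```

The same table can be reused for delayed flights by feeding it the pairs of flights that arrived late instead of the cancelled ones.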


Across all arrival cities with delays, the three carriers that are most often late are American Airlines, AS Airways and Delta Airlines. Returning to cancelled flights, I explored a third feature: the planned flight duration.


The flights cancelled by Eastern Airlines and Delta Airlines, mostly bound for Atlanta, are short- and medium-haul flights. The flights cancelled by American Airlines and United Airlines, mostly bound for Chicago, are also short- and medium-haul flights. The third feature I explored for flights delayed on arrival was the departure delay.
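
One way to turn a planned duration into a haul category is simple binning, sketched below. The cut-offs (under 180 minutes for short-haul, under 360 minutes for medium-haul) are assumptions chosen for illustration, since the study does not state which thresholds it used. The records and the function `haul_category` are likewise hypothetical.

```python
from collections import Counter


def haul_category(planned_minutes, short_max=180, medium_max=360):
    """Bin a planned flight duration (minutes) into a haul category.

    The default thresholds are illustrative assumptions, not values
    taken from the study.
    """
    if planned_minutes < short_max:
        return "short-haul"
    if planned_minutes < medium_max:
        return "medium-haul"
    return "long-haul"


# Hypothetical cancelled flights: (carrier, planned duration in minutes)
cancelled = [
    ("EA", 95), ("DL", 120), ("DL", 240),
    ("AA", 150), ("UA", 200), ("AA", 400),
]

# Count cancelled flights per (carrier, haul category).
counts = Counter((carrier, haul_category(minutes)) for carrier, minutes in cancelled)

for (carrier, category), n in sorted(counts.items()):
    print(carrier, category, n)
```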


I noticed that, especially for Chicago and Atlanta, there are departure delays for all the carriers. Other parameters, weather for example, could provide further explanation, but they are not covered in this short study.

Key Insights for Presentation

For the presentation, I first focused on the influence of carriers and cities on cancellations. I started by examining cancellations, cities and carriers as categorical variables one by one. Then I dug into their relationships: cancellations with cities, then cancellations with carriers. Finally, I looked at the influence of the planned flight duration, as a third feature, on cancellations. For delayed flights, I also started with the influence of carriers and cities, then added a third parameter: the departure delay.
The presentation is here
