Giter VIP home page Giter VIP logo

applied_data_science_capstone_spacex_ibm's Introduction

Applied Data Science Capstone:

This project is part of IBM Data Science certificate. https://www.coursera.org/professional-certificates/ibm-data-science

SpaceX Falcon 9 First Stage Landing Prediction

In this project, we will predict if the Falcon 9 first stage will land successfully. SpaceX advertises Falcon 9 rocket launches on its website with a cost of 62 million dollars; other providers cost upward of 165 million dollars each, much of the savings is because SpaceX can reuse the first stage. Therefore if we can determine if the first stage will land, we can determine the cost of a launch. This information can be used if an alternate company wants to bid against SpaceX for a rocket launch. In this lab, you will collect and make sure the data is in the correct format from an API. The following is an example of a successful and launch. This supervised classification project was carried out as part of the IBM Data Science Professional certificate.

Presentation of the project

The presentation covers all steps described in Jupyter Notebooks in following sections. From Data Collection, Analysis to prediction of launch succeess (succesful first stage landing) with classification algorithms. We draw a parallel between SpaceX Falcon9 and ArianeEspace Ariane5 (and coming Ariane6) that was dominating the market of commercial launches until SpaceX put an end to this domination thanks to the reusable boosters concept and induced cost reduction.

Presentation to be downloaded (15.6MB). Will not display on Github.

SpaceX Data Collection.

Request to the SpaceX API.
Clean the requested data.

Data Wrangling.

In this lab, we will perform some Exploratory Data Analysis (EDA) to find some patterns in the data and determine what would be the label for training supervised models.

Web scraping Falcon 9 and Falcon Heavy Launches Records from Wikipedia

Web scraping to collect Falcon 9 historical launch records from a Wikipedia page titled List of Falcon 9 and Falcon Heavy launches. Beautiful soup.

EDA - Undertanding Dataset. SQL Queries.

Understand the Spacex DataSet
Load the dataset into the corresponding table in a Db2 database.
Execute SQL queries to answer assignment questions

EDA Data visualization

EDA Data visualization. Exploring and Preparing Data.
Perform exploratory Data Analysis and Feature Engineering using Pandas and Matplotlib Exploratory Data Analysis.
Preparing Data Feature Engineering
In this assignment, we will predict if the Falcon 9 first stage will land successfully. SpaceX advertises Falcon 9 rocket launches on its website with a cost of 62 million dollars; other providers cost upward of 165 million dollars each, much of the savings is due to the fact that SpaceX can reuse the first stage.

SpaceX Launch Records Dashboard

Dashboard Application with Plotly Dash about SpaceX launch sites, sucess rate and payload. In the previous exploratory data analysis labs, you have visualized the SpaceX launch dataset using matplotlib and seaborn and discovered some preliminary correlations between the launch site and success rates.

  • Launch Site Drop-down Input Component
  • Callback function to render success-pie-chart based on selected site dropdown
  • Range Slider to Select Payload
  • Callback function to render the success-payload-scatter-chart scatter plot.

Note: the interactive Notebook doesnot work on Github. Must be downloaded and executed locally. Or on Google Colab.

Launch Sites Locations Analysis with Folium

Interactive The launch success rate may depend on many factors such as payload mass, orbit type, and so on. It may also depend on the location and proximities of a launch site, i.e., the initial position of rocket trajectories. Finding an optimal location for building a launch site certainly involves many factors and hopefully we could discover some of the factors by analyzing the existing launch site locations.
In this lab, you will be performing more interactive visual analytics using Folium.
Note: The interactive Notebook does not work on Github. It must be downloaded and run locally, with an Anaconda distribution for example. Or on Google Colab.

SpaceX Launch Success. Machine Learning classification

Machine Learning Prediction using Classification Algorithms: KNN, Decision Tree, SVM, Logistic Regression.
Optimization of hyperparameters.




Stephane D. Oct-2022 Rev. Dec-2022

applied_data_science_capstone_spacex_ibm's People

Contributors

drstef avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.