Giter VIP home page Giter VIP logo

ibm-applied-datascience-captsone's Introduction

Applied Data Science Capstone

This repository contains all the material I developed for achieving the IBM Data Science Professional Certificate.

The Jupyter Notebooks here provided represents a part of my portfolio regarding the Data Science Field.

The idea of the project is to fully deploy the skills I acquired and that now I should master for taking part in a Data Science project based on real data. The most important aspects would be:

Hard skills

  • Coding using Python in a Jupyter Notebook environment, using the many Data Science libraries, such as pandas, numpy, scikit-learn, seaborn, folium, plotly, dash, and many more.

  • Computational thinking, i.e., solving real world data issues by means of coding instructions.

Soft skills

  • Understanding the patterns in the data I gathered.
  • Presenting the data in a way that stakeholders can be advised.

The Data Science Project, in brief

SpaceY is a newly established rocket launch company which wants to compete against the already established SpaceX. To do so, SpaceY should be able to:

  • Reuse the 1st stage rocket booster.
  • Be more cost competitive than its competitor.

SpaceX states that their launch services with 1 st stage recovery cost 62 million USD , whereas 15 million USD are required to build a 1 st stage Falcon 9 booster when excluding R&D and profit margin.

Considering the parameters in our predictive models, a Decision Tree was capable to predict the successfulness of 1 st stage booster landing with an accuracy of 89%.

It comes that SpaceY will be able to predict the cost of a launch exploiting the Decision Tree model as a proxy. Thus, SpaceY will be capable of making more informed bids against SpaceX for a rocket launch.

The Notebooks

Here you can find a brief description for each of the Jupyter Notebook files used for the project. All the results in the final presentation come from the above mentioned notebooks.

  • 01_jupyter-labs-spacex-data-collection-api.ipynb allows to collect launches information using the Open Source REST API for SpaceX.

  • 02_jupyter-labs-webscraping.ipynb allows to retrieve information through web scraping exploiting the Wikipedia page listing the Falcon 9 Heavy launches.

  • 03_labs-jupyter-spacex-Data wrangling.ipynb manipulates the information previously retrieved in order to get appropriate labeling for further classification model creation.

  • 04_jupyter-labs-eda-sql-coursera_sqllite.ipynb queries a SQL database to retrieve further information about the SpaceX Falcon 9 history.

  • 05_jupyter-labs-eda-dataviz.ipynb allows to perform data visualization on the data previously gathered, so that visual insights can be readiliy retrieved.

  • 06_lab_jupyter_launch_site_location.ipynb creates an interactive map to retrieve information about the Launch Sites exploited by SpaceX for the Falcon 9 missions.

  • 07_SpaceX_Machine_Learning_Prediction_Part_5.jupyterlite.ipynb trains, tune, and test different machine learning model from the previosly created dataset.

Moreover:

  • spacex_dash_app.py contains the interactive dashboard. It is intended for user friendly data exploration and visualization, i.e., for the stakeholders.

Credits

Here it is the certificate I earned:

The Certificate

Guido Mascia, PhD.

Email: [email protected]

ibm-applied-datascience-captsone's People

Contributors

maskul93 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.