- Anaconda
- Jupyter Notebooks
- The Data Analysis Process
- Data Analysis Process - Case Study 1
- Data Analysis Process - Case Study 2
- Programming Workflow for Data Analysis
- Investigate a Dataset ##Project Overview In this project, you will analyze a dataset and then communicate your findings about it. You will use the Python libraries NumPy, pandas, and Matplotlib to make your analysis easier.
You will need an installation of Python, plus the following libraries:
- pandas
- NumPy
- Matplotlib
- csv We recommend installing Anaconda, which comes with all of the necessary packages, as well as IPython notebook. Here are the installation steps:
Download the installer from https://www.anaconda.com/download/. Choose the Python 3.6 or higher version, and the appropriate 64/32-bit installer.
Refer to the installation instructions here.
Verify the installation, as mentioned here.
In this project, you'll go through the data analysis process and see how everything fits together.
You'll use the Python libraries NumPy, pandas, and Matplotlib, which make writing data analysis code in Python a lot easier! Not only that, these are sought-after skills by employers!
After completing the project, you will:
- Know all the steps involved in a typical data analysis process
- Be comfortable posing questions that can be answered with a given dataset and then answering those questions
- Know how to investigate problems in a dataset and wrangle the data into a format you can use
- Have experience communicating the results of your analysis
- Be able to use vectorized operations in NumPy and pandas to speed up your data analysis code
- Be familiar with pandas' Series and DataFrame objects, which let you access your data more conveniently
- Know how to use Matplotlib to produce plots showing your findings