exploratory-data-analysis's Introduction

Streetfighting with Maths and EDA

Let's Get Rolling - Student Pre-Read

Before this lesson, we recommend you go through

Learning Objectives

After this lesson, you'll be able to:

  • Get an overview of a dataset without writing any code
    • Deep dive with the Pima Indians Diabetes dataset
  • Perform Exploratory Data Analysis (EDA)
  • Work with data
  • Build a mathematical feel for Linear Regression

Agenda

Slides

@gslides

Instructor's code-alongs

About Pima Indians Diabetes

Practical Advice

  • EDA is not a compulsory step. Most
  • It's not even a step. It's what you'd call getting a feel for the data.
  • Have your checklist ready
  • Apply en masse

Quantitative EDA

  • Dimensions of your data - .shape
  • Statistical Summary - .describe()
  • General high-level overview (column types, non-null counts) - .info()
  • Class Distribution - .groupby('class').size()
  • Pairwise Pearson correlations - .corr()
  • Skew for each attribute - .skew()
  • First and last rows - .head() / .tail()
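
The checklist above can be run end to end in a few lines of pandas. This is a minimal sketch, not the lesson's own code-along: the DataFrame is invented toy data (it is not the Pima Indians Diabetes dataset), and the column names `glucose`, `bmi` and `class` are assumptions chosen for illustration.

```python
import pandas as pd

# Invented toy data, for illustration only (NOT the Pima Indians Diabetes data).
df = pd.DataFrame(
    {
        "glucose": [85, 90, 100, 150, 160, 170],
        "bmi": [22.0, 24.5, 26.0, 30.5, 33.0, 35.5],
        "class": [0, 0, 0, 1, 1, 1],
    }
)

shape = df.shape                            # (rows, columns)
summary = df.describe()                     # count, mean, std, min, quartiles, max
class_counts = df.groupby("class").size()   # rows per class label
correlations = df.corr()                    # pairwise Pearson correlations (the default method)
skewness = df.skew()                        # skew of each numeric column

print(shape)
print(summary)
df.info()                                   # prints dtypes and non-null counts
print(class_counts)
print(correlations)
print(skewness)
print(df.head(2))                           # first two rows
print(df.tail(2))                           # last two rows
```

On a real dataset with text columns, `.corr()` and `.skew()` may need to be restricted to the numeric columns first, for example with `df.select_dtypes("number")`.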

Visual EDA

  • Keep the DAMN thing clean - Don't do a half-hearted job of it - Else, don't do it at all
  • What should every chart have?
    • Title
    • xlabels
    • ylabels
    • xticks
    • yticks
    • xmin
    • ymin
    • legend
    • annotate
  • Have a default script -
  • Know the following
    • What kind of insight is best drawn from what kind of chart? Refer to the taxonomy
    • Sacrilegious mistakes - What kind of features can be plotted? You can't do a scatter plot on categorical variables
    • Know your variable/feature scales - Nominal / Ordinal / Interval / Ratio
  • Most used plots
    • Q-Q plot
    • Scatter
    • Line
    • Histogram
    • Correlogram
    • Box and whiskers
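
As a starting point for a default script, the chart checklist above (title, axis labels, ticks, axis limits, legend, annotation) could look like the following with matplotlib. This is a sketch under assumptions: the data points, label text and tick values are invented, and the `Agg` backend is chosen only so the script runs without a display.

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend; drop this line to use plt.show()
import matplotlib.pyplot as plt

# Invented points, for illustration only.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 6]

fig, ax = plt.subplots()
ax.scatter(x, y, label="observations")

ax.set_title("Invented example: y against x")   # title
ax.set_xlabel("x (units)")                       # xlabel
ax.set_ylabel("y (units)")                       # ylabel
ax.set_xticks([0, 1, 2, 3, 4, 5, 6])             # xticks
ax.set_yticks([0, 2, 4, 6, 8])                   # yticks
ax.set_xlim(0, 6)                                # xmin (and xmax)
ax.set_ylim(0, 8)                                # ymin (and ymax)
ax.legend(loc="upper left")                      # legend
ax.annotate(                                     # annotate a point worth noticing
    "highest value",
    xy=(5, 6),
    xytext=(3, 7),
    arrowprops={"arrowstyle": "->"},
)
```

Call `fig.savefig(...)` with a path of your choice, or `plt.show()` on an interactive backend, to render the chart.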

Assignments

Resources & Post Reads

Contributors

shwedosh, mayureshshilotri, sangamangreg, pradeepjaiswar
