Giter VIP home page Giter VIP logo

pandas_in_class's Introduction

Data Manipulation with pandas DataFrame

GitHub Logo

Many real-world data sets contain strings, integers, time-stamps and unstructured data. How do you store data like this so that you can manipulate it and easily retrieve important information? The answer is in a pandas DataFrame!

At a glance

  • In Class Instruction: 4 Hours
    • In Class code along Dataset: Weather Dataset
  • Project Dataset: Indian Premier League
    • Estimated Time to complete Project Tasks: 1 Hours
    • Total sub tasks within the Project: 6
    • Complexity of sub tasks : Mid to High
    • Points to be scored : 700
  • Why should you care about this project: This project challenges you to manipulate large datasets without using conventional programming techniques to extract business insights.
  • Skills Rehearsed
    • Python

In-Class Activities

  • Instructor led concept onboarding
  • Code Alongs
  • In Class Quiz Administration
  • Periodic Recap - Closer to the end of session
  • In Class Assignments - Motivation
  • Take Away Assignments

Why complete this?

  • You will become acquainted with the powertool of pandas - the DataFrame. You will learn how to use pandas to import and then inspect a variety of datasets.

  • Having learned the fundamentals of working with DataFrames, you will now move on to more advanced indexing techniques. These are powerful techniques that allow you to tidy and rearrange your data into the format that allows you to most easily analyze it for insights.

Learning Objective

After this lesson, you'll be able to

  • Understand the need for Pandas in Data Science
  • Data Manipulation and Transformations
  • Pivot Tables and Group By
  • Merging Data

Pre Reads

Slides

Check the Jupyter Notebook in the top right of the screen

Post Reads

Project

In IPL teams representing Indian cities contend each year. Chris Gayle is the highest run scorer in IPL. Do you know who is the second highest run scorer (without using β€˜for’ loop)? This module can help you determine the second highest run scorer by manipulating large data sets to extract business insights.

This project challenges you to manipulate large datasets without using conventional programming techniques to extract business insights.

pandas_in_class's People

Contributors

jay-trivedi avatar bhumilharia avatar codebeast77 avatar alex19427 avatar gyandhanee avatar

Watchers

Soumendra Prasad Dhanee avatar James Cloos avatar Suraj Jadhav avatar PradeepJaiswar avatar Mayuresh Suresh Shilotri avatar Sangam Angre avatar Shweta Doshi avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.