Giter VIP home page Giter VIP logo

data_cleaning_workshop's Introduction

Machine Learning Project Checklist

Summary: This checklist was created to help ML students/practitioners structure their projects and problems in a way that makes sense to me.


When I just got started learning Python for Machine Learning and worked on my first few projects, I found it very overwhelming because...

  • it was difficult to remember all of the steps I needed to take in order to make my data ML-friendly,
  • I couldn't easily remember the functions, methods, and estimators from pandas, numpy, and sklearn, and
  • it was tedious and time-consuming to try to understand large (>50 feature) datasets

So, I created the ML checklist (Pictured Below) to be a handy tool for whenever I start to feel lost creating an ML project.

Machine Learning Checklist

In this repo, I also created...

  1. ml_project_checklist_template.ipynb: (Pictured below) a Jupyter .ipynb that you can use as a template for your project or Kaggle competition
  2. data_cleaning_for_ml_lab_EXERCISES.ipynb: An exercises/lab that you can finish for data cleaning practice, originally made for a workshop that I gave
  3. data_cleaning_for_ml_lab_SOLUTIONS.ipynb: A solutions file for the exercises I give above
  4. boston.csv and cambridge.csv: Airbnb datasets from here used for the exercises
  5. I also included a PDF version of the checklist.

iPynb Template


I hope you find these resources as useful as I do!

Happy learning :).

data_cleaning_workshop's People

Contributors

pdeguzman96 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.