Giter VIP home page Giter VIP logo

misk-homl's Introduction

Hands-on machine learning for predictive analytics

Misk Academy


๐Ÿ—“๏ธ October, 2020
โฐ 16:00 - 18:00 & 18:30 - 20:30


Overview

Students will learn many of the most common machine learning methods to include:

  • A proper modeling process
  • Feature engineering
  • Linear and logistic regression
  • Regularized models
  • K-nearest neighbors
  • Random forests
  • Gradient boosting machines
  • Stacking / super learners
  • And more!

This module will teach students how to build and tune these various models with R and Python packages that have been tested and approved due to their ability to scale well (i.e. glmnet, ranger, xgboost, h2o, scikit-learn). However, the motivation in almost every case is to describe the techniques in a way that helps develop intuition for its strengths and weaknesses.

Learning Objectives

This module will step through the process of building, visualizing, testing, and comparing supervised models. The goal is to expose you to building machine learning models using a variety of algorithms. By the end of this module you should:

  • Understand how to apply an end-to-end modeling process that allows you to find an optimal model.
  • Be able to properly pre-process your feature and target variables.
  • Interpret, apply and compare today's most popular and effective machine learning algorithms.
  • Methodically and efficiently tune these algorithms.
  • Visualize and compare how features impact these models.

Prework

This module makes a few assumptions of your established knowledge regarding your programming skills and exposure to basic statistical concepts. Below are my assumptions and the relevant courses that you should have already attended to make sure you are properly prepared. The material provides examples in both R and Python so as long as you are proficient with the assumptions below for one language then you will be good to go.

Assumptions Resource
Comfortable with R & Python programming link
Proficient with basic data wrangling tasks link
Knowledgable of foundational statistics link

Prior to session 1, please run the following scripts to ensure you have the necessary packages used throughout.

Language Requirements
Python link
R link

Schedule

Session Description Reading(s) Slides Source code
1 Introduction to machine learning Notebook HTML [R] [Python]
2 The modeling process Notebook HTML [R] [Python]
3 Feature and target engineering Notebook HTML [R] [Python]
4 Portfolio builder #1 Notebook
5 Linear regression Notebook HTML [R] [Python]
6 Logistic regression Notebook HTML [R] [Python]
7 Regularized regression Notebook HTML [R] [Python]
8 Portfolio builder #2 Notebook
9 Multivariate adaptive regression splines Notebook HTML [R] [Python]
10 K-nearest neighbors Notebook HTML [R] [Python]
11 Decision trees Notebook HTML [R] [Python]
12 Bagging Notebook HTML [R] [Python]
13 Random forests Notebook HTML [R] [Python]
14 Portfolio builder #3 Notebook
15 Gradient boosting Notebook HTML [R] [Python]
16 Stacked models Notebook HTML [R] [Python]
17 Portfolio builder #4 Notebook

misk-homl's People

Contributors

bradleyboehmke avatar scavetta avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.