Giter VIP home page Giter VIP logo

dat_sf_16's Introduction

DAT_SF_16

Course materials for General Assembly's Data Science course in San Francisco (7/29/15 - 10/14/15).

Logistics

Course Description

Foundational course in data science, including machine learning theory, case studies and real-world examples, introduction to various modeling techniques, and other tools to make predictions and decisions about data. Students will gain practical computational experience by running machine learning algorithms and learning how to choose the best and most representative data models to make predictions. Students will be using Python throughout this course.

Required Setup

Completion Requirements

In order to receive a General Assembly Certificate in Data Science, upon completion of the course, students must:

  • Complete and submit 80% of all course assignments (homework, homework reviews, labs, quizzes). Students who miss more than 20% of assignments will not be eligible for the course certificate.
  • Complete and subimt the course midterm test, earning a satisfactory grade.
  • Complete and submit the course final project, earning a satisfactory grade by completing all functional and technical requirements on the project rubric, including delivering a presentation.

Assignments, milestones and feedback throughout the course are designed to prepare students to deliver a quality course project.

Course Outline

The weekly schedules for lecture content, lab content, and homework assignments are subject to change according to the needs & preferences of the class.

Course Schedule

Week Monday Wednesday
 | **UNIT 1**                                          | **DATA**

1 | | 7/29: Introduction to Data Science, Git setup 2 | 8/3: Data Format, Access & Transformation + Python review | 8/5: Cleaning and exploring data + Linear Algebra review | | | UNIT 2 | MACHINE LEARNING 3 | 8/10: Introduction to Machine learning, Classification with K-Nearest Neighbors | 8/12: Cross Validation and Naïve Bayes 4 | 8/17: Regression and Regularization | 8/19: Logistic Regression 5 | 8/24: Imbalanced Classes and Evaluation Metrics | 8/26: Advanced Classifiers 6 | 8/31: Ensemble Techniques | 9/2: Review of classification and regression | | | UNIT 3 | APPLICATIONS 7 | 9/7: Labor Day (NO CLASS) | 9/9: K-Means Clustering and Unsupervised learning 8 | 9/14: Dimensionality Reduction | 9/16: Recommendation systems 9 | 9/21: Neural Networks & Deep learning | 9/23: Natural Language Processing and Text Mining 10 | 9/28: Time Series Analysis | | | | UNIT 4 | AT SCALE 10 | | 9/30: Database Technologies 11 | 10/5: Map Reduce | 10/7: Paralell and distributed computing 12 | 10/12: Final project working session | 10/14: Final project presentations

Homework Schedule

HW Topics Dataset Assigned Due Review Due
1 Github setup 7/29 8/3 8/5
2 Data Exploration 8/5 8/10 8/12
3 Classification, KNN + Naïve Bayes Pima Indians 8/12 8/17 8/19
4 Classification, Cross Validation 8/19 8/24 8/26
5 Classification, Evaluation 8/26 8/31 9/2
Midterm ------------- 8/31 9/9 9/11
6 Clustering & Dim Reduction 9/9 9/14 9/16
7 RecSys + NLP 9/16 9/21 9/23
8 Networks 9/23 9/28 9/30
9 Time Series + AWS 9/30 10/5 10/7

Final Project Milestones

FP Deliverable Due
1 Title & Data Sources 8/19
2 Elevator Pitch 9/2
3 Draft Analysis 9/23
8 Final Project Due 10/14

Office Hours

Instructor Times Available method
Justin 3:00 - 6:00 PM Sundays in person (at GA), slack, hangouts by appointment
Francesco Monday & Wednesday slack (quickest response) or hangouts by appointment

Slack

You've all been invited to use Slack for chat during class and the day. Please consider this the primary way to contact other students. Justin will be in Slack during class to handle questions. All instructors will be available on Slack during office hours (listed above).

Resources

Working in the terminal

Statistical Learning Theory

Algorithms

Python

dat_sf_16's People

Contributors

ghego avatar breucopter avatar vanessaohta avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.