Giter VIP home page Giter VIP logo

sta141b-lectures's Introduction

STA 141B Data & Web Technologies for Data Analysis

People

  • Instructor: Randy Lai ([email protected])

  • Meeting time: 9:00 - 10:20 AM, TR

  • Location: Wellman Hall 216

  • Office hour: MSB 4111 Tuesday and Wednesday 2:00 - 3:00pm (or by appointment)

  • TA: Franco Liang ([email protected])

  • Meeting time: 5:10 - 6:00 PM, W (Wellman 1) or 6:10 - 7:00 PM, W (Young 184)

  • Office hour: 4:00 - 6:00 PM, R (MSB 1117)

Material

Date Note HTML PDF
01-07 introduction html pdf
01-09, 01-14 tidy data html pdf
01-16 special data html pdf
01-21 visualization html pdf
01-23 maps html pdf
data table html
diagram html
01-28, 01-30 regex html pdf
02-04, 02-06 shiny html pdf
02-11, 02-13 sql html pdf
02-18 json html pdf
02-20 api html pdf
02-25 web scrapping html pdf
02-27 nosql html pdf
03-03 textmining html pdf

Site

  • Canvas for grades
  • Piazza for discussion
  • GitHub for lecture notes

Important: submit your GitHub username to https://forms.gle/E8AiF7i1xTVEFshM7

Tentative Schedule

Week Topic
1 Introduction
2 Tidy data
3 Visualization
4 Regular Expressions and strings
5 Shiny
6 Databases and SQL
7 XML, JSON and YAML
8 Web Scraping and REST API
9 Text Mining
10 Show class

Grading

Category Grade Percentage
Assignments 70%
Project 25%
Participation 5%
  • There will be around 5/6 assignments
  • Assignments must be turned in by the due date. No late assignments are accepted.
  • Participation will be based on your involvement in class, discussion, or office hours. The most subjective way to earn participation points is to have some interactions on Piazza. (A+ will be only given to those students with high participation)

How to ask questions

The instructor and TA will not respond to any emails about general questions related to assignments and course materials. Please use piazza in regard to this matter. For private or sensitive questions you can do private posts on Piazza or email the instructor or TA.

Learn how to ask a question. Asking a question is an art, stackoverflow.com has some good tips.

Group Project

See project.md.

Resources

Assignment Rubric

(Adapted from Nick Ulle and Clark Fitzgerald )

Point values and weights may differ among assignments. This is to indicate what the most important aspects are, so that you spend your time on those that matter most. Check the homework submission page on Canvas to see what the point values are for each assignment.

The grading criteria are correctness, code quality, and communication. The following describes what an excellent homework solution should look like:

Correctness

The report does the following:

solves all the questions contained in the prompt makes conclusions that are supported by evidence in the data discusses efficiency and limitations of the computation cites any sources used The attached code runs without modification.

Code Quality

The code is idiomatic and efficient. Different steps of the data processing are logically organized into scripts and small, reusable functions. Variable names are descriptive. The style is consistent and easy to read.

Communication

Plots include titles, axis labels, and legends or special annotations where appropriate. Tables include only columns of interest, are clearly explained in the body of the report, and not too large. Numbers are reported in human readable terms, i.e. 31 billion rather than 31415926535. Writing is clear, correct English.

Inquisitiveness

The report points out anomalies or notable aspects of the data discovered over the course of the analysis. It discusses assumptions in the overall approach and examines how credible they are. It mentions ideas for extending or improving the analysis or the computation.

sta141b-lectures's People

Contributors

frnliang avatar randy3k avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.