Giter VIP home page Giter VIP logo

yogionbioinformatics / principles-of-data-science Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 2.01 MB

Work done for University of Pittsburgh course "Principles of Data Science" (STAT 1261) with Dr. Junshu Bao in Fall semester of 2018.

Home Page: https://datascience.berkeley.edu/about/what-is-data-science/

License: GNU General Public License v3.0

R 100.00%
data-science data-visualization r r-markdown data-wrangling tidy-data bootstrap regression causal-models machine-learning

principles-of-data-science's Introduction

Principles of Data Science

Noteโ•

Some pdf files come with Rmd (R Markdown) source code files while others do not. The Rmd files are added as supplementary material. For those unfamiliar with R Markdown, the source code is converted into a pdf file which contains the final, well-formatted and fully visualized work.

For those that are confused about data science, please be sure to check the link associated with this repository.

Introduction

Work done for University of Pittsburgh course "Principles of Data Science" (STAT 1261) with Dr. Junshu Bao in Fall semester of 2018.

Units for the course were divided into:

  1. Data Visualization

  2. Data Tidying and Wrangling

  3. Multiple Statistical Models and Bootstrapping

  4. Machine Learning

๐Ÿ’ฅ Units build on top of each other so most units are not mutually exclusive and involve knowledge from previous units.

Folders

๐Ÿ“ Data Visualization/

Contains multiple files showing various data visualizations using packages such as ggplot as well as the generic plot() built-in R function. There are many examples of linear models as well as colorful schematics for how data can be well visualized. The purpose of these is to peak an audience's interest all while empowering the message behind the data.

๐Ÿ“ Data Tidying and Wrangling/

Starting from reasonably easy difficulty and ending with hard, this folder utilizes many different data sets to create meaningful, usable data from very messy origins. The process of this transformation can be accomplished by data tidying and wrangling. This was accomplished using the packages dplyr, tidyr, tidyverse and mdsr.

๐Ÿ“ Multiple Statistical Models and Bootstrapping/

Files inside this folder relate to creation of many types of stastical models to leverage and validate meaningful data from messy data sets. Bootstrapping was also employed on several occassions. Packages used include mdsr, tidyverse, tidyr and broom.

๐Ÿ“ Machine Learning/

Files inside relate to both Machine Learning as well as in-depth results of multiple linear regression. The packages used for this section were glmnet, rpart, rpart.plot and mdsr.

Contact Information

interests

Yogindra Raghav (YogiOnBioinformatics)

[email protected]

principles-of-data-science's People

Contributors

yogionbioinformatics avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.