Giter VIP home page Giter VIP logo

stats506_proj_02's Introduction

STATS 506 Group Project F19

Author: Group 2

Diana Liang (STATA)

Sijun Zhang (dplyr_glmnet)

Wenjing Zhou (data.table_customized_cross-validation)

Navigation

Report

The completed report in .html and .Rmd files are stored in Final folder.

Data Sets

We’re using a combination of 4 datasets (Demographics, Blood Pressure, Total Nutrients day 1, Total Nutrients day 2) from the 2015-2016 NHANES, which are stored in DATA folder.

Code Reviews

We are working on both the issues and commit code reviews. Here some instances about code reviews README.md, STATA_LASSO_penalty, README.md_formula_editing and Interactions_glmnet.

Scripts

The final scripts path are linked in the Author part and you can find running guidence in each folder's readme.md.

Overview

We choose to investigate whether both genders will react the same under the the effect of consumption habits on blood pressure, a known symptom of different chronic diseases. Will the factors that are most important in determining blood pressure be different between males the females?

Whether both genders will react the same under the the effect of consumption habits on blood pressure

We will show in the following analysis that there are certain foods that affect one gender more than the other, and that these foods change depending on the type of blood pressure being measured.

Method: LASSO with customized penalty.factor

Since our purpose is to find how gender effect the relationship between the nutrition intake and the blood pressure measurement, we only penalize the interaction terms by setting the penalty.factor of the interaction terms with a same positive value and let other terms' zero. The explict form for LASSO minizing goal is shaped to

where is the penalty factor we used for each term

Requirements

To run the group_2_final.Rmd file in Draft folder, the following packages should be pre-installed in the IDE.

Version Package
3.6.1 dplyr
3.6.1 ggplot2
2.1.3 tibble
1.3.1 readr
1.0.0 tidyr
1.4.0 stringr
0.4.0 forcats
4.3-0 Hmisc
1.6.0 SASxport
2.0-18 glmnet
1.12.2 data.table
0.4.0 Statamarkdown
1.1.0 kableExtra
x.x.x doMC

doMC is only available in UNIX-like System. The installation command has been embedded in the group_2_final.Rmd file, thus the UNIX-like System user can knit the group_2_final.Rmd directly after meeting the requirements other than doMC.

Install Statamarkdown

As Statamarkdown doesn't support direct downloading using install.packages(), we can use the devtools package to install it from github.com.

library(devtools) # before this you may need to install devtools
install_github("hemken/Statamarkdown")

If for some reason that gives you problems, you can also install from this website

# For Windows
install.packages("https://www.ssc.wisc.edu/~hemken/Stataworkshops/Stata%20and%20R%20Markdown/Statamarkdown_0.3.9.zip", repos=NULL)

# For linux or Mac
install.packages("https://www.ssc.wisc.edu/~hemken/Stataworkshops/Stata%20and%20R%20Markdown/Statamarkdown_0.3.9.tar.gz", type="source", repos=NULL)

stats506_proj_02's People

Contributors

diana-liang avatar randyzhang98 avatar wenjzh avatar

Watchers

 avatar  avatar  avatar

Forkers

wenjzh

stats506_proj_02's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.