getdata-016

Getting and Cleaning Data Course Project

The code assumes that the following files are present in the working directory:

activity_labels.txt
features.txt
X_train.txt
X_test.txt
Y_test.txt
Y_train.txt
subject_train.txt
subject_test.txt

The first part of the code reads the activity labels and the feature labels, then loads the data for the test and training sets.

The numeric labels are then converted into the corresponding activity names.
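The conversion from numeric codes to activity names can be sketched as a lookup against the label table. This is a minimal illustration on toy data, not the script's actual code: the data frames, column names, and values below are hypothetical stand-ins for activity_labels.txt and one of the label files.

```r
# Toy stand-in for activity_labels.txt (hypothetical values).
activity_labels <- data.frame(id = 1:3,
                              name = c("WALKING", "SITTING", "LAYING"),
                              stringsAsFactors = FALSE)

# Toy stand-in for a label file: one numeric activity code per observation.
y <- data.frame(V1 = c(3, 1, 2, 1))

# Look up each numeric code in the id column and take the matching name.
activity <- activity_labels$name[match(y$V1, activity_labels$id)]
print(activity)
```

Using match keeps the original row order, which matters because the labels are later attached to the measurement rows by position.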

Once that is complete, the subject files for the test and training sets are read and bound to the appropriate data sets. The test and training data are then bound together into a single data set, and the appropriate column names are applied.
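The binding step might look roughly like the sketch below, assuming the activity and subject columns are placed first (which matches the column positions described later). All values and object names here are made up for illustration.

```r
# Toy feature names, standing in for features.txt (hypothetical subset).
features <- c("tBodyAcc-mean()-X", "tBodyAcc-std()-X", "tBodyAcc-max()-X")

# Toy measurement tables standing in for X_train.txt and X_test.txt.
x_train <- data.frame(matrix(1:6, nrow = 2))
x_test  <- data.frame(matrix(7:9, nrow = 1))

# Column-bind activity and subject onto each set, then stack the two sets.
train <- cbind(activity = c("WALKING", "SITTING"), subject = c(1L, 3L), x_train)
test  <- cbind(activity = "WALKING", subject = 2L, x_test)
full  <- rbind(train, test)

# Apply descriptive column names: the two id columns, then the feature names.
names(full) <- c("activity", "subject", features)
print(full)
```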

The next step is to identify all of the columns whose names contain mean() or std(). This is done with the function grepl. The data set is then subset to those columns and stored in a new data set called clean.
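A minimal sketch of the column selection, on toy data with hypothetical values. The regular expression is an assumption about how the match is written: the parentheses are escaped so that only literal mean() and std() match, which excludes names such as meanFreq().

```r
# Toy combined data set with a few representative column names.
full <- data.frame(a = c("WALKING", "SITTING"), s = 1:2,
                   m = c(0.1, 0.2), sd = c(0.3, 0.4),
                   mf = c(0.5, 0.6), mx = c(0.7, 0.8))
names(full) <- c("activity", "subject",
                 "tBodyAcc-mean()-X", "tBodyAcc-std()-X",
                 "fBodyAcc-meanFreq()-X", "tBodyAcc-max()-X")

# TRUE for every column name containing a literal "mean()" or "std()".
keep <- grepl("mean\\(\\)|std\\(\\)", names(full))

# Always keep the two identifier columns as well.
keep[1:2] <- TRUE

clean <- full[, keep]
print(names(clean))
```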

Finally, the data is summarized with the function aggregate: columns 3 to 68 are aggregated, grouped by the first two columns, which hold the activity and the subject.
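The summary step can be sketched as follows. The toy data has only two measurement columns instead of 66, and using mean as the summary function is an assumption, since the description above does not name it.

```r
# Toy version of the clean data set: two id columns, two measurement columns.
clean <- data.frame(activity = c("WALKING", "WALKING", "SITTING", "WALKING"),
                    subject  = c(1L, 1L, 1L, 2L),
                    m = c(1, 3, 5, 7),
                    s = c(2, 4, 6, 8))

# Summarize every measurement column per (activity, subject) pair.
# FUN = mean is an assumed choice of summary function.
tidy <- aggregate(clean[, 3:ncol(clean)],
                  by = list(activity = clean$activity, subject = clean$subject),
                  FUN = mean)
print(tidy)
```

Each row of the result corresponds to one activity and subject combination that occurs in the data.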

Once this is complete, write.table is called to write the output text file.
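Writing the result could look like the sketch below. The file name tidy_data.txt and the row.names = FALSE argument are assumptions, not details taken from the script, and the file is written to a temporary directory so the example has no side effects.

```r
# Toy summarized data (hypothetical values).
tidy <- data.frame(activity = c("SITTING", "WALKING"),
                   subject  = c(1L, 1L),
                   m = c(5, 2))

# Hypothetical output path inside a temporary directory.
out_file <- file.path(tempdir(), "tidy_data.txt")

# Write without row names so the file holds only the data columns.
write.table(tidy, file = out_file, row.names = FALSE)

# Read the file back to confirm that it round-trips.
check <- read.table(out_file, header = TRUE)
print(check)
```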
