Getting and Cleaning Data - Course Project

Human Activity Recognition Using Smartphones

Data Set General Information

The experiments were carried out with a group of 30 volunteers aged 19 to 48 years. Each person performed six activities (WALKING, WALKING_UPSTAIRS, WALKING_DOWNSTAIRS, SITTING, STANDING, LAYING) while wearing a smartphone (Samsung Galaxy S II) on the waist. Using its embedded accelerometer and gyroscope, the dataset authors captured 3-axial linear acceleration and 3-axial angular velocity at a constant rate of 50 Hz. The experiments were video-recorded so that the data could be labeled manually. The resulting dataset was randomly partitioned into two sets, with 70% of the volunteers selected to generate the training data and 30% the test data. Additional information is available here:

http://archive.ics.uci.edu/ml/datasets/Human+Activity+Recognition+Using+Smartphones

Data Set Files Description

The raw data set contains 10299 instances partitioned into train and test data. It is organized into several files:

activity_labels.txt : mapping table for activities (ID, Label)
features.txt : variable names
subject_test.txt and subject_train.txt : identifier of the subject who carried out the experiment.
X_test.txt and X_train.txt : 561 variables derived from the triaxial acceleration measured by the accelerometer (total acceleration), the estimated body acceleration, and the triaxial angular velocity from the gyroscope. For axis-specific variables, the axis is given by the last character of the variable name: X, Y or Z.
y_test.txt and y_train.txt : identifier of the activity performed.
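
To make the relationship between these files concrete, here is a minimal sketch in R of how they might line up. It is an illustration, not the code in run_analysis.R: the inline strings are made-up stand-ins for the real files, and the object names (features, labels, combined and so on) are assumptions chosen for this example.

```r
# Toy stand-ins for the real files; every value here is invented.
features_txt <- "1 tBodyAcc-mean()-X\n2 tBodyAcc-std()-X\n3 tBodyAcc-max()-X"
labels_txt   <- "1 WALKING\n2 WALKING_UPSTAIRS"
x_txt        <- "0.28 -0.99 -0.93\n0.27 -0.98 -0.94"
y_txt        <- "1\n2"
subject_txt  <- "2\n2"

# features.txt and activity_labels.txt are two-column lookup tables.
features <- read.table(text = features_txt, col.names = c("id", "name"),
                       stringsAsFactors = FALSE)
labels   <- read.table(text = labels_txt, col.names = c("id", "label"),
                       stringsAsFactors = FALSE)

# X_*.txt holds one row per observation and one column per feature.
x <- read.table(text = x_txt)
names(x) <- features$name

# y_*.txt and subject_*.txt hold one value per observation, in the same row order as X.
y       <- read.table(text = y_txt, col.names = "activity_id")
subject <- read.table(text = subject_txt, col.names = "subject")

# Translate activity IDs into their labels, then put the pieces side by side.
activity <- labels$label[match(y$activity_id, labels$id)]
combined <- cbind(subject, activity = activity, x)
print(combined)
```

With the real data, each `text = ...` argument would be replaced by a file path, for example `file.path("UCI_HAR_Dataset", "features.txt")`; the exact location of each file inside the extracted dataset is an assumption to check against your copy.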

Script Data Manipulations

Prerequisite: the UCI HAR Dataset must be extracted into a subdirectory of the repo named "UCI_HAR_Dataset".
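
A quick way to confirm this before running the script is a small check like the one below. This is a hedged sketch: the helper name dataset_ready is hypothetical, and it assumes that features.txt sits directly inside the extracted directory.

```r
# Hypothetical helper: TRUE when the dataset directory exists and contains
# features.txt (assumed to sit at the top level of the extracted dataset).
dataset_ready <- function(dir = "UCI_HAR_Dataset") {
  dir.exists(dir) && file.exists(file.path(dir, "features.txt"))
}

if (!dataset_ready()) {
  message("UCI_HAR_Dataset not found; extract the dataset before running run_analysis.R")
}
```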

The run_analysis.R script performs the following steps:

reads in and combines the test and training data for subjects, features and activities

keeps only the variables that measure a mean or a standard deviation

merges the subject, feature and activity data into one data frame

reshapes the data to give the mean of every column per subject and per activity, and exports the resulting tidy data set to the text file tidy_data.txt, which is also available in the repo

The resulting tidy data set contains 68 variables which are described in Codebook.md.
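
The script steps listed above can be sketched on toy data as follows. This is not the code in run_analysis.R, only one plausible way to carry out the same four steps in base R. The helper make_part, the toy values, the regular expression used to pick mean and standard deviation columns, and the write.table options are all assumptions made for this example.

```r
# Three toy feature names; the real data set has 561.
feature_names <- c("tBodyAcc-mean()-X", "tBodyAcc-std()-X", "tBodyAcc-max()-X")

# Hypothetical helper that builds one partition (train or test) from toy values.
make_part <- function(subject, activity, values) {
  x <- as.data.frame(matrix(values, ncol = length(feature_names), byrow = TRUE))
  names(x) <- feature_names
  list(subject  = data.frame(subject = subject),
       activity = data.frame(activity = activity, stringsAsFactors = FALSE),
       x        = x)
}

train <- make_part(c(1, 1), c("WALKING", "WALKING"), c(0.2, -0.9, 0.5, 0.4, -0.7, 0.7))
test  <- make_part(c(2, 2), c("SITTING", "SITTING"), c(0.1, -0.8, 0.3, 0.3, -0.6, 0.5))

# Step 1: stack the training and test rows for subjects, activities and features.
subject  <- rbind(train$subject,  test$subject)
activity <- rbind(train$activity, test$activity)
x        <- rbind(train$x,        test$x)

# Step 2: keep only columns whose name contains mean() or std() (assumed pattern).
keep <- grepl("mean\\(\\)|std\\(\\)", names(x))
x <- x[, keep, drop = FALSE]

# Step 3: merge subject, activity and feature data into one data frame.
merged <- cbind(subject, activity, x)

# Step 4: average every measurement per subject and per activity, then export.
tidy <- aggregate(merged[, names(x), drop = FALSE],
                  by = list(subject = merged$subject, activity = merged$activity),
                  FUN = mean)

# Written to a temporary directory here; the README's script writes tidy_data.txt in the repo.
out_file <- file.path(tempdir(), "tidy_data.txt")
write.table(tidy, out_file, row.names = FALSE)
```

On the toy data this yields one row per subject and activity pair and four columns: subject, activity and the two retained measurements. If the same literal `mean()` / `std()` pattern is applied to the real feature names, it should leave 66 measurement columns, which together with subject and activity would match the 68 variables reported above; treat that as a consistency check rather than a statement about how the script selects columns.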
