Giter VIP home page Giter VIP logo

gacd's Introduction

Getting and Cleaning Data - Course Project

Human Activity Recognition Using Smartphones

Data Set General Information

The experiments have been carried out with a group of 30 volunteers within an age bracket of 19-48 years. Each person performed six activities (WALKING, WALKING_UPSTAIRS, WALKING_DOWNSTAIRS, SITTING, STANDING, LAYING) wearing a smartphone (Samsung Galaxy S II) on the waist. Using its embedded accelerometer and gyroscope, we captured 3-axial linear acceleration and 3-axial angular velocity at a constant rate of 50Hz. The experiments have been video-recorded to label the data manually. The obtained dataset has been randomly partitioned into two sets, where 70% of the volunteers was selected for generating the training data and 30% the test data. Additional info available here :

http://archive.ics.uci.edu/ml/datasets/Human+Activity+Recognition+Using+Smartphones

Data Set Files Description

The raw data set contains 10299 instances partitioned into train and test data. It is organized into several files:

  1. activity_labels.txt : mapping table for activities (ID, Label)
  2. features.txt : variables names
  3. subject_test.txt and subject_train.txt : identifier of the subject who carried out the experiment.
  4. X_test.txt and X_train.txt : 561 variables storing measurement of triaxial acceleration from the accelerometer (total acceleration) and the estimated body acceleration and triaxial Angular velocity from the gyroscope. Axis is identified in the last character of the variable name as X, Y or Z.
  5. y_test.txt and y_train.txt : identifier of the activity performed.

Script Data Manipulations

Pre-requisites: The UCI HAR Dataset must be extracted in the repo subdirectory called "UCI_HAR_Dataset".

The run_analysis.R sript will do:

  • read in and combines test and training data for subjects, features and activities
  • keep data for only the variables related to measurement of mean or standard deviation
  • merge subject, features and activity data into one data frame
  • reshape data to present the means of all the columns per test subject and per activity and export the new tidy data set into a text file tidy_data.txt also available in the repo.

The resulting tidy data set contains 68 variables which are described in Codebook.md.

gacd's People

Contributors

miclae76 avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.