The purpose of this project is to demonstrate your ability to collect, work with, and clean a data set.
The detail of the data can be found here
This repository contains
data_mean_summary.txt
: tidy data with the average of each variable for each activity and each subjectrun_analysis.R
: the R script used to create the final tidy dataCodeBook.md
: The code book
The R script called run_analysis.R does the following:
- Read the data and merges the training and the test sets to create one data set.
- Extracts only the measurements on the mean and standard deviation for each measurement (including tBodyAcc, tGravityAcc and tBodyGyro).
- Uses descriptive activity names to name the activities in the data set
- Appropriately labels the data set with descriptive variable names.
- From the data set in step 4, creates a second, independent tidy data set with the average of each variable for each activity and each subject.
To run this R script, please place the unzipped downloaded data folder (UCI HAR Dataset
) in the root directory.