UCIHAR-Data-Project

Course Project for Gettting & Celaning Data based on Human Activity Recognition Using Smartphones Dataset

This README file explains details around what files are included and what are their features.

Original Data Source

Data for analysis is downloaded from the below URL
https://d396qusza40orc.cloudfront.net/getdata%2Fprojectfiles%2FUCI%20HAR%20Dataset.zip

Files Included in Repository

This repo includes following files

run_analysis.R
CodeBook.md
avgSujectActivities.txt

Dtails of run_analysis.R script

This is the script used to perform analysis on raw data to create a tidy datafile called avgSujectActivities.txt

Functions of `run_analysis.R` Script

Downloads the dataset from the URL mentioned above and unzips it to create UCI HAR Dataset folder
Imports test and train datsets and creates data frames from then and then Merges the training and the test sets to create one data frame.
Extracts a subset of data with only the measurements on the mean mean() and standard deviation std() for each measurement. Also excludes meanFreq()-X measurements or angle measurements where the term mean exists resulting in 66 measurement variables.
Updates the variable names in dataframe variable names for data fame to improve readibility
Appropriately labels the data set with descriptive activity names in place of activity Ids
Reshapes dataset to create a data frame with average of each measurement variable for each activity and each subject
Writes new tidy data frame to a text file to create the required tidy data set file of 180 observations and 68 columns(2 columns for activityName and subjectID and 66 columns for measurement variables)

Running the script

To run the script, you just have to download the script and source the script from your working directory in R. source(run_analysis.R)

Details of CodeBook.md

The code book file describes the variables, the data, and any transformations and work performed to clean up the data.

Details of tidy Dataset file avgSujectActivities.txt

This is the tidy data file created after after running run_analysis.R script on the original data downloaded from
this URL

This tidy dataset contains following

180 observations and 68 columns(2 columns for activityName and subjectID and 66 columns for measurement variables)
Each measurement variable column is average value for each combination of subjectId and activityName

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

UCIHAR-Data-Project

Original Data Source

Files Included in Repository

Dtails of run_analysis.R script

Functions of `run_analysis.R` Script

Running the script

Details of CodeBook.md

Details of tidy Dataset file avgSujectActivities.txt

This tidy dataset contains following

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
CodeBook.md		CodeBook.md
README.md		README.md
avgSujectActivities.txt		avgSujectActivities.txt
run_analysis.R		run_analysis.R

nkaul/UCIHAR-Data-Project

Folders and files

Latest commit

History

Repository files navigation

UCIHAR-Data-Project

Original Data Source

Files Included in Repository

Dtails of run_analysis.R script

Functions of run_analysis.R Script

Running the script

Details of CodeBook.md

Details of tidy Dataset file avgSujectActivities.txt

This tidy dataset contains following

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Functions of `run_analysis.R` Script

Packages