Skip to content

Statistics project on the analysis of the household dataset.

Notifications You must be signed in to change notification settings

lorainemg/Household-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Household Analysis

Overview

This project analyzes the "Individual Household Electric Power Consumption" dataset from the UCI Machine Learning Repository. The analysis involves statistical methods, hypothesis testing, regression, clustering, and dimensionality reduction techniques to gain insights into household electricity consumption patterns.

Dataset

  • Source: UCI Machine Learning Repository
  • Timeframe: 4 years of electric power consumption data
  • Attributes: Various electrical parameters such as active power, reactive power, voltage, and current

Analysis Phases

Phase 1: Statistical Analysis

  • Population Sampling & Hypothesis Testing:
    • Creating a normal population from the dataset
    • Extracting samples and comparing variances between attributes
  • Regression Analysis:
    • Identifying linear relationships between variables

Phase 2: Machine Learning Techniques

  • Dimensionality Reduction & Clustering:
    • Applying Principal Component Analysis (PCA)
    • Implementing clustering techniques for better data interpretation
  • Analysis of Variance (ANOVA):
    • Comparing means of specific characteristics across different groups

Technologies Used

  • R Programming Language
  • Statistical Libraries
  • Machine Learning Techniques

Usage

  1. Clone the repository:

    git clone https://github.com/lorainemg/Household-Analysis.git
  2. Open the project in RStudio or your preferred R environment.

  3. Run the scripts in the specified order to reproduce the analysis.

Reports

Documentation about Phase1 and Phase2 of this project can be found at phase1-report and phase2-report

Contributors

License

This project is open-source and available under the MIT License.

Releases

No releases published

Packages

No packages published

Languages