Analyze A/B Test Results

An Udacity project regarding the course Data Analyst Nanodegreee.

Project Overview

This project aims to give you the understanding of the results of an A/B test run by an e-commerce website. The company developed a new web page in order to try and increase the number of users who decide to pay for the company's products (this is called convert) The goal of the project is to help the company decide whether to keep the old page or implement the new page.

The project takes you through 3 parts:

Probability
A/B Test
Regression testing

The user must go through all of these to explore the data and arrive at meaningful conclusions.

Results

for every part we got a result on the same dataset:

Part I: Probability

From using simple probability we can conclude that a user in the control group had a 12.04% chance to convert, whereas a user in the treatment group had an 11.88% chance to convert. From this we may conclude that the control page is better purely due to statistics. However it is a very insignificant difference (0.16%) to decide a certain page will cause more impact.

Part II: A/B Test

Here we create our hypothesis notation:

H0: Pnew − Pold ≤ 0

H1: Pnew − Pold > 0

At the end of the section, we calculated the p-value to be 90.62%. Given a large p-value it suggests that we wouldn't move away from the null hypothesis. Also, the p-value is larger than 5% (Type I Error rate). From that, we fail to reject the null hypothesis. Meaning we cannot conclude that the new page converts more users than the old page.

Part III: Regression

For regression part, we had to use the statsmodel library using the LOGISTIC REGRESSION method. The main difference Part III has on Part II is that in the previous part we did a one-tailed hypothesis test. Whereas in Part III, we do a two-tail test:

Part III Hypothesis:

H0: Pnew − Pold = 0

H1: Pnew − Pold ≠ 0

The p-value that we found above is 0.19. Which means that we also Fail to reject the null hypothesis since our Type I error is 0.05 and in order to reject we need to have it less than 0.05.

Tools

pandas
numpy
random
matplotlib
statsmodel

Statistical Methods and Techniques

bootstrapping
Z-test and Critical Z score
P-value
Logistic Regression and Dummy Creation
Hypothesis Testing

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Analyze_ab_test_results_notebook.html		Analyze_ab_test_results_notebook.html
Analyze_ab_test_results_notebook.ipynb		Analyze_ab_test_results_notebook.ipynb
Analyze_ab_test_results_notebook.pdf		Analyze_ab_test_results_notebook.pdf
README.md		README.md
ab_data.csv		ab_data.csv
countries.csv		countries.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Analyze A/B Test Results

Project Overview

Results

Part I: Probability

Part II: A/B Test

Part III: Regression

Tools

Statistical Methods and Techniques

About

Releases

Packages

Languages

AOM98/Analyze-AB-Test-Results

Folders and files

Latest commit

History

Repository files navigation

Analyze A/B Test Results

Project Overview

Results

Part I: Probability

Part II: A/B Test

Part III: Regression

Tools

Statistical Methods and Techniques

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages