Shopping Mall Customer Data Segmentation Data
I thought it was a fun dataset to work on, I like business and economics and I'd love to delve into trends and look into the data and try to visualize that data.
- Picked dataset
- Defined 10 questions
- Answered 10 questions using Pandas
- Added at least one data visualization (using Matplotlib and/or Seaborn) to each single question
- Prepared presentation slides to present at graduation
-
Question 1: Find and visualize the average spending scores per age group.(Gen Z, Millenials, Gen X, Boomers etc.)
-
Question 2: Find and visualize what percent each age group makes up of mall shoppers.
-
Question 3: Find and visualize the gender makeup of the market and the gender split of each age segmentation.
-
Question 4: How are consumers segmented based on income according to the U.S Cencus Bureau's Income in the United States 2022 Report?
-
Question 5: What is the distribution for Spending Score.
-
Question 6: Of those in the top 25% of spending, how is it split based on gender and age?
-
Question 7: How would this visualization look if it was segmented by gender as well?
-
Question 8: How is income correlated with Spending Score?
- Answer: Reaching this point was quite hard, at first I tried a scatter plot, but that was useless, since the dataset is so big, and going with each average for each income ammount was also useless, since there were so many unique incomes. So I split it up into chunks of 5000 dollars (I also tried 1000, 2000, and 300, but the labels ended up not having enough space).
- Failed Attempt
- Visualization:
-
Question 9: How is the group with 100 Spending score distributed based on gender and age?
-
Question 10: How is the group with 100 Spending score distributed based on income?