Skip to content

Latest commit

 

History

History
71 lines (55 loc) · 3.4 KB

README.md

File metadata and controls

71 lines (55 loc) · 3.4 KB

Wine Market Analysis

Python Pandas Streamlit Docker SQL Dbeaver

📝 Description

This project is about the company Wiwino, proudly active in the wine industry. Data about wines from users for years is gathered and stored in a sqlite database. With this as starting point following questions should be answered:

  1. We want to highlight 10 wines to increase our sales. Which ones should we choose and why?
  2. We have a limited marketing budget for this year. Which country should we prioritise and why?
  3. We would like to give awards to the best wineries. Come up with 3 relevant ones. Which wineries should we choose and why?
  4. We detected that a big cluster of customers likes a specific combination of tastes. We identified a few keywords that match these tastes: coffee, toast, green apple, cream, and citrus (note that these keywords are case sensitive ⚠️). We would like you to find all the wines that are related to these keywords. Check that at least 10 users confirm those keywords, to ensure the accuracy of the selection. Additionally, identify an appropriate group name for this cluster.
  5. We would like to select wines that are easy to find all over the world. Find the top 3 most common grapes all over the world and for each grape, give us the the 5 best rated wines.
  6. We would like to create a country leaderboard. Come up with a visual that shows the average wine rating for each country. Do the same for the vintages.
  7. One of our VIP clients likes Cabernet Sauvignon and would like our top 5 recommendations. Which wines would you recommend to him?

The original data is not in a good shape and so some fixing of issues and remodelling will need to be done.

From the final OLAP database the questions will be answered and presented on a dashboard in streamlit.

Table of Contents

🤖 File Structure

├── README.md
├── assets
├── data
│   ├── db
│   │   ├── fixed.db
│   │   ├── olap.db
│   │   └── raw.db
│   └── sql
│       └── fix-db.sql
├── requirements.txt
└── src
    ├── config.py
    ├── manage-db.py
    ├── streamlit_app.py
    └── utils
        ├── db.py
        └── get_results.py

🤖 OLAP Database

A fact and dimension model was created:

alt text

🎈 Streamlit

Streamlit Link

🔍 Contributors

🕐 Timeline

This project was created in 5 days.