The data comes from DataCo Global's supply chain dataset, which records the company's transactions with customers. The dataset has 180,519 rows and 53 attributes covering order, shipping, and sales information, with a mix of text and numeric data such as order locations and sales figures. Specifically, there are 24 character columns and 28 numeric columns.
Data source: https://data.mendeley.com/datasets/8gx2fvg2k6/5/files/72784be5-36d3-44fe-b75d-0edbf1999f65
Based on the dataset, we need to report on overall sales and on each product broken down by the different attributes in the dataset, and to comment on how products and transportation methods perform so that efficiency can be improved. It is also possible to compare sales performance across countries and regions.
- Visual Studio 2019
- SQL Server Integration Services
- SQL Server Express 2019
- Power BI
- ...
- Using SSIS to build the data warehouse
• There are two fact tables, FactSales and FactDelivery, both of the transaction type. The business process behind each fact table:
FactSales: used to analyze and report the sales, profit, and quantity sold of each product by each attribute.
FactDelivery: used to report on the shipping of orders to customers.
• The dimension tables are: DimDepartment, DimShippingMode, DimTime, DimMarket, DimRegion, DimSegment, DimCustomer, DimCountry, DimCity, DimCategory
[Figures: FactSales schema, FactDelivery schema, and the combined constellation schema]
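To make the constellation layout concrete, here is a minimal sketch in Python using an in-memory SQLite database rather than SQL Server. Only the table names come from this project; every column name, the subset of dimensions shown, and the choice of which dimensions each fact table references are assumptions made for illustration.

```python
import sqlite3

def build_constellation(conn):
    """Create a small constellation schema: two fact tables sharing dimensions.

    Column names and the fact-to-dimension links are hypothetical.
    """
    conn.executescript("""
    CREATE TABLE DimTime (TimeKey INTEGER PRIMARY KEY, OrderDate TEXT);
    CREATE TABLE DimMarket (MarketKey INTEGER PRIMARY KEY, MarketName TEXT);
    CREATE TABLE DimShippingMode (ShippingModeKey INTEGER PRIMARY KEY, ModeName TEXT);

    CREATE TABLE FactSales (
        SalesKey INTEGER PRIMARY KEY,
        TimeKey INTEGER REFERENCES DimTime(TimeKey),
        MarketKey INTEGER REFERENCES DimMarket(MarketKey),
        Quantity INTEGER,
        Sales REAL,
        Profit REAL
    );

    CREATE TABLE FactDelivery (
        DeliveryKey INTEGER PRIMARY KEY,
        TimeKey INTEGER REFERENCES DimTime(TimeKey),
        MarketKey INTEGER REFERENCES DimMarket(MarketKey),
        ShippingModeKey INTEGER REFERENCES DimShippingMode(ShippingModeKey),
        DaysScheduled INTEGER,
        DaysActual INTEGER
    );
    """)

def referenced_tables(conn, fact_table):
    """Return the set of tables that fact_table points to via foreign keys."""
    rows = conn.execute(f"PRAGMA foreign_key_list({fact_table})").fetchall()
    return {row[2] for row in rows}  # column 2 holds the referenced table name

def shared_dimensions(conn):
    """Dimensions referenced by both fact tables (what makes it a constellation)."""
    return referenced_tables(conn, "FactSales") & referenced_tables(conn, "FactDelivery")

if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    build_constellation(conn)
    print(sorted(shared_dimensions(conn)))
```

In this sketch the two fact tables share DimTime and DimMarket, while DimShippingMode is referenced only by FactDelivery. The real warehouse may link the dimensions differently.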
Integrate data into the warehouse (SSIS)
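In this project the load is done with SSIS packages. As a rough illustration of the underlying idea only, the Python sketch below loads a dimension first, assigns surrogate keys, and then inserts fact rows that reference those keys. It is a stand-in, not the SSIS implementation: the sample rows, the lowercase column names, and the helper `get_or_create_key` are all hypothetical.

```python
import sqlite3

# Made-up sample rows standing in for the raw source file.
RAW_ROWS = [
    {"market": "Europe", "sales": 120.0, "quantity": 2},
    {"market": "LATAM", "sales": 80.0, "quantity": 1},
    {"market": "Europe", "sales": 50.5, "quantity": 3},
]

def create_tables(conn):
    """Create one dimension and one fact table (simplified, assumed columns)."""
    conn.executescript("""
    CREATE TABLE DimMarket (id INTEGER PRIMARY KEY, name TEXT UNIQUE);
    CREATE TABLE FactSales (
        market_id INTEGER REFERENCES DimMarket(id),
        sales REAL,
        quantity INTEGER
    );
    """)

def get_or_create_key(conn, name):
    """Return the surrogate key for a market, inserting the row if it is new."""
    row = conn.execute("SELECT id FROM DimMarket WHERE name = ?", (name,)).fetchone()
    if row is not None:
        return row[0]
    cur = conn.execute("INSERT INTO DimMarket (name) VALUES (?)", (name,))
    return cur.lastrowid

def load(conn, rows):
    """Load each raw row: resolve the dimension key, then insert the fact row."""
    for r in rows:
        key = get_or_create_key(conn, r["market"])
        conn.execute(
            "INSERT INTO FactSales (market_id, sales, quantity) VALUES (?, ?, ?)",
            (key, r["sales"], r["quantity"]),
        )
    conn.commit()

if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    create_tables(conn)
    load(conn, RAW_ROWS)
    print(conn.execute("SELECT COUNT(*) FROM DimMarket").fetchone()[0])
    print(conn.execute("SELECT COUNT(*) FROM FactSales").fetchone()[0])
```

Resolving dimension keys before inserting facts means repeated values such as a market name are stored once and referenced by key from every fact row.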
After the ETL process completes and the data warehouse is built, we use Power BI to visualize and analyze the data.
- Revenue dashboard
- Shipping status dashboard
- Product management dashboard
This dashboard shows the total quantity of products sold, which can be viewed by market, region, country, or time period. A map chart tracks the number of products sold in each region and supports comparisons between regions, markets, or countries.
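The slicing described above amounts to grouping quantities at a chosen level with an optional time filter. The sketch below shows that aggregation in Python over an in-memory SQLite table. It is not how Power BI computes the visuals, and the flat `sales` table, its columns, the function `quantity_by`, and the sample rows are all assumptions for illustration.

```python
import sqlite3

# Made-up rows: (market, region, country, order_date, quantity)
SAMPLE = [
    ("Europe", "Western Europe", "France", "2017-03-01", 2),
    ("Europe", "Western Europe", "Germany", "2018-01-15", 5),
    ("LATAM", "South America", "Brazil", "2017-07-09", 1),
    ("Europe", "Northern Europe", "Sweden", "2017-11-20", 4),
]

def make_db(rows):
    """Build an in-memory table holding the denormalized sample rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE sales (market TEXT, region TEXT, country TEXT, "
        "order_date TEXT, quantity INTEGER)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?, ?, ?)", rows)
    return conn

def quantity_by(conn, level, year=None):
    """Total quantity grouped by market, region, or country, optionally for one year."""
    if level not in {"market", "region", "country"}:
        raise ValueError(f"unsupported level: {level}")
    sql = f"SELECT {level}, SUM(quantity) FROM sales"
    params = ()
    if year is not None:
        # order_date is stored as ISO text, so the first four characters are the year
        sql += " WHERE substr(order_date, 1, 4) = ?"
        params = (str(year),)
    sql += f" GROUP BY {level}"
    return dict(conn.execute(sql, params).fetchall())

if __name__ == "__main__":
    conn = make_db(SAMPLE)
    print(quantity_by(conn, "market"))
    print(quantity_by(conn, "region", year=2017))
```

The `level` argument is checked against a fixed set before being placed in the query text, since column names cannot be passed as bound parameters.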