The responses to the Shopify Summer 2022 Data Science Intern Challenge can be found in links below the question.
On Shopify, we have exactly 100 sneaker shops, and each of these shops sells only one model of shoe. We want to do some analysis of the average order value (AOV). When we look at orders data over a 30 day window, we naively calculate an AOV of $3145.13. Given that we know these shops are selling sneakers, a relatively affordable item, something seems wrong with our analysis.
Think about what could be going wrong with our calculation. Think about a better way to evaluate this data. What metric would you report for this dataset? What is its value?
How many orders were shipped by Speedy Express in total? What is the last name of the employee with the most orders? What product was ordered the most by customers in Germany?