This repo contains analyses of three datasets, each in its own subfolder. Each analysis includes an exploratory data analysis (EDA), K-Means clustering with a classical encoding of the categorical features, and K-Means clustering with embeddings.
The micropublication describes the project and summarizes key takeaways.
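
To give a rough idea of the two clustering setups compared in each analysis, here is a minimal, dependency-free sketch. It assumes that the "classical" encoding is one-hot encoding and that an embedding is simply a dense vector per category; neither assumption is confirmed by this README. The function names (`one_hot`, `kmeans`), the toy column, and the embedding values are hypothetical and are not taken from the notebooks.

```python
import random


def one_hot(values):
    """One-hot encode a list of category labels (columns in sorted label order)."""
    categories = sorted(set(values))
    index = {c: i for i, c in enumerate(categories)}
    return [[1.0 if index[v] == j else 0.0 for j in range(len(categories))]
            for v in values]


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, n_iter=20, seed=0):
    """Plain Lloyd's algorithm; returns (labels, centroids)."""
    rng = random.Random(seed)
    # Initialise centroids from k randomly chosen input points.
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = [0] * len(points)
    for _ in range(n_iter):
        # Assignment step: each point goes to its nearest centroid.
        labels = [min(range(k), key=lambda c: sq_dist(p, centroids[c]))
                  for p in points]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Hypothetical toy column; not one of the repo's datasets.
colors = ["red", "red", "blue", "blue", "green", "green"]

# Setup 1: K-Means on one-hot vectors (assumed "classical" encoding).
onehot_labels, _ = kmeans(one_hot(colors), k=3)

# Setup 2: K-Means on dense vectors. The table below is made up for
# illustration; in practice the vectors would come from a trained model.
toy_embedding = {"red": [0.9, 0.1], "blue": [0.1, 0.9], "green": [0.5, 0.5]}
embedded_labels, _ = kmeans([toy_embedding[c] for c in colors], k=3)
```

The same `kmeans` function serves both setups; only the vector representation of the categorical column changes, which is the comparison each analysis makes. A library implementation such as scikit-learn's `KMeans` would normally replace the hand-written loop.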