This is a fun little demo using data from Anthropic's HH-RLHF dataset. Check it out: helpfulharmless.com