Humor Arena

It is widely accepted that modern LLMs are bad at humor generation. Even top-tier models tend to memorize and repeat a few simple jokes without any originality.

In our recent paper "Humor Mechanics: Advancing Humor Generation with Multistep Reasoning" (presented at the International Conference on Computational Creativity 2024), we show that an approach based on multistep reasoning can replicate the creative process well enough to generate jokes that blind human labeling rates on par with human-written jokes (a top-quality subset of the "reddit jokes" dataset). For more details, read the full paper on arXiv. We also shared our results and data to facilitate future research.

Now we want to go further: is there a way to improve the reasoning schema? Are some models better at humor generation than others? To investigate, we built this Humor Arena, where we ask people to help us with blind side-by-side labeling.
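As an illustration of how blind side-by-side votes could be turned into a model ranking, here is a minimal sketch. It uses Elo-style rating updates, which is an assumption on our part: this README does not state how the arena aggregates votes. The function name `elo_ratings`, the model names, and the parameters `k` and `base` are all hypothetical.

```python
from collections import defaultdict


def elo_ratings(votes, k=32.0, base=1000.0):
    """Aggregate pairwise votes into Elo-style ratings.

    votes: iterable of (winner, loser) model names, one pair per vote.
    k and base are illustrative defaults, not values used by the arena.
    """
    ratings = defaultdict(lambda: base)
    for winner, loser in votes:
        # Probability that the winner was expected to win, given current ratings.
        expected_win = 1.0 / (1.0 + 10 ** ((ratings[loser] - ratings[winner]) / 400.0))
        # Unexpected wins move the ratings more than expected ones.
        delta = k * (1.0 - expected_win)
        ratings[winner] += delta
        ratings[loser] -= delta
    return dict(ratings)


# Hypothetical votes: each tuple is (preferred model, other model).
votes = [("model_a", "model_b"), ("model_a", "model_c"), ("model_b", "model_c")]
ratings = elo_ratings(votes)
ranking = sorted(ratings, key=ratings.get, reverse=True)
print(ranking)
```

Elo depends on the order in which votes arrive; an order-independent alternative such as a Bradley-Terry fit may be preferable once enough votes are collected.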

Since our goal is to automate machine understanding of humor, here we demonstrate fully automatic sorting of 2000+ automatically generated jokes, publishing the top 50 according to machine scores. This is an early beta version and we are working on improving it, with code and algorithms to be published soon. Enjoy!
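The selection step described above (keep the best 50 out of 2000+ scored jokes) can be sketched as follows. This is only an illustration: the scoring model itself is not published yet, so the scores, the `top_k_jokes` helper, and the sample data are hypothetical.

```python
import heapq


def top_k_jokes(scored_jokes, k=50):
    """Return the k highest-scoring (joke, score) pairs, best first."""
    return heapq.nlargest(k, scored_jokes, key=lambda item: item[1])


# Hypothetical machine scores; higher means funnier.
scored = [("joke A", 0.42), ("joke B", 0.91), ("joke C", 0.13), ("joke D", 0.77)]
top = top_k_jokes(scored, k=2)
print(top)
```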

Reference

@article{tikhonov2024humor,
  title={Humor Mechanics: Advancing Humor Generation with Multistep Reasoning},
  author={Tikhonov, Alexey and Shtykovskiy, Pavel},
  journal={arXiv preprint arXiv:2405.07280},
  year={2024}
}

Build dependencies

brew install tilt google-cloud-sdk node buf openapi-generator

Dev Environment

tilt up

Regenerate protobufs and openapi

make generate

Build and deploy

make deploy

