Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Json captioning described in the paper. #170

Open
SRagy opened this issue Jan 7, 2025 · 1 comment
Open

Json captioning described in the paper. #170

SRagy opened this issue Jan 7, 2025 · 1 comment

Comments

@SRagy
Copy link

SRagy commented Jan 7, 2025

In the paper, a captioning procedure is described, including 1) short description, 2) detailed description, 3) background etc... It is mentioned that this is given in a structured way in json format. How precisely is this done for if we want to replicate it for our own prompting?

  1. Do we do, e.g. {"short description": "A cat in some grass", "detailed description": A short-haired Bengal cat ..., "background": ... etc? 2. Do we include the quotation marks?
  2. Are the category names actually given in the json as in the list (short description, etc)?
  3. Were all 7 items in the list given for every caption or were they mixed and matched?

Many thanks for any response you are able to give. The model works wonderfully even with ordinary language captioning, but I would like to experiment with the structured captioning you mention.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants
@SRagy and others