CAPTN

The python scripts in this repository automatically convert images files in a specified folder to .jpeg and generate captions as individual .txt files for each image using eiter the Open AI API with e.g. gpt4o or the Florence2 model downloaded to your machine. Use this script to quickly prepare an image set to be used for FLUX finetuning e.g. with ai-toolkit.

The images and captions are saved to a new subfolder named "JPEGs"

Using the Open AI API

Just rename OAI_CONFIG_LIST.example to OAI_CONFIG_LIST, enter your OpenAI API key and run

pip install -r requirements.txt

and then

python -m image_captioning

you'll be prompted for the folder path and everything else runs automatically.

Using the Florence2 Model locally

Tested with python 3.10

pip install -r requirements_florence.txt

and then

python -m image_captioning_florence

you'll be prompted for the folder path and can optionally provide information on the contents of the images, everything else runs automatically.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
image_captioning.py		image_captioning.py
image_captioning_florence.py		image_captioning_florence.py
keywordreplace.py		keywordreplace.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
requirements_florence.txt		requirements_florence.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CAPTN

Using the Open AI API

Using the Florence2 Model locally

About

Releases

Packages

Languages

License

WismutHansen/CAPTN

Folders and files

Latest commit

History

Repository files navigation

CAPTN

Using the Open AI API

Using the Florence2 Model locally

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages