L2-05: AI Image Captioning and Storytelling

Overview

Generate textual and audio stories from an image.

Storyteller Pipeline

    flowchart LR
        subgraph Pipeline
            direction LR
            subgraph In
                direction TB
                A([Story Prompt])
                B[/Image/]
            end 
            subgraph Img2Story
                direction LR
                C(Img-2-Txt)--caption-->D(Txt-2-Txt)
                D--story-->E(Txt-2-Audio)
            end 
            A-->Img2Story
            B-->Img2Story
            Img2Story-->X([Read Story])
            Img2Story-->Z([Play Story])
        end
    style C stroke:#f11,stroke-width:2px
    style D stroke:#1f1,stroke-width:2px
    style E stroke:#11f,stroke-width:2px
    
    style In fill:#ccc
    style Img2Story fill:#bdf

Description

| Module | Description |
| --- | --- |
| Img-2-Txt | Generates a caption for the image |
| Txt-2-Txt | Creates cool stories from the image caption |
| Txt-2-Audio | Converts the textual story to audio |
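
The three modules chain together as shown in the flowchart: image to caption, caption to story, story to audio. Below is a minimal sketch of that wiring, not the project's actual implementation. The `Storyteller` class, the `StoryResult` type, and the three `fake_*` stand-ins are hypothetical names introduced for illustration. Passing the story prompt to the Txt-2-Txt step is also an assumption, since the diagram only shows the prompt entering the Img2Story stage as a whole. In a real setup each stand-in would be replaced by a call to the model of your choice.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class StoryResult:
    """Outputs of one pipeline run (hypothetical container)."""
    caption: str
    story: str
    audio: bytes


@dataclass
class Storyteller:
    """Chains the three modules: Img-2-Txt -> Txt-2-Txt -> Txt-2-Audio."""
    img2txt: Callable[[bytes], str]       # image bytes -> caption
    txt2txt: Callable[[str, str], str]    # (prompt, caption) -> story
    txt2audio: Callable[[str], bytes]     # story -> audio bytes

    def run(self, image: bytes, prompt: str) -> StoryResult:
        caption = self.img2txt(image)
        # Assumption: the story prompt is combined with the caption here.
        story = self.txt2txt(prompt, caption)
        audio = self.txt2audio(story)
        return StoryResult(caption=caption, story=story, audio=audio)


# Stand-in modules so the sketch runs without any model; replace with real ones.
def fake_caption(image: bytes) -> str:
    return f"an image of {len(image)} bytes"


def fake_story(prompt: str, caption: str) -> str:
    return f"{prompt} Once upon a time, there was {caption}."


def fake_tts(story: str) -> bytes:
    # Not real speech synthesis: just encodes the text as bytes.
    return story.encode("utf-8")


pipeline = Storyteller(fake_caption, fake_story, fake_tts)
result = pipeline.run(b"\x00\x01\x02", "Tell a short story.")
print(result.caption)  # feeds "Read Story" together with result.story
print(result.story)
```

Keeping each module behind a plain callable means any one stage can be swapped without touching the other two. `result.story` corresponds to the "Read Story" output and `result.audio` to "Play Story".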

Contributing

Contributions to this project are welcome. You can add new projects, improve existing ones, or fix bugs and errors.

Please follow these steps to contribute:

  • Fork this repository and clone it to your local machine.
  • Create a new branch with a descriptive name for your contribution.
  • Add your code and files to the branch and commit your changes.
  • Push your branch to your forked repository and create a pull request to the main repository.
  • Wait for your pull request to be reviewed and merged.

References