Image Captioning Using Deep Learning algorithms

By Ahmed Amine MAJDOUBI

In this project we will be tackeling the problem of generating image captions. So we will be mainly using image and text processing algorithms. This is a very interesting area of deep learning where humans excel at, which is compressing huge amount of images to just a few descriptive phrases. In this project, we will be using the 'flickr30k_images' dataset along with a csv file containing reference captions.

We will start by performing a descriptive analysis of our data (both the images and the caption phrases), then we will proceed by writting out the problem we will be solving. The next step is to create our own algorithm to generate captions, and train it on our training dataset. Then we will define a specific accuracy metric to evaluate our model on the test dataset. Finaly we will perform a qualitative analysis of our results, and end the notebook with a conclusion on our work.

Important Note: If you want to run this notebook, you should do it using Google Collaboration service. Also you need to have a directory in your drive named 'Features' to store the features that we will be creating, and another one inside it named 'models' to store the deep learning models we will create. You also need to have a folder named 'flickr30k_images' that contains the 'results.csv' file and another folder with the name 'flickr30k_images' that contains all the flickr30k images. Here is the structure of the files in my drive:

/content/drive
Path to your 'flickr30k_images' folder (for me, it's students-share/Projet)
- /flickr30k_images
  - Results.csv
  - /flickr30k_images (this folder contains all the flickr30k images)
Path to your 'Features' folder (for me, it's /My Drive)
- /Features
  - /models

You can get the first flickr30k_images folder from this link and the 'Results.csv' file from my github.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
DL_Project_Majdoubi.ipynb		DL_Project_Majdoubi.ipynb
README.md		README.md
results.csv		results.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image Captioning Using Deep Learning algorithms

By Ahmed Amine MAJDOUBI

About

Releases

Packages

Languages

Mjidiba97/Images-Captions-Generation

Folders and files

Latest commit

History

Repository files navigation

Image Captioning Using Deep Learning algorithms

By Ahmed Amine MAJDOUBI

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages