The task involved identifying the objects in the dataset, via creation of various Machine Learning models, namely- Convolutional Neural Nets, Transfer Learning Models [MobileNetV2, Densenet and VGG16]. The performance of the various models were compared and appropriate conclusions were drawn.
The dataset used is the IDD or, Indian Driving Dataset. The dataset consists of images obtained from a front facing camera attached to a car. The car was driven around Hyderabad, Bangalore cities and their outskirts. The images are mostly of 1080p resolution, but there is also some images with 720p and other resolutions. The total number of images are 10,003. The team worked on 673 images, a curtailed dataset for ease of access and handling the huge dataset in an efficient way. https://idd.insaan.iiit.ac.in/dataset/details/
Repo: https://github.com/Arunav07/CRISTL-Project-1
Link to the Gantt Chart: https://docs.google.com/document/d/18UX6mkH88T9qIU-J9HYKmB4pJGqoPlSd/edit?usp=sharing&ouid=115112343899486492342&rtpof=true&sd=true
The project allows us to explore the various aspects involved while building a Machine Learning Model and gives us hands-on experience on a real life dataset. The object detection task involved will also pave a way in the future for automated cars to drive on Indian roads.
Repository referred throughout the course of the project: https://github.com/BKaiwalya/Deep-Learning_Diabetic-Retinopathy-Detection
Tensorflow:
- https://www.tensorflow.org/tutorials
- https://www.tensorflow.org/datasets/catalog/overview
- https://www.tensorflow.org/tutorials/keras/classification
- https://www.tensorflow.org/tutorials/images/classification
Existing Pre-processing examples:
- https://paperswithcode.com/dataset/idd
- https://github.com/BKaiwalya/Deep-Learning_Diabetic-Retinopathy-Detection/blob/master/DR_InputPipeline.py
XML to CSV Conversion:
- https://www.geeksforgeeks.org/convert-xml-to-csv-in-python/
- https://stackoverflow.com/questions/3207219/how-do-i-list-all-files-of-a-directory
One-hot Encoding References:
- https://en.wikipedia.org/wiki/One-hot
- https://www.quora.com/What-is-one-hot-encoding-and-when-is-it-used-in-data-science
tf.data References:
- https://www.tensorflow.org/guide/data
- https://youtu.be/VFEOskzhhbc
- https://youtu.be/kVEOCfBy9uY
- https://cs230.stanford.edu/blog/datapipeline/#building-an-image-data-pipeline
Tf Records References:
- https://www.tensorflow.org/tutorials/load_data/tfrecord
- https://www.youtube.com/watch?v=oxrcZ9uUblI
- https://towardsdatascience.com/a-practical-guide-to-tfrecords-584536bc786c
Multiple Labels Classsification References:
- https://towardsdatascience.com/image-data-labelling-and-annotation-everything-you-need-to-know-86ede6c684b1
- https://towardsdatascience.com/multi-label-image-classification-in-tensorflow-2-0-7d4cf8a4bc72
- https://peltarion.com/knowledge-center/documentation/cheat-sheets/multi-label-image-classification-/-cheat-sheet
Creating label_matrix References: