Build a Traffic Sign Recognition Project
The goals / steps of this project are the following:
- Load the data set (see below for links to the project data set)
- Explore, summarize and visualize the data set
- Design, train and test a model architecture
- Use the model to make predictions on new images
- Analyze the softmax probabilities of the new images
- Summarize the results with a written report
Requirements
- Anaconda 3 is installed on your machine.
- Download the data set
- Clone repository:
```bash
git clone https://github.com/akrost/CarND-TrafficSignClassifier.git
cd CarND-TrafficSignClassifier
```
- Create and activate Anaconda environment:
```bash
conda create --name carnd-p2
source activate carnd-p2
```
The command used to activate the environment may vary depending on your OS.
- Install packages:
```bash
pip install -r requirements.txt
```
- Run the project:
```bash
jupyter notebook Traffic_Sign_Classifier.ipynb
```
I used the numpy library to calculate summary statistics of the traffic signs data set:
- The size of the training set is 34,799.
- The size of the validation set is 4,410.
- The size of the test set is 12,630.
- The shape of a traffic sign image is 32x32x3 (RGB).
- The number of unique classes/labels in the data set is 43.
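For illustration, these statistics can be computed along the following lines (a minimal sketch; the variable names `X_train`, `y_train`, `X_valid`, `X_test` are assumptions about how the pickled data was loaded):

```python
import numpy as np

# Assumes X_train, X_valid, X_test are image arrays of shape (N, 32, 32, 3)
# and y_train holds the integer class labels (names are assumptions).
n_train = X_train.shape[0]            # 34,799
n_validation = X_valid.shape[0]       # 4,410
n_test = X_test.shape[0]              # 12,630
image_shape = X_train.shape[1:]       # (32, 32, 3)
n_classes = np.unique(y_train).size   # 43

print(f"Training set size:   {n_train}")
print(f"Validation set size: {n_validation}")
print(f"Test set size:       {n_test}")
print(f"Image shape:         {image_shape}")
print(f"Number of classes:   {n_classes}")
```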
Here is an exploratory visualization of the data set. The graph shows a histogram of all classes in the three datasets. The blue line shows the distribution for the training set, orange shows the validation set, and green shows the test set.
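Such a class histogram can be produced along these lines (a sketch, reusing the assumed label arrays `y_train`, `y_valid`, and `y_test` from above):

```python
import numpy as np
import matplotlib.pyplot as plt

# Count how often each of the 43 classes occurs in each dataset
# and plot one distribution line per dataset.
classes = np.arange(43)
for labels, name in [(y_train, "training"), (y_valid, "validation"), (y_test, "test")]:
    counts = np.bincount(labels, minlength=43)
    plt.plot(classes, counts, label=name)

plt.xlabel("Class label")
plt.ylabel("Number of samples")
plt.legend()
plt.show()
```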
The graph shows the RGB histogram of the original image. There is a big spike for all three color channels around a pixel value of 30 (with a maximum pixel value of 160), hence the image appears to be very dark.
To improve the input data quality, the CLAHE (Contrast Limited Adaptive Histogram Equalization) algorithm was used to enhance the contrast of the image. The next graph shows the RGB histogram of the same image as above, but with the contrast improved. The spike was flattened to around half its value and the overall spectrum was spread across a wider range.
After improving the contrast, the image was normalized using the formula

```
pixel_value_new = (pixel_value_old - 128.) / 128.
```

This step is necessary to make the input data zero-centered, ranging from -1 to 1.
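A sketch of this preprocessing pipeline using OpenCV's CLAHE implementation (the clip limit and tile grid size here are illustrative assumptions, not values taken from the notebook):

```python
import cv2
import numpy as np

def preprocess(image_rgb):
    """Contrast enhancement with CLAHE followed by normalization to [-1, 1].

    clipLimit and tileGridSize are illustrative assumptions, not the
    values used in the notebook.
    """
    # CLAHE works on a single channel, so enhance the lightness channel
    # of the LAB representation and convert back to RGB.
    lab = cv2.cvtColor(image_rgb, cv2.COLOR_RGB2LAB)
    l, a, b = cv2.split(lab)
    clahe = cv2.createCLAHE(clipLimit=2.0, tileGridSize=(4, 4))
    lab = cv2.merge((clahe.apply(l), a, b))
    enhanced = cv2.cvtColor(lab, cv2.COLOR_LAB2RGB)

    # Zero-center the pixel values so they range from -1 to 1.
    return (enhanced.astype(np.float32) - 128.) / 128.
```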
My final model consisted of the following layers:
Model
Layer | Name | Description |
---|---|---|
Input | | 32x32x3 RGB image |
Convolution 5x5 | conv1 | 1x1 strides, VALID padding, outputs 28x28x6 |
RELU | conv1_relu | |
Max pooling | conv1_maxpool | 2x2 strides, 2x2 kernel, SAME padding, outputs 14x14x6 |
Dropout | conv1_dropout | keep rate 0.8 |
Convolution 5x5 | conv2 | 1x1 strides, VALID padding, outputs 10x10x16 |
RELU | conv2_relu | |
Max pooling | conv2_maxpool | 2x2 strides, 2x2 kernel, SAME padding, outputs 5x5x16 |
Dropout | conv2_dropout | keep rate 0.8 |
Fully connected | fc1 | 400 inputs, outputs 120 |
RELU | fc1_relu | |
Dropout | fc1_dropout | keep rate 0.5 |
Fully connected | fc2 | 120 inputs, outputs 84 |
RELU | fc2_relu | |
Dropout | fc2_dropout | keep rate 0.5 |
Fully connected | fc3 | 84 inputs, outputs 43 (number of classes) |
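For illustration, a minimal sketch of this architecture in tf.keras (the notebook itself builds the same layers with lower-level TensorFlow ops; note that Keras Dropout takes a drop rate, so keep rate 0.8 becomes Dropout(0.2)):

```python
import tensorflow as tf

# Sketch of the table above in tf.keras, not the notebook's exact code.
model = tf.keras.Sequential([
    tf.keras.layers.Conv2D(6, 5, padding="valid", activation="relu",
                           input_shape=(32, 32, 3)),                    # conv1: 28x28x6
    tf.keras.layers.MaxPooling2D(2, strides=2, padding="same"),         # conv1_maxpool: 14x14x6
    tf.keras.layers.Dropout(0.2),                                       # conv1_dropout: keep rate 0.8
    tf.keras.layers.Conv2D(16, 5, padding="valid", activation="relu"),  # conv2: 10x10x16
    tf.keras.layers.MaxPooling2D(2, strides=2, padding="same"),         # conv2_maxpool: 5x5x16
    tf.keras.layers.Dropout(0.2),                                       # conv2_dropout: keep rate 0.8
    tf.keras.layers.Flatten(),                                          # 5*5*16 = 400 inputs
    tf.keras.layers.Dense(120, activation="relu"),                      # fc1
    tf.keras.layers.Dropout(0.5),                                       # fc1_dropout: keep rate 0.5
    tf.keras.layers.Dense(84, activation="relu"),                       # fc2
    tf.keras.layers.Dropout(0.5),                                       # fc2_dropout: keep rate 0.5
    tf.keras.layers.Dense(43),                                          # fc3: logits for 43 classes
])
```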
To train the model, I used the following parameters:
- Optimizer: Adam optimizer
- Batch size: 128
- Epochs: 20
- Learning rate: 0.001
- Keep rate convolutional layers: 0.8
- Keep rate fully connected layers: 0.5
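Continuing the hypothetical tf.keras sketch from above, training with these parameters could look like this:

```python
# Train with the hyperparameters listed above (a sketch; X_train etc. are
# the assumed, preprocessed arrays from the earlier snippets).
model.compile(
    optimizer=tf.keras.optimizers.Adam(learning_rate=0.001),
    loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
    metrics=["accuracy"],
)
model.fit(X_train, y_train,
          batch_size=128,
          epochs=20,
          validation_data=(X_valid, y_valid))
```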
My final model results were:
- training set accuracy of 98.3 %
- validation set accuracy of 96.0 %
- test set accuracy of 94.7 %
If an iterative approach was chosen:
- What was the first architecture that was tried and why was it chosen?
- I started with the LeNet architecture, since it is a proven architecture for similar problems.
- What were some problems with the initial architecture?
- The LeNet architecture had to be adapted to the given image size
- The original architecture tended to overfit
- How was the architecture adjusted and why was it adjusted?
- The input depth was adjusted to handle RGB images
- Size of the fully connected layers had to be adjusted to the new depth
- Dropout was introduced to both convolutional and fully connected layers to prevent the model from overfitting
- Which parameters were tuned? How were they adjusted and why?
- The keep rate was tuned for both convolutional and fully connected layers
- What are some of the important design choices and why were they chosen?
- Probably the most important design choice was to use dropout, since the divergence between training set accuracy and validation set accuracy was a strong indicator of overfitting.
If a well known architecture was chosen:
- What architecture was chosen?
- LeNet as a basis
- Why did you believe it would be relevant to the traffic sign application?
- It has proven itself on similar image classification tasks
- How does the final model's accuracy on the training, validation and test set provide evidence that the model is working well?
- All three accuracies are quite high, which indicates that the general architecture is sound and the model is not underfitting much
- Training and validation accuracy are close to each other so the model does not seem to overfit
- Test accuracy is quite high which underlines the two points above
1. Choose five German traffic signs found on the web and provide them in the report. For each image, discuss what quality or qualities might be difficult to classify.
Here are five German traffic signs that I found on the web:
The first two images might be difficult to classify since speed limit signs look very similar to each other, especially at a 32x32 resolution. The third image should not be too hard. The fourth and fifth images might be harder again, for the same reason as above.
Here are the results of the prediction:
Image | Prediction |
---|---|
30 km/h | 30 km/h |
100 km/h | 30 km/h |
Yield | Yield |
Ahead only | Ahead only |
Keep right | Keep right |
The model was able to correctly guess 4 of the 5 traffic signs, which gives an accuracy of 80 %. This compares unfavorably with the accuracy of 94.7 % on the test set, which might be due to the very small sample size tested here.
The code for making predictions on my final model is located in the 25th cell of the IPython notebook.
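The top-five probabilities can be extracted along these lines (a sketch; `X_web` stands for the stack of five preprocessed web images and is an assumption, as is the tf.keras model from above):

```python
import numpy as np

# Get softmax probabilities and the top five predictions for the web images.
logits = model.predict(X_web)                       # shape (5, 43)
exp = np.exp(logits - logits.max(axis=1, keepdims=True))
probs = exp / exp.sum(axis=1, keepdims=True)        # numerically stable softmax

top5 = np.argsort(probs, axis=1)[:, ::-1][:, :5]    # indices of the 5 largest
for i, row in enumerate(top5):
    print(f"Image {i}:")
    for cls in row:
        print(f"  class {cls}: p = {probs[i, cls]:.4f}")
```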
For the first image, the model classifies the image correctly, but it is not very confident, since other speed limit signs look quite similar. The top five softmax probabilities were
Probability | Prediction |
---|---|
.4913 | 30 km/h |
.3349 | 20 km/h |
.0915 | Vehicles over 3.5 metric tons prohibited |
.0506 | 50 km/h |
.0087 | 70 km/h |
For the second image, the classification is actually wrong. This time the probabilities are not as close together as they were for the first picture, but unfortunately the most probable prediction was wrong.
Probability | Prediction |
---|---|
.4653 | 30 km/h |
.1476 | 100 km/h |
.1134 | Roundabout mandatory |
.1133 | 80 km/h |
.0730 | 50 km/h |
For the third image, the model is completely sure that this is a yield sign, and the image does contain a yield sign.
Probability | Prediction |
---|---|
1. | Yield |
0 | Priority road |
0 | No passing |
0 | No vehicles |
0 | No entry |
For the fourth image, the model is almost as confident as it is for the third image. The model predicts an Ahead only sign, and the image does indeed contain an Ahead only sign.
Probability | Prediction |
---|---|
.9999 | Ahead only |
.00001 | Turn left ahead |
0 | Turn right ahead |
0 | Go straight or right |
0 | Go straight or left |
For the fifth image, the model is again completely sure that the image contains a keep right sign, and it does contain this sign.
Probability | Prediction |
---|---|
1. | Keep right |
0 | Turn left ahead |
0 | Go straight or right |
0 | Roundabout mandatory |
0 | Dangerous curve to the right |
The image shows the feature maps of the first convolutional layer:
According to the feature maps, the characteristics picked up by this layer are the red edge of the 30 km/h sign and the number 30 in the middle of the sign. This layer already captures a comprehensive set of features that identify the sign.
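Such feature maps can be rendered along these lines (a sketch based on the hypothetical tf.keras model and `X_web` array from above):

```python
import matplotlib.pyplot as plt
import tensorflow as tf

# Run one preprocessed image through the first convolutional layer only
# and plot its six feature maps.
conv1 = tf.keras.Model(inputs=model.inputs, outputs=model.layers[0].output)
maps = conv1.predict(X_web[:1])          # shape (1, 28, 28, 6)

fig, axes = plt.subplots(1, 6, figsize=(12, 2))
for i, ax in enumerate(axes):
    ax.imshow(maps[0, :, :, i], cmap="gray")
    ax.set_title(f"map {i}")
    ax.axis("off")
plt.show()
```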