MNIST

Image recognition program using MNIST labelled images.

A list of images is converted into an array containing vectors of pixel brightnesses: p_i where each array index i corresponds to a different image.

Each image p_i has a corresponding label q_i, a unit vector of dimension n, where n is the number of labels. Values of q_i belong to a set of n orthogonal vectors that map to each label.

Now that we have a way to interpret images and labels as vectors, we need a function that takes 'image' p to 'label' q.

One option is q = W.p where W is a matrix,

but let's define:

norm(x) ≡ x/|x|,

ξ(M,R; p_i) ≡ norm(e^{norm(R.e^M.p_i)}),

where R and M are matrices.

ξ has the desired properties:

it is differentiable;
it also maps between vectors;
it has more non-degenerate parameters that can be varied independently than W.

For some values of R and M, ξ will map images p_i to vectors very close to q_i. This happens when q_i.ξ(M,R; p_i) ≈ 1.

So the goal is to maximise f = Σ_i log(q_i.ξ(p_i)) by varying R and M.

I calculated expressions for ∂f_i/∂M and ∂f_i/∂R and used this program to perform many small gradients descents for each image.

Having now optimised R and M, this program applies ξ to new test images and can label them with 98.7% accuracy.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

MNIST

Files

README.md

Latest commit

History

README.md

File metadata and controls

MNIST