To address the short-term memory problem of standard RNNs, GRUs (Gated Recurrent Units) provide mechanisms for storing the desired information. A GRU keeps the usual RNN architecture and differs only in the activation block: information is stored in so-called memory cells, regulated by gates, which lets the network carry relevant information over longer sequences.
- Capable of storing long-term information
- Faster to compute than LSTMs (fewer gates, hence fewer parameters)
- Relatively easy to implement
- The sigmoid in the gates helps against vanishing gradients: a gate saturated near 0 carries the memory cell forward unchanged (see the sketch after this list)
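A toy numerical sketch (made-up values, not a trained network) of the long-term-storage and vanishing-gradient points from the list above: an update gate near 0 copies the memory cell forward almost unchanged, while a plain tanh RNN squashes its state at every step.

```python
import numpy as np

# Gated update: c_t = u * candidate + (1 - u) * c_prev.
# With the update gate u stuck near 0 ("keep the memory"),
# the cell value survives 100 steps almost unchanged.
c, u = 1.0, 0.001
for _ in range(100):
    candidate = np.tanh(np.random.randn())  # some new candidate each step
    c = u * candidate + (1 - u) * c
print(c)  # roughly 0.9: the stored value is largely preserved

# A plain tanh RNN squashes its state at every step instead:
h = 1.0
for _ in range(100):
    h = np.tanh(0.5 * h)
print(h)  # essentially 0: the information has vanished
```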
The GRU consists of two gates and one memory cell:
$a$: activation
$c$: memory cell (in the GRU identical to the hidden state $h$)
$\tilde{c}$: candidate for replacing $c$
$\Gamma_u$: update gate; decides when to update $c$ (most of the time its value is close to 0 or 1). Also written $u$ or $z$ in the literature.
$\Gamma_r$: relevance (reset) gate; decides how relevant $c^{\langle t-1 \rangle}$ is for computing the candidate $\tilde{c}$
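In this notation, the standard GRU equations are as follows ($*$ denotes element-wise multiplication, $\sigma$ the sigmoid):

$$
\begin{aligned}
\Gamma_u &= \sigma\left(W_u\left[c^{\langle t-1\rangle}, x^{\langle t\rangle}\right] + b_u\right)\\
\Gamma_r &= \sigma\left(W_r\left[c^{\langle t-1\rangle}, x^{\langle t\rangle}\right] + b_r\right)\\
\tilde{c}^{\langle t\rangle} &= \tanh\left(W_c\left[\Gamma_r * c^{\langle t-1\rangle}, x^{\langle t\rangle}\right] + b_c\right)\\
c^{\langle t\rangle} &= \Gamma_u * \tilde{c}^{\langle t\rangle} + \left(1-\Gamma_u\right) * c^{\langle t-1\rangle}\\
a^{\langle t\rangle} &= c^{\langle t\rangle}
\end{aligned}
$$

A minimal NumPy sketch of one forward step following these equations (the parameter names `Wu`, `Wr`, `Wc`, `bu`, `br`, `bc` and the dictionary layout are this sketch's own conventions, not a library API):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gru_step(c_prev, x_t, params):
    """One GRU time step: c_prev is the memory cell (= hidden state a),
    x_t the current input vector."""
    concat = np.concatenate([c_prev, x_t])
    u = sigmoid(params["Wu"] @ concat + params["bu"])   # update gate
    r = sigmoid(params["Wr"] @ concat + params["br"])   # relevance gate
    # Candidate is computed from the relevance-gated previous cell.
    c_tilde = np.tanh(params["Wc"] @ np.concatenate([r * c_prev, x_t])
                      + params["bc"])
    # Blend old memory and new candidate via the update gate.
    return u * c_tilde + (1 - u) * c_prev               # = a<t>

# Tiny usage example with random parameters (cell size 4, input size 3):
n_c, n_x = 4, 3
rng = np.random.default_rng(0)
params = {k: 0.1 * rng.standard_normal((n_c, n_c + n_x))
          for k in ("Wu", "Wr", "Wc")}
params.update({k: np.zeros(n_c) for k in ("bu", "br", "bc")})
c = np.zeros(n_c)
for x_t in rng.standard_normal((5, n_x)):               # length-5 sequence
    c = gru_step(c, x_t, params)
```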