public class GRU extends RecurrentCell
Reference paper: Gated Recurrent Unit, Cho et al. 2014 (http://arxiv.org/abs/1406.1078). The GRU definition used here differs slightly from the one in the paper but is compatible with cuDNN.
$$ \begin{split}\begin{array}{ll} r_t = \mathrm{sigmoid}(W_{ir} x_t + b_{ir} + W_{hr} h_{(t-1)} + b_{hr}) \\ z_t = \mathrm{sigmoid}(W_{iz} x_t + b_{iz} + W_{hz} h_{(t-1)} + b_{hz}) \\ n_t = \tanh(W_{in} x_t + b_{in} + r_t * (W_{hn} h_{(t-1)}+ b_{hn})) \\ h_t = (1 - z_t) * n_t + z_t * h_{(t-1)} \\ \end{array}\end{split} $$
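The four equations above can be traced by hand for a single time step. The following is a minimal plain-Java sketch, not the implementation used by this class: the class name `GruStepSketch`, the helper `affine`, and the gate indexing (0 = r, 1 = z, 2 = n) are assumptions made for illustration only.

```java
import java.util.Arrays;

public class GruStepSketch {

    static double sigmoid(double v) {
        return 1.0 / (1.0 + Math.exp(-v));
    }

    // Computes W * v + b for a matrix W stored as an array of rows.
    static double[] affine(double[][] w, double[] v, double[] b) {
        double[] out = new double[w.length];
        for (int i = 0; i < w.length; i++) {
            double sum = b[i];
            for (int j = 0; j < v.length; j++) {
                sum += w[i][j] * v[j];
            }
            out[i] = sum;
        }
        return out;
    }

    /**
     * One GRU time step following the equations above.
     * wI/bI are the input-side weights and biases, wH/bH the hidden-side ones,
     * each indexed by gate: 0 = r, 1 = z, 2 = n (hypothetical layout).
     */
    public static double[] step(double[] x, double[] hPrev,
                                double[][][] wI, double[][] bI,
                                double[][][] wH, double[][] bH) {
        double[] xr = affine(wI[0], x, bI[0]);
        double[] hr = affine(wH[0], hPrev, bH[0]);
        double[] xz = affine(wI[1], x, bI[1]);
        double[] hz = affine(wH[1], hPrev, bH[1]);
        double[] xn = affine(wI[2], x, bI[2]);
        double[] hn = affine(wH[2], hPrev, bH[2]);

        double[] h = new double[hPrev.length];
        for (int i = 0; i < h.length; i++) {
            double r = sigmoid(xr[i] + hr[i]);        // reset gate r_t
            double z = sigmoid(xz[i] + hz[i]);        // update gate z_t
            double n = Math.tanh(xn[i] + r * hn[i]);  // candidate state n_t
            h[i] = (1 - z) * n + z * hPrev[i];        // new hidden state h_t
        }
        return h;
    }

    public static void main(String[] args) {
        // With all weights and biases zero: r = z = 0.5 and n = 0,
        // so the new state is half of the previous state.
        double[] x = {0.3, -0.7};
        double[] hPrev = {1.0, -2.0};
        double[] h = step(x, hPrev,
                new double[3][2][2], new double[3][2],
                new double[3][2][2], new double[3][2]);
        System.out.println(Arrays.toString(h)); // prints [0.5, -1.0]
    }
}
```

Note that the reset gate multiplies the whole hidden-side term, bias included, which is the cuDNN-compatible arrangement the formula for n_t describes.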
| Modifier and Type | Class and Description |
|---|---|
| static class | GRU.Builder |
Inherited nested classes: RecurrentCell.BaseBuilder<T extends RecurrentCell.BaseBuilder>

Inherited fields: currentVersion, dropRate, gates, mode, numStackedLayers, parameters, stateOutputs, stateShape, stateSize, useBidirectional, useSequenceLength, inputNames, inputShapes

| Modifier and Type | Method and Description |
|---|---|
| static GRU.Builder | builder() Creates a builder to build a GRU. |
Inherited methods (one group per declaring class or interface, separated by semicolons): beforeInitialize, forward, getDirectParameters, getOutputShapes, getParameterShape, loadParameters, opInputs, saveParameters, updateInputLayoutToTNC, validateInputSize; getChildren, initialize, toString; cast, clear, describeInput, getParameters, isInitialized, setInitializer, setInitializer; clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait; forward, validateLayout

public static GRU.Builder builder()
Creates a builder to build a GRU.