Published February 1, 2022 | Version v1
Journal article Open

On the effectiveness of Gated Echo State Networks for data exhibiting long-term dependencies

  • 1. University of Pisa

Description

In the context of recurrent neural networks, gated architectures such as the GRU have contributed to the development of highly accurate machine learning models that can tackle long-term dependencies in the data. However, the training of such networks is performed by the expensive algorithm of gradient descent with backpropagation through time. On the other hand, reservoir computing approaches such as Echo State Networks (ESNs) can produce models that can be trained effi- ciently thanks to the use of fixed random parameters, but are not ideal for dealing with data presenting long-term dependencies. We explore the problem of employ- ing gated architectures in ESNs from both theoretical and empirical perspectives. We do so by deriving and evaluating a necessary condition for the non-contractivity of the state transition function, which is important to overcome the fading-memory characterization of conventional ESNs. We find that using pure reservoir comput- ing methodologies is not sufficient for effective gating mechanisms, while instead training even only the gates is highly effective in terms of predictive accuracy.

Files

1820-02142100063D.pdf

Files (369.6 kB)

Name Size Download all
md5:f4bf6ae8a94b63ed3fa53b0d091f5414
369.6 kB Preview Download