Incremental training of first order recurrent neural networks to predict a context-sensitive language

Stephan K Chalup; Alan D Blair

doi:10.1016/S0893-6080(03)00054-6

Neural Netw. 2003 Sep;16(7):955-72. doi: 10.1016/S0893-6080(03)00054-6.

Authors

Stephan K Chalup¹, Alan D Blair

Affiliation

¹ School of Electrical Engineering and Computer Science, The University of Newcastle, Callaghan, NSW 2308, Australia. chalup@cs.newcastle.edu.au

PMID: 14692631
DOI: 10.1016/S0893-6080(03)00054-6

Abstract

In recent years it has been shown that first order recurrent neural networks trained by gradient descent can learn not only regular but also simple context-free and context-sensitive languages. However, the success rate was generally low and severe instability issues were encountered. The present study examines the hypothesis that a combination of evolutionary hill climbing with incremental learning and a well-balanced training set enables first order recurrent networks to reliably learn context-free and mildly context-sensitive languages. In particular, we trained the networks to predict symbols in string sequences of the context-sensitive language {a^n b^n c^n; n ≥ 1}. Comparative experiments with and without incremental learning indicated that incremental learning can accelerate and facilitate training. Furthermore, incrementally trained networks generally resulted in monotonic trajectories in hidden unit activation space, while the trajectories of non-incrementally trained networks were oscillating. The non-incrementally trained networks were more likely to generalise.
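To make the prediction task concrete, the following is a minimal Python sketch of the language {a^n b^n c^n; n ≥ 1} and of next-symbol targets for it. It is an illustration only, not the authors' code. The function names (`anbncn`, `next_symbol_targets`, `incremental_stages`) are hypothetical, and two points are assumptions not stated in the abstract: that strings are presented back to back, so the symbol after the last c is the a that opens the next string, and that incremental learning means growing the training set from short strings to longer ones.

```python
def anbncn(n: int) -> str:
    """Return the string a^n b^n c^n for n >= 1."""
    if n < 1:
        raise ValueError("n must be >= 1")
    return "a" * n + "b" * n + "c" * n


def next_symbol_targets(n: int) -> list[tuple[str, set[str]]]:
    """Pair each input symbol with the set of symbols that may follow it.

    While reading a's the next symbol is ambiguous (another a, or the first b).
    From the first b onward the continuation is fully determined by n.
    Assumption: strings are concatenated, so the last c is followed by an a.
    """
    s = anbncn(n)
    targets = []
    for i, sym in enumerate(s):
        if sym == "a":
            allowed = {"a", "b"}      # n is not yet known to the predictor
        elif i + 1 < len(s):
            allowed = {s[i + 1]}      # deterministic inside the b and c blocks
        else:
            allowed = {"a"}           # assumed start of the next string
        targets.append((sym, allowed))
    return targets


def incremental_stages(max_n: int) -> list[list[str]]:
    """Hypothetical curriculum: stage k contains the strings for n = 1..k."""
    return [[anbncn(n) for n in range(1, k + 1)] for k in range(1, max_n + 1)]
```

Under these assumptions, a network can only be judged on the positions where the allowed set has a single element, which is why the b and c blocks carry the counting burden in this task.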

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Language*
Neural Networks, Computer*