A Review of Recurrent Neural Networks: LSTM Cells and Network Architectures

Yong Yu; Xiaosheng Si; Changhua Hu; Jianxun Zhang

doi:10.1162/neco_a_01199

Neural Comput. 2019 Jul;31(7):1235-1270. doi: 10.1162/neco_a_01199. Epub 2019 May 21.

Authors

Yong Yu¹, Xiaosheng Si², Changhua Hu³, Jianxun Zhang⁴

Affiliations

¹ Department of Automation, Xi'an Institute of High-Technology, Xi'an 710025, China, and Institute No. 25, Second Academy of China, Aerospace Science and Industry Corporation, Beijing 100854, China yuyongep@163.com.
² Department of Automation, Xi'an Institute of High-Technology, Xi'an 710025, China sxs09@mails.tsinghua.edu.cn.
³ Department of Automation, Xi'an Institute of High-Technology, Xi'an 710025, China hch_reu@sina.com.
⁴ Department of Automation, Xi'an Institute of High-Technology, Xi'an 710025, China zhang200735@163.com.

PMID: 31113301
DOI: 10.1162/neco_a_01199

Abstract

Recurrent neural networks (RNNs) have been widely adopted in research areas concerned with sequential data, such as text, audio, and video. However, RNNs consisting of sigma cells or tanh cells are unable to learn the relevant information in the input data when the gap between related inputs is large. By introducing gate functions into the cell structure, the long short-term memory (LSTM) can handle the problem of long-term dependencies well. Since its introduction, almost all the exciting results based on RNNs have been achieved by the LSTM, and the LSTM has become the focus of deep learning. We review the LSTM cell and its variants to explore the learning capacity of the LSTM cell. Furthermore, we divide LSTM networks into two broad categories: LSTM-dominated networks and integrated LSTM networks. In addition, we discuss their various applications. Finally, we present future research directions for LSTM networks.
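
To make the role of the gate functions concrete, the following is a minimal sketch of a single time step of a scalar LSTM cell. It assumes the commonly used forget-gate formulation, which the abstract itself does not spell out. The names `sigmoid`, `lstm_step`, and the parameter dictionary `p` are illustrative only and are not taken from the paper.

```python
import math


def sigmoid(x):
    """Logistic function; squashes a gate pre-activation into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def lstm_step(x, h_prev, c_prev, p):
    """One time step of a scalar LSTM cell (forget-gate variant, assumed).

    p maps each of "f", "i", "o", "g" to a (w_x, w_h, b) triple:
    input weight, recurrent weight, and bias. The layout is hypothetical.
    """
    def pre(key):
        w_x, w_h, b = p[key]
        return w_x * x + w_h * h_prev + b

    f = sigmoid(pre("f"))    # forget gate: fraction of old cell state kept
    i = sigmoid(pre("i"))    # input gate: fraction of the candidate admitted
    o = sigmoid(pre("o"))    # output gate: fraction of cell state exposed
    g = math.tanh(pre("g"))  # candidate cell value
    c = f * c_prev + i * g   # additive cell-state update
    h = o * math.tanh(c)     # new hidden state
    return h, c
```

With a forget gate close to 1 and an input gate close to 0, the cell state is carried almost unchanged across many steps, which is one way to picture how gating helps with long input gaps. For example, setting the biases to `p = {"f": (0, 0, 10), "i": (0, 0, -10), "o": (0, 0, 0), "g": (1, 0, 0)}` and calling `lstm_step` repeatedly with `x = 0.0` keeps an initial cell state of 1.0 nearly intact.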

Publication types

Research Support, Non-U.S. Gov't
Review

MeSH terms

Algorithms*
Data Analysis
Humans
Memory, Long-Term / physiology
Memory, Short-Term / physiology*
Neural Networks, Computer*