Efficient Memory-Enhanced Transformer for Long-Document Summarization in Low-Resource Regimes

Gianluca Moro; Luca Ragazzi; Lorenzo Valgimigli; Giacomo Frisoni; Claudio Sartori; Gustavo Marfia

doi:10.3390/s23073542

Efficient Memory-Enhanced Transformer for Long-Document Summarization in Low-Resource Regimes

Sensors (Basel). 2023 Mar 28;23(7):3542. doi: 10.3390/s23073542.

Authors

Gianluca Moro¹, Luca Ragazzi¹, Lorenzo Valgimigli¹, Giacomo Frisoni¹, Claudio Sartori¹, Gustavo Marfia²

Affiliations

¹ Department of Computer Science and Engineering (DISI), University of Bologna, Via dell'Università 50, I-47522 Cesena, Italy.
² Department of the Arts (DAR), University of Bologna, Via Barberia 4, I-40123 Bologna, Italy.

Abstract

Long document summarization poses obstacles to current generative transformer-based models because of the broad context to process and understand. Indeed, detecting long-range dependencies is still challenging for today's state-of-the-art solutions, usually requiring model expansion at the cost of an unsustainable demand for computing and memory capacities. This paper introduces Emma, a novel efficient memory-enhanced transformer-based architecture. By segmenting a lengthy input into multiple text fragments, our model stores and compares the current chunk with previous ones, gaining the capability to read and comprehend the entire context over the whole document with a fixed amount of GPU memory. This method enables the model to deal with theoretically infinitely long documents, using less than 18 and 13 GB of memory for training and inference, respectively. We conducted extensive performance analyses and demonstrate that Emma achieved competitive results on two datasets of different domains while consuming significantly less GPU memory than competitors do, even in low-resource settings.

Keywords: abstractive summarization; long document summarization; low-resource summarization; memory-enhanced language models.

Grants and funding

PNC0000002, CUP B53C22006450001/National Plan for NRRP Complementary Investments