N-GPETS: Neural Attention Graph-Based Pretrained Statistical Model for Extractive Text Summarization

Muhammad Umair; Iftikhar Alam; Atif Khan; Inayat Khan; Niamat Ullah; Mohammad Yusuf Momand

doi:10.1155/2022/6241373

N-GPETS: Neural Attention Graph-Based Pretrained Statistical Model for Extractive Text Summarization

Comput Intell Neurosci. 2022 Nov 22:2022:6241373. doi: 10.1155/2022/6241373. eCollection 2022.

Authors

Muhammad Umair¹, Iftikhar Alam¹, Atif Khan², Inayat Khan³, Niamat Ullah⁴, Mohammad Yusuf Momand⁵

Affiliations

¹ Department of Computer Science, City University of Science and Information Technology, Peshawar 25000, Pakistan.
² Department of Computer Science, Islamia College, Peshawar 25000, Pakistan.
³ Department of Computer Science, University of Engineering and Technology, Mardan, Pakistan.
⁴ Department of Computer Science, University of Buner, Buner 19290, Pakistan.
⁵ Faculty of Computer Science, University of Nangarhar, Jalalabad 2600, Afghanistan.

Abstract

The extractive summarization approach involves selecting the source document's salient sentences to build a summary. One of the most important aspects of extractive summarization is learning and modelling cross-sentence associations. Inspired by the popularity of Transformer-based Bidirectional Encoder Representations (BERT) pretrained linguistic model and graph attention network (GAT) having a sophisticated network that captures intersentence associations, this research work proposes a novel neural model N-GPETS by combining heterogeneous graph attention network with BERT model along with statistical approach using TF-IDF values for extractive summarization task. Apart from sentence nodes, N-GPETS also works with different semantic word nodes of varying granularity levels that serve as a link between sentences, improving intersentence interaction. Furthermore, proposed N-GPETS becomes more improved and feature-rich by integrating graph layer with BERT encoder at graph initialization step rather than employing other neural network encoders such as CNN or LSTM. To the best of our knowledge, this work is the first attempt to combine the BERT encoder and TF-IDF values of the entire document with a heterogeneous attention graph structure for the extractive summarization task. The empirical outcomes on benchmark news data sets CNN/DM show that the proposed model N-GPETS gets favorable results in comparison with other heterogeneous graph structures employing the BERT model and graph structures without the BERT model.

Publication types

Retracted Publication

MeSH terms

Benchmarking
Learning*
Linguistics
Models, Statistical*
Neural Networks, Computer