Deep Learning for Feature Extraction in Remote Sensing: A Case-Study of Aerial Scene Classification

Biserka Petrovska; Eftim Zdravevski; Petre Lameski; Roberto Corizzo; Ivan Štajduhar; Jonatan Lerga

doi:10.3390/s20143906

Deep Learning for Feature Extraction in Remote Sensing: A Case-Study of Aerial Scene Classification

Sensors (Basel). 2020 Jul 14;20(14):3906. doi: 10.3390/s20143906.

Authors

Biserka Petrovska¹, Eftim Zdravevski², Petre Lameski², Roberto Corizzo^{3

4}, Ivan Štajduhar^{5

6}, Jonatan Lerga^{5

6}

Affiliations

¹ Ministry of Defense of Republic of North Macedonia, 1000 Skopje, North Macedonia.
² Faculty of Computer Science and Engineering, Saints Cyril and Methodius University, 1000 Skopje, North Macedonia.
³ Department of Computer Science, University of Bari Aldo Moro, 70125 Bari, Italy.
⁴ Department of Computer Science, American University, Washington, DC 20016, USA.
⁵ Faculty of Engineering, University of Rijeka, 51000 Rijeka, Croatia.
⁶ Center for Artificial Intelligence and Cybersecurity, University of Rijeka, Radmile Matejcic 2, 51000 Rijeka, Croatia.

Abstract

Scene classification relying on images is essential in many systems and applications related to remote sensing. The scientific interest in scene classification from remotely collected images is increasing, and many datasets and algorithms are being developed. The introduction of convolutional neural networks (CNN) and other deep learning techniques contributed to vast improvements in the accuracy of image scene classification in such systems. To classify the scene from areal images, we used a two-stream deep architecture. We performed the first part of the classification, the feature extraction, using pre-trained CNN that extracts deep features of aerial images from different network layers: the average pooling layer or some of the previous convolutional layers. Next, we applied feature concatenation on extracted features from various neural networks, after dimensionality reduction was performed on enormous feature vectors. We experimented extensively with different CNN architectures, to get optimal results. Finally, we used the Support Vector Machine (SVM) for the classification of the concatenated features. The competitiveness of the examined technique was evaluated on two real-world datasets: UC Merced and WHU-RS. The obtained classification accuracies demonstrate that the considered method has competitive results compared to other cutting-edge techniques.

Keywords: convolutional neural network (CNN); feature extraction; feature fusion; remote sensing.