DFT-Net: Deep Feature Transformation Based Network for Object Categorization and Part Segmentation in 3-Dimensional Point Clouds

Sensors (Basel). 2022 Mar 25;22(7):2512. doi: 10.3390/s22072512.

Abstract

Unlike 2-dimensional (2D) images, 3-dimensional (3D) point clouds are difficult to process directly with deep neural network architectures, mainly due to the lack of explicit neighbor relationships. Many researchers attempt to remedy this by performing an additional voxelization preprocessing step. However, this introduces additional computational overhead and quantization errors, limiting accurate estimation of the underlying structure of objects in the scene. To address this, in this article, we propose a deep network that directly consumes raw, unstructured point clouds to perform object classification and part segmentation. In particular, we propose a Deep Feature Transformation Network (DFT-Net), consisting of a cascading combination of edge convolutions and a feature transformation layer that captures local geometric features by preserving neighborhood relationships among the points. The proposed network builds a graph in which the edges are dynamically and independently computed at each layer, and it ensures invariance to the order of the input points during training. The proposed network was evaluated on two standard benchmark datasets for object classification and part segmentation, and the results were comparable to or better than those of existing state-of-the-art methods. On the ModelNet40 dataset for object categorization, the overall score of the proposed DFT-Net shows a significant improvement over state-of-the-art methods.

Keywords: 3D object categorization; classification; deep neural network; part segmentation; point cloud.
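The abstract describes two ingredients that can be illustrated generically: an edge convolution applied over a k-nearest-neighbor graph that is rebuilt from the current features at every layer, and a symmetric (order-invariant) aggregation over neighbors. The sketch below is not the authors' code; the use of PyTorch, the value of k, and the layer sizes are assumptions chosen only to make the idea concrete.

```python
# Minimal sketch (assumed PyTorch implementation, not the authors' code) of an
# EdgeConv-style layer with a dynamically rebuilt k-NN graph and a symmetric
# max aggregation, mirroring the behavior described in the abstract.
import torch
import torch.nn as nn


def knn_graph(x, k):
    """Indices of the k nearest neighbors for each point; x: (B, N, C)."""
    dist = torch.cdist(x, x)                    # pairwise distances, (B, N, N)
    # Take k+1 and drop the first column, which is each point itself.
    return dist.topk(k + 1, dim=-1, largest=False).indices[:, :, 1:]  # (B, N, k)


class EdgeConv(nn.Module):
    """Edge convolution over a graph recomputed from the current features."""

    def __init__(self, in_channels, out_channels, k=20):
        super().__init__()
        self.k = k
        self.mlp = nn.Sequential(
            nn.Linear(2 * in_channels, out_channels),
            nn.BatchNorm1d(out_channels),
            nn.ReLU(inplace=True),
        )

    def forward(self, x):                        # x: (B, N, C)
        B, N, C = x.shape
        idx = knn_graph(x, self.k)               # graph rebuilt at this layer
        neighbors = torch.gather(
            x.unsqueeze(1).expand(B, N, N, C), 2,
            idx.unsqueeze(-1).expand(B, N, self.k, C),
        )                                        # (B, N, k, C)
        center = x.unsqueeze(2).expand_as(neighbors)
        # Edge feature: the center point plus its offset to each neighbor.
        edge = torch.cat([center, neighbors - center], dim=-1)   # (B, N, k, 2C)
        edge = self.mlp(edge.view(-1, 2 * C)).view(B, N, self.k, -1)
        # Max over neighbors is symmetric, so the result does not depend on
        # the order in which the input points are listed.
        return edge.max(dim=2).values            # (B, N, out_channels)
```

A classification or part-segmentation head would stack several such layers (each recomputing its own neighborhood graph) and pool the per-point features with another symmetric function before the final classifier; the exact DFT-Net configuration is given in the paper, not in this abstract.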