Spatial Aggregation Net: Point Cloud Semantic Segmentation Based on Multi-Directional Convolution

Guorong Cai; Zuning Jiang; Zongyue Wang; Shangfeng Huang; Kai Chen; Xuyang Ge; Yundong Wu

doi:10.3390/s19194329

Spatial Aggregation Net: Point Cloud Semantic Segmentation Based on Multi-Directional Convolution

Sensors (Basel). 2019 Oct 7;19(19):4329. doi: 10.3390/s19194329.

Authors

Guorong Cai^{1

2}, Zuning Jiang¹, Zongyue Wang³, Shangfeng Huang¹, Kai Chen¹, Xuyang Ge¹, Yundong Wu^{4

5}

Affiliations

¹ Computer Engineering College, Jimei University, Xiamen 361021, China.
² Fujian Collaborative Innovation Center for Big Data Applications in Governments, Fuzhou 350003, China.
³ Computer Engineering College, Jimei University, Xiamen 361021, China. wangzongyue@jmu.edu.cn.
⁴ Computer Engineering College, Jimei University, Xiamen 361021, China. yundongwu@jmu.edu.cn.
⁵ Fujian Collaborative Innovation Center for Big Data Applications in Governments, Fuzhou 350003, China. yundongwu@jmu.edu.cn.

Abstract

Semantic segmentation of 3D point clouds plays a vital role in autonomous driving, 3D maps, and smart cities, etc. Recent work such as PointSIFT shows that spatial structure information can improve the performance of semantic segmentation. Motivated by this phenomenon, we propose Spatial Aggregation Net (SAN) for point cloud semantic segmentation. SAN is based on multi-directional convolution scheme that utilizes the spatial structure information of point cloud. Firstly, Octant-Search is employed to capture the neighboring points around each sampled point. Secondly, we use multi-directional convolution to extract information from different directions of sampled points. Finally, max-pooling is used to aggregate information from different directions. The experimental results conducted on ScanNet database show that the proposed SAN has comparable results with state-of-the-art algorithms such as PointNet, PointNet++, and PointSIFT, etc. In particular, our method has better performance on flat, small objects, and the edge areas that connect objects. Moreover, our model has good trade-off in segmentation accuracy and time complexity.

Keywords: LiDAR point cloud; deep learning; semantic segmentation; spatial structure information.

Abstract

Grants and funding