Catalog Home Page

NormalNet: A voxel-based CNN for 3D object classification and retrieval

Wang, C., Cheng, M., Sohel, F., Bennamoun, M. and Li, J. (2018) NormalNet: A voxel-based CNN for 3D object classification and retrieval. Neurocomputing, 323 . pp. 139-147.

[img]
PDF - Authors' Version
Embargoed until September 2020.

Link to Published Version: https://doi.org/10.1016/j.neucom.2018.09.075
*Subscription may be required

Abstract

A common approach to tackle 3D object recognition tasks is to project 3D data to multiple 2D images. Projection only captures the outline of the object, and discards the internal information that may be crucial for the recognition. In this paper, we stay in 3D and concentrate on tapping the potential of 3D representations. We present NormalNet, a voxel-based convolutional neural network (CNN) designed for 3D object recognition. The network uses normal vectors of the object surfaces as input, which demonstrate stronger discrimination capability than binary voxels. We propose a reflection-convolution-concatenation (RCC) module to realize the conv layers, which extracts distinguishable features for 3D vision tasks while reducing the number of parameters significantly. We further improve the performance of NormalNet by combining two networks, which take normal vectors and voxels as input respectively. We carry out a series of experiments that validate the design of the network and achieve competitive performance in 3D object classification and retrieval tasks.

Publication Type: Journal Article
Murdoch Affiliation: School of Engineering and Information Technology
Publisher: Elsevier BV
Copyright: © 2018 Elsevier B.V.
URI: http://researchrepository.murdoch.edu.au/id/eprint/42344
Item Control Page Item Control Page