NormalNet: A voxel-based CNN for 3D object classification and retrieval
Wang, C., Cheng, M., Sohel, F., Bennamoun, M. and Li, J. (2018) NormalNet: A voxel-based CNN for 3D object classification and retrieval. Neurocomputing, 323 . pp. 139-147.
*Subscription may be required
Abstract
A common approach to tackle 3D object recognition tasks is to project 3D data to multiple 2D images. Projection only captures the outline of the object, and discards the internal information that may be crucial for the recognition. In this paper, we stay in 3D and concentrate on tapping the potential of 3D representations. We present NormalNet, a voxel-based convolutional neural network (CNN) designed for 3D object recognition. The network uses normal vectors of the object surfaces as input, which demonstrate stronger discrimination capability than binary voxels. We propose a reflection-convolution-concatenation (RCC) module to realize the conv layers, which extracts distinguishable features for 3D vision tasks while reducing the number of parameters significantly. We further improve the performance of NormalNet by combining two networks, which take normal vectors and voxels as input respectively. We carry out a series of experiments that validate the design of the network and achieve competitive performance in 3D object classification and retrieval tasks.
Item Type: | Journal Article |
---|---|
Murdoch Affiliation(s): | School of Engineering and Information Technology |
Publisher: | Elsevier BV |
Copyright: | © 2018 Elsevier B.V. |
URI: | http://researchrepository.murdoch.edu.au/id/eprint/42344 |
![]() |
Item Control Page |
Downloads
Downloads per month over past year