3d object detection from point cloud
DOI:
https://doi.org/10.5564/jimdt.v6i1.3609Keywords:
computer vision, deep learning, object detection, point cloudAbstract
Here we present a comparison of two different deep learning architectures’ effectiveness along with two distinct detection head approaches for detecting point cloud ball objects. Two backbones that are explored are: VoxelNet, suited for organized point clouds, and PointNet, which handles unorganized point clouds. We modified and implemented SSD and Faster R-CNN detection heads for both backbones. It turns out that the PointNet backbone integrated with a customized Faster R-CNN detection head achieved higher accuracy compared to other combination of models.
Downloads
141
References
[1] Y. Zhou and O. Tuzel, “VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection,” Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), 2018, https://doi.org/10.1109/CVPR.2018.00472.
[2] C. R. Qi, H. Su, K. Mo, and L. J. Guibas, “PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation,” Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), 2017, https://doi.org/10.1109/CVPR.2017.16.
[3] S. Liu, L. Qi, H. Qin, and J. Shi, “SSD: Single Shot MultiBox Detector,” Proc. Eur. Conf.Comput. Vis. (ECCV), 2018, pp. 21–37, https://doi.org/10.1007/978-3-319-46448-02.
[4] W. Liu et al., “Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks,”arXiv preprint arXiv:1506.01497, 2015.
[5] Y. Li et al., “PointCNN: Convolution On X-Transformed Points,” arXiv preprint arXiv:1812.04244, 2018.
[6] V. A. Sindagi, Y. Zhou, and O. Tuzel, “MVX-Net: Multimodal VoxelNet for 3D Object Detection,” Proc. Int. Conf. Robot. Autom. (ICRA), 2019, pp. 7276–7282. doi:10.1109/ICRA.2019.8793882.
[7] L. Tetrel, G. Ferret, and R. Cazorla, “3D Point Cloud Object Detection in LidarView,” Kitware Software Company, 2023.
[8] T. Zhi et al., “Fully Convolutional One-Stage 3D Object Detection on LiDAR Range Images,”Proc. NeurIPS, 2022, https://doi.org/10.48550/arXiv.2205.13764.
[9] Y. Yan, Y. Mao, and B. Li, “SECOND: Sparsely Embedded Convolutional Detection,” Sensors, vol. 18, no. 10, 2018, https://doi.org/10.3390/s18103337.
[10] X. Chen et al., “Monocular 3D Object Detection for Autonomous Driving,” Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2016, pp. 2147–2156.
[11] B. Li, T. Zhang, and T. Xia, “Vehicle Detection from 3D LiDAR Using Fully Convolutional Network,” arXiv preprint arXiv:1608.07916, 2016.
[12] B. Yang, W. Luo, and R. Urtasun, “PIXOR: Real-Time 3D Object Detection From Point Clouds,” Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2018, pp. 7652–7660.
[13] M. Simon, S. Milz, K. Amende, and H. M. Gross, “Complex-YOLO: Real-Time 3D Object Detection on Point Clouds,” arXiv preprint arXiv:1803.06199, 2018.
[14] J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, “You Only Look Once: Unified, Real-Time Object Detection,” Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2016, pp. 779–788.
[15] D. Z. Wang and I. Posner, “Voting for Voting in Online Point Cloud Object Detection,” Proc. Robot.: Sci. Syst. (RSS), 2015.
[16] Y. Guo, H. Wang, Q. Hu, H. Liu, L. Liu, and M. Bennamoun, “Deep Learning for 3D Point Clouds: A Survey,” IEEE Trans. Pattern Anal. Mach. Intell., 2019, pp. 4338–4364.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Minjinsor Myagmarsuren

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
The authors grant the Journal of Institute of Mathemathics and Digital Technology a license to publish the article and identify itself as the original publisher.
![]()
Articles in the Journal of Institute of Mathemathics and Digital Technology are Open Access articles published under a Creative Commons Attribution-NonCommercial 4.0 International License - CC BY NC.
This license permits NonComericial use, distribution and reproduction in any medium, provided the original work is properly cited.