3D object detection based on image and LIDAR fusion for autonomous driving Online publication date: Thu, 17-Aug-2023
by Guoqiang Chen; Huailong Yi; Zhuangzhuang Mao
International Journal of Vehicle Information and Communication Systems (IJVICS), Vol. 8, No. 3, 2023
Abstract: 3D object detection is the fundamental task of autonomous driving. The existing approaches are very expensive in computation due to high dimensionality of point clouds. We use the 3D data more efficiently by representing the scene from the RGB image and the Bird's Eye View (BEV). The whole network is composed of two parts: one is the 2D Proposal Network for 2D region proposal generation, and the other is the 3D Region-based Fusion Network to predict the 2D locations, orientations, and 3D locations of the object. First, we fuse the BEV feature map and the RGB image to enhance the input. Second, we adopt the 3D encoding form with 2D-3D bounding box consistency constraints and design ROI-wise feature fusion to predict location information. Our experimental evaluation on both the KITTI as well as a large scale 3D object detection benchmark shows signicant improvements over the state of the art.
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Vehicle Information and Communication Systems (IJVICS):
Login with your Inderscience username and password:
Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.
If you still need assistance, please email subs@inderscience.com