Deep Learning in Autonomous Vehicles: A Review
Main Article Content
Abstract
Autonomous vehicles (AVs) represent a transformative technology that has the potential to revolutionize transportation systems worldwide. Deep learning, a subset of machine learning, plays a crucial role in the development and operation of AVs by enabling perception, decision-making, and control. This paper provides a comprehensive review of the role of deep learning in AVs, covering topics such as object detection, lane detection, traffic sign recognition, localization, mapping, behaviour planning, and control systems. The paper also discusses the challenges and limitations of deep learning in AVs, including data quality and quantity, safety and security concerns, environmental factors, and regulatory and legal challenges. Furthermore, the paper examines recent advancements in deep learning models for AVs, integration with other technologies, potential impacts on society and industry, and future research directions. By providing a detailed overview of deep learning applications in AVs, this review aims to contribute to the ongoing discourse surrounding the development and deployment of autonomous vehicles.
Downloads
Metrics
Article Details
This work is licensed under a Creative Commons Attribution 4.0 International License.
You are free to:
- Share — copy and redistribute the material in any medium or format for any purpose, even commercially.
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
- The licensor cannot revoke these freedoms as long as you follow the license terms.
Under the following terms:
- Attribution — You must give appropriate credit , provide a link to the license, and indicate if changes were made . You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.
Notices:
You do not have to comply with the license for elements of the material in the public domain or where your use is permitted by an applicable exception or limitation .
No warranties are given. The license may not give you all of the permissions necessary for your intended use. For example, other rights such as publicity, privacy, or moral rights may limit how you use the material.
References
Bojarski, M., et al. (2016). End to end learning for self-driving cars.
Ren, S., et al. (2015). Faster r-cnn: Towards real-time object detection with region proposal networks.
Li, Y., et al. (2018). Deep learning for traffic sign recognition: A review.
Stallkamp, J., et al. (2012). Man vs. computer: Benchmarking machine learning algorithms for traffic sign
recognition.
Mur-Artal, R., et al. (2017). ORB-SLAM2: An open-source SLAM system for monocular, stereo, and RGBD cameras.
Kuipers, J. (2018). Visual place recognition: A survey.
Zhang, W., et al. (2016). Towards driverless cars: A review of visual-based ground vehicle motion control.
Peng, H., et al. (2018). Deeploco: Fully convolutional localization networks for precise vehicle localization.
Sun, L., et al. (2017). A robust vehicle detection and classification system for traffic surveillance in daytime.
Shokri, R., et al. (2017). Deep models under the GAN: information leakage from collaborative deep
learning.
Chen, X., et al. (2018). Deepdriving: Learning affordance for direct perception in autonomous driving.
Schmidhuber, J. (2015). Deep learning in neural networks: An overview.
Goodfellow, I., et al. (2016). Deep learning.
Zhang, W., et al. (2017). Visual-based ground vehicle motion control: A review.
Redmon, J., et al. (2016). You only look once: Unified, real-time object detection.
Zou, W., et al. (2019). Object detection in 20 years: A survey.
Wang, X., et al. (2018). Deep learning for sensor-based activity recognition: A survey.
Su, S., et al. (2018). A survey of deep learning-based object detection.
Li, S., et al. (2019). A review of deep learning techniques for image segmentation.
Tang, J., et al. (2018). A survey on deep learning in medical image analysis.
Wang, S., et al. (2017). Deep learning for sensor-based human activity recognition: A survey.
Nguyen, A., et al. (2015). A survey of techniques for event detection in video.
Parkhi, O. M., et al. (2015). Deep face recognition.
Feichtenhofer, C., et al. (2016). Convolutional two-stream network fusion for video action recognition.
Simonyan, K., et al. (2014). Two-stream convolutional networks for action recognition in videos.
Wang, L., et al. (2016). Temporal segment networks: Towards good practices for deep action recognition.
He, K., et al. (2016). Deep residual learning for image recognition.
Szegedy, C., et al. (2015). Going deeper with convolutions.
Lin, T. Y., et al. (2017). Focal loss for dense object detection.
Girshick, R. (2015). Fast R-CNN.
Dai, J., et al. (2016). R-FCN: Object detection via region-based fully convolutional networks.
Redmon, J., et al. (2017). YOLO9000: Better, faster, stronger.