2023 Volume 2
ARTICLE   Open Access

LSTM-based lane change prediction using Waymo open motion dataset: The role of vehicle operating space

  • Lane change prediction is critical for crash avoidance but challenging, as it requires an understanding of the instantaneous driving environment. With cutting-edge artificial intelligence and sensing technologies, autonomous vehicles (AVs) are expected to have exceptional perception systems that capture their driving environments instantaneously for predicting lane changes. Using the Waymo open motion dataset, this study proposes a framework to explore autonomous driving data and investigate lane change behaviors. Within the framework, a Long Short-Term Memory (LSTM) model is developed to predict lane changing behaviors. The concept of Vehicle Operating Space (VOS) is introduced to quantify a vehicle's instantaneous driving environment as an important indicator for predicting vehicle lane changes. To examine the robustness of the model, a series of sensitivity analyses is conducted by varying the feature selection, prediction horizon, and training data balancing ratios. The test results show that including VOS in the model can speed up the loss decay in the training process and lead to higher accuracy and recall for predicting lane-change behaviors. This study offers an example, along with a methodological framework, for transportation researchers to use emerging autonomous driving data to investigate driving behaviors and traffic environments.
  • [1]

    Li L, Jiang R, He Z, Chen XM, Zhou X. 2020. Trajectory data-based traffic flow studies: a revisit. Transportation Research Part C: Emerging Technologies 114:225−40

    doi: 10.1016/j.trc.2020.02.016

    [2]

    Wang X, Khattak AJ, Liu J, Masghati-Amoli G, Son S. 2015. What is the level of volatility in instantaneous driving decisions? Transportation Research Part C: Emerging Technologies 58:413−27

    doi: 10.1016/j.trc.2014.12.014

    [3]

    Liu J, Khattak A, Wang X. 2017. A comparative study of driving performance in metropolitan regions using large-scale vehicle trajectory data: Implications for sustainable cities. International Journal of Sustainable Transportation 11(3):170−85

    doi: 10.1080/15568318.2016.1230803

    [4]

    Liu J, Khattak A, Wang X. 2015. The role of alternative fuel vehicles: Using behavioral and sensor data to model hierarchies in travel. Transportation Research Part C: Emerging Technologies 55:379−92

    doi: 10.1016/j.trc.2015.01.028

    [5]

    Liu J, Khattak AJ. 2016. Delivering improved alerts, warnings, and control assistance using basic safety messages transmitted between connected vehicles. Transportation Research Part C: Emerging Technologies 68:83−100

    doi: 10.1016/j.trc.2016.03.009

    [6]

    Rios-Torres J, Liu J, Khattak A. 2019. Fuel consumption for various driving styles in conventional and hybrid electric vehicles: Integrating driving cycle predictions with fuel consumption optimization. International Journal of Sustainable Transportation 13(2):123−37

    doi: 10.1080/15568318.2018.1445321

    [7]

    Liu J, Wang X, Khattak A. 2016. Customizing driving cycles to support vehicle purchase and use decisions: Fuel economy estimation for alternative fuel vehicle users. Transportation Research Part C: Emerging Technologies 67:280−98

    doi: 10.1016/j.trc.2016.02.016

    [8]

    Fu X, Nie Q, Liu J, Khattak A, Hainen A, et al. 2022. Constructing spatiotemporal driving volatility profiles for connected and automated vehicles in existing highway networks. Journal of Intelligent Transportation Systems 26(5):572−85

    doi: 10.1080/15472450.2021.1944133

    [9]

    Liu J, Khattak A. 2020. Informed decision-making by integrating historical on-road driving performance data in high-resolution maps for connected and automated vehicles. Journal of Intelligent Transportation Systems 24(1):11−23

    doi: 10.1080/15472450.2019.1699076

    [10]

    Wolshon B, Hatipkarasulu Y. 2000. Results of car following analyses using global positioning system. Journal of Transportation Engineering 126:324−31

    doi: 10.1061/(asce)0733-947x(2000)126:4(324)

    [11]

    Kerner BS, Demir C, Herrtwich RG, Klenov SL, Rehborn H, et al. 2005. Traffic state detection with floating car data in road networks. Proceedings 2005 IEEE Intelligent Transportation Systems, 16 September 2005, Vienna, Austria. USA: IEEE. pp. 44−49. https://doi.org/10.1109/ITSC.2005.1520133

    [12]

    NGSIM. 2006. The Next Generation Simulation Program. http://ops.fhwa.dot.gov/trafficanalysistools/ngsim.htm

    [13]

    Knoop VL, Hoogendoorn SP, van Zuylen HJ. 2008. Capacity reduction at incidents: Empirical data collected from a helicopter. Transportation Research Record: Journal of the Transportation Research Board 2071:19−25

    doi: 10.3141/2071-03

    [14]

    Zhang W, Jordan G, Livshits V. 2016. Generating a vehicle trajectory database from time-lapse aerial photography. Transportation Research Record: Journal of the Transportation Research Board 2594:148−58

    doi: 10.3141/2594-18

    [15]

    Krajewski R, Bock J, Kloeker L, Eckstein L. 2018. The highD dataset: A drone dataset of naturalistic vehicle trajectories on German highways for validation of highly automated driving systems. 2018 21st International Conference on Intelligent Transportation Systems (ITSC), 4−7 November 2018, Maui, HI, USA. USA: IEEE. pp. 2118−25. https://doi.org/10.1109/ITSC.2018.8569552

    [16]

    NHTSA. 2015. Safety Pilot Model Deployment. https://catalog.data.gov/dataset/safety-pilot-model-deployment-data

    [17]

    Wiggers K. 2019. Lyft releases open source data set for autonomous vehicle development. https://venturebeat.com/2019/07/23/lyft-releases-open-source-data-set-for-autonomous-vehicle-development/ (Accessed October 6, 2019)

    [18]

    Waymo. 2020. Waymo Open Dataset. https://waymo.com/open/ (Accessed July 6, 2020)

    [19]

    Agarwal S, Vora A, Pandey G, Williams W, Kourous H, et al. 2020. Ford multi-av seasonal dataset. The International Journal of Robotics Research 39(12):1367−76

    doi: 10.1177/02783649209614

    [20]

    Deo N, Trivedi MM. 2018. Convolutional social pooling for vehicle trajectory prediction. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 18-22 June 2018, Salt Lake City, UT, USA. USA: IEEE. pp. 1549−15498. https://doi.org/10.1109/CVPRW.2018.00196

    [21]

    Zhao T, Xu Y, Monfort M, Choi W, Baker C, et al. 2019. Multi-agent tensor fusion for contextual trajectory prediction. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15−20 June 2019. USA: IEEE. pp. 12118−26. https://doi.org/10.1109/CVPR.2019.01240

    [22]

    Hou L, Xin L, Li SE, Cheng B, Wang W. 2020. Interactive trajectory prediction of surrounding road users for autonomous driving using structural-LSTM network. IEEE Transactions on Intelligent Transportation Systems 21:4615−25

    doi: 10.1109/TITS.2019.2942089

    [23]

    Song R, Li B. 2022. Surrounding vehicles' lane change maneuver prediction and detection for intelligent vehicles: a comprehensive review. IEEE Transactions on Intelligent Transportation Systems 23:6046−62

    doi: 10.1109/TITS.2021.3076164

    [24]

    Zheng Y. 2015. Trajectory data mining: An overview. ACM Transactions on Intelligent Systems and Technology 6:1−41

    doi: 10.1145/2743025

    [25]

    Macadam CC. 2003. Understanding and modeling the human driver. Vehicle System Dynamics 40:101−34

    doi: 10.1076/vesd.40.1.101.15875

    [26]

    Lefèvre S, Vasquez D, Laugier C. 2014. A survey on motion prediction and risk assessment for intelligent vehicles. ROBOMECH Journal 1:1

    doi: 10.1186/s40648-014-0001-z

    [27]

    Huang Y, Du J, Yang Z, Zhou Z, Zhang L, et al. 2022. A survey on trajectory-prediction methods for autonomous driving. IEEE Transactions on Intelligent Vehicles 7:652−74

    doi: 10.1109/TIV.2022.3167103

    [28]

    Kim B, Kang CM, Kim J, Lee SH, Chung CC, et al. 2017. Probabilistic vehicle trajectory prediction over occupancy grid map via recurrent neural network. 2017 IEEE 20th International Conference on Intelligent Transportation Systems, Yokohama, Japan, 16−19 October 2017. USA: IEEE. pp. 399−404. https://doi.org/10.1109/ITSC.2017.8317943

    [29]

    Zheng J, Suzuki K, Fujita M. 2014. Predicting driver's lane-changing decisions using a neural network model. Simulation Modelling Practice and Theory 42:73−83

    doi: 10.1016/j.simpat.2013.12.007

    [30]

    Dang HQ, Fürnkranz J, Biedermann A, Hoepfl M. 2017. Time-to-lane-change prediction with deep learning. 2017 IEEE 20th International Conference on Intelligent Transportation Systems. Yokohama, Japan, 16−19 October 2017. USA: IEEE. pp. 1−7. https://doi.org/10.1109/ITSC.2017.8317674

    [31]

    Benterki A, Boukhnifer M, Judalet V, Choubeila M. 2019. Prediction of surrounding vehicles lane change intention using machine learning. 2019 10th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS), Metz, France, 18−21 September 2019. USA: IEEE. pp. 839−43. https://doi.org/10.1109/IDAACS.2019.8924448

    [32]

    Tang L, Wang H, Zhang W, Mei Z, Li L. 2020. Driver lane change intention recognition of intelligent vehicle based on long short-term memory network. IEEE Access 8:136898−905

    doi: 10.1109/ACCESS.2020.3011550

    [33]

    Casas S, Luo W, Urtasun R. 2018. Intentnet: Learning to predict intention from raw sensor data. 2nd Conference on Robot Learning (CoRL), Zurich, Switzerland. Zürich, Switzerland: PMLR. pp. 947−56.

    [34]

    Chai Y, Sapp B, Bansal M, Anguelov D. 2019. MultiPath: multiple probabilistic anchor trajectory hypotheses for behavior prediction. arXiv Preprint

    doi: 10.48550/arXiv.1910.05449

    [35]

    Zeng W, Luo W, Suo S, Sadat A, Yang B, et al. 2019. End-to-end interpretable neural motion planner. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15−20 June 2019. USA: IEEE, pp. 8652−61. https://doi.org/10.1109/CVPR.2019.00886

    [36]

    Hong J, Sapp B, Philbin J. 2019. Rules of the road: predicting driving behavior with a convolutional model of semantic interactions. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 15-20 June 2019, Long Beach, CA, USA. USA: IEEE, 8446−54 https://doi.org/10.1109/CVPR.2019.00865

    [37]

    Tan HS, Huang J. 2006. DGPS-based vehicle-to-vehicle cooperative collision warning: engineering feasibility viewpoints. IEEE Transactions on Intelligent Transportation Systems 7:415−28

    doi: 10.1109/TITS.2006.883938

    [38]

    Polychronopoulos A, Tsogas M, Amditis AJ, Andreone L. 2007. Sensor fusion for predicting vehicles' path for collision avoidance systems. IEEE Transactions on Intelligent Transportation Systems 8:549−62

    doi: 10.1109/TITS.2007.903439

    [39]

    Broadhurst A, Baker S, Kanade T. 2005. Monte Carlo Road safety reasoning. IEEE Proceedings. Intelligent Vehicles Symposium, Las Vegas, NV, USA, 6-8 June 2005. USA: IEEE. pp. 319−24. https://doi.org/10.1109/IVS.2005.1505122

    [40]

    Joseph J, Doshi-Velez F, Huang AS, Roy N. 2011. A Bayesian nonparametric approach to modeling motion patterns. Autonomous Robots 31:383−400

    doi: 10.1007/s10514-011-9248-x

    [41]

    Aoude G, Joseph J, Roy N, How J. 2011. Mobile Agent Trajectory Prediction using Bayesian Nonparametric Reachability Trees. Infotech@Aerospace Conferences, 29−31 March 2011, St. Louis, Missouri. Reston, Virginia: AIAA. pp. 1−17. https://doi.org/10.2514/6.2011-1512

    [42]

    Käfer E, Hermes C, Wöhler C, Ritter H, Kummert F. Recognition of situation classes at road intersections. 2010 IEEE International Conference on Robotics and Automation. Anchorage, AK, USA, 3−7 May 2010. USA: IEEE. pp. 3960−65. https://doi.org/10.1109/ROBOT.2010.5509919

    [43]

    Lawitzky A, Althoff D, Passenberg CF, Tanzmeister G, Wollherr D, et al. 2013. Interactive scene prediction for automotive applications. 2013 IEEE Intelligent Vehicles Symposium, Gold Coast, QLD, Australia, 23-26 June 2013. USA: IEEE. pp. 1028−33 https://doi.org/10.1109/IVS.2013.6629601

    [44]

    Oliver N, Pentland AP. 2000. Graphical models for driver behavior recognition in a SmartCar. Proceedings of the IEEE Intelligent Vehicles Symposium 2000 (Cat. No. 00TH8511), Dearborn, MI, USA, 5 October 2000. USA: IEEE. pp. 7−12 https://doi.org/10.1109/IVS.2000.898310

    [45]

    Liebner M, Baumann M, Klanner F, Stiller C. 2012. Driver intent inference at urban intersections using the intelligent driver model. 2012 IEEE Intelligent Vehicles Symposium, Madrid, Spain, 3-7 June 2012. USA: IEEE. pp. 1162−67. https://doi.org/10.1109/IVS.2012.6232131

    [46]

    Mahajan V, Katrakazas C, Antoniou C. 2020. Prediction of lane-changing maneuvers with automatic labeling and deep learning. Transportation Research Record: Journal of the Transportation Research Board 2674:336−47

    doi: 10.1177/0361198120922210

    [47]

    Li D, Ma C. 2022. Research on lane change prediction model based on GBDT. Physica A: Statistical Mechanics and Its Applications 608:128290

    doi: 10.1016/j.physa.2022.128290

    [48]

    Xue Q, Xing Y, Lu J. 2022. An integrated lane change prediction model incorporating traffic context based on trajectory data. Transportation Research Part C: Emerging Technologies 141:103738

    doi: 10.1016/j.trc.2022.103738

    [49]

    Greff K, Srivastava RK, Koutník J, Steunebrink BR, Schmidhuber J. 2017. LSTM: a search space odyssey. IEEE Transactions on Neural Networks and Learning Systems 28:2222−32

    doi: 10.1109/TNNLS.2016.2582924

    [50]

    Zeng C, Ma C, Wang K, Cui Z. 2022. Predicting vacant parking space availability: a DWT-Bi-LSTM model. Physica A: Statistical Mechanics and Its Applications 599:127498

    [51]

    Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, et al. 2017. Attention is all you need. Advances in neural information processing systems 30 (NIPS 2017). https://proceedings.neurips.cc/paper_files/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html

    [52]

    Liu Y, Zhang J, Fang L, Jiang Q, Zhou B. 2021. Multimodal motion prediction with stacked transformers. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 20-25 June 2021. USA: IEEE. pp. 7573−82. https://doi.org/10.1109/CVPR46437.2021.00749

    [53]

    Alahi A, Goel K, Ramanathan V, Robicquet A, Li F, et al. 2016. Social LSTM: Human Trajectory Prediction in Crowded Spaces. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, June 27-30, 2016. USA: IEEE. pp. 961−71. https://doi.org/10.1109/CVPR.2016.110

    [54]

    Messaoud K, Yahiaoui I, Verroust-Blondet A, Nashashibi F. 2019. Non-local Social Pooling for Vehicle Trajectory Prediction. 2019 IEEE Intelligent Vehicles Symposium (IV), Paris, France, June 9-12, 2019. USA: IEEE. pp. 975−80

    [55]

    Messaoud K, Yahiaoui I, Verroust-Blondet A, Nashashibi F. 2021. Attention based vehicle trajectory prediction. IEEE Transactions on Intelligent Vehicles 6:175−85

    doi: 10.1109/TIV.2020.2991952

    [56]

    Gao J, Sun C, Zhao H, Shen Y, Anguelov D, et al. 2020. VectorNet: Encoding HD Maps and Agent Dynamics From Vectorized Representation. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 13-19, 2020. Seattle, WA, USA. USA: IEEE. pp. 11525−33. https://doi.org/10.1109/IVS.2019.8813829

    [57]

    Zhao H, Gao J, Lan T, Sun C, Sapp B, et al. 2021. Tnt: Target-driven trajectory prediction. Conference on Robot Learning (CoRL 2021), London, United Kingdom & Virtual, 8−11 Nov, 2021. PMLR. pp. 895−904.

    [58]

    Gu J, Sun C, Zhao H. DenseTNT: end-to-end trajectory prediction from dense goal sets. 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 10−17 October 2021. USA: IEEE. pp. 15283−92. https://doi.org/10.1109/ICCV48922.2021.01502

    [59]

    Choi S, Kim J, Yeo H. 2019. Attention-based recurrent neural network for urban vehicle trajectory prediction. Procedia Computer Science 151:327−34

    doi: 10.1016/j.procs.2019.04.046

    [60]

    Lin L, Li W, Bi H, Qin L. 2022. Vehicle trajectory prediction using LSTMs with spatial–temporal attention mechanisms. IEEE Intelligent Transportation Systems Magazine 14:197−208

    doi: 10.1109/MITS.2021.3049404

    [61]

    Hawkins AJ. 2019. Waymo is making some of its self-driving car data available for free to researchers. www.theverge.com/2019/8/21/20822755/waymo-self-driving-car-data-set-free-research (Retrieved 8 June 2022)

    [62]

    Hu X, Zheng Z, Chen D, Zhang X, Sun J. 2022. Processing, assessing, and enhancing the Waymo autonomous vehicle open dataset for driving behavior research. Transportation Research Part C: Emerging Technologies 134:103490

    doi: 10.1016/j.trc.2021.103490

    [63]

    Ettinger S, Cheng S, Caine B, Liu C, Zhao H, et al. Large scale interactive motion forecasting for autonomous driving: the waymo open motion dataset. 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 10-17 October 2021. USA: IEEE. pp. 9690−99. https://doi.org/10.1109/ICCV48922.2021.00957

    [64]

    Park SH, Kim B, Kang CM, Chung CC, Choi JW. Sequence-to-sequence prediction of vehicle trajectory via LSTM encoder-decoder architecture. 2018 IEEE Intelligent Vehicles Symposium. Changshu, China, 26−30 June 2018. USA: IEEE. pp. 1672−78. https://doi.org/10.1109/IVS.2018.8500658

    [65]

    Liu J, Mao X, Fang Y, Zhu D, Meng MQH. 2021. A survey on deep-learning approaches for vehicle trajectory prediction in autonomous driving. 2021 IEEE International Conference on Robotics and Biomimetics, Sanya, China, 27-31 December 2021. USA: IEEE. pp. 978−85. https://doi.org/10.1109/ROBIO54168.2021.9739407

    [66]

    Xing Y, Lv C, Wang H, Wang H, Ai Y, et al. 2019. Driver lane change intention inference for intelligent vehicles: framework, survey, and challenges. IEEE Transactions on Vehicular Technology 68:4377−90

    doi: 10.1109/TVT.2019.2903299

    [67]

    Kaur H, Pannu HS, Malhi AK. 2020. A systematic review on imbalanced data challenges in machine learning. ACM Computing Surveys 52:1−36

    doi: 10.1145/3343440

  • Cite this article

    Fu X, Liu J, Huang Z, Hainen A, Khattak AJ. 2023. LSTM-based lane change prediction using Waymo open motion dataset: The role of vehicle operating space. Digital Transportation and Safety 2(2):112−123 doi: 10.48130/DTS-2023-0009


Digital Transportation and Safety 2023, 2(2): 112−123


    • Vehicle trajectory data has been frequently used in transportation research[1−9] to provide insights from the microscopic level (vehicle behaviors) to the macroscopic level (traffic conditions). Methods of collecting vehicle trajectories include probe vehicles with GPS devices[2−7, 10, 11], video-based vehicle detection[12−15], connected vehicles[8, 9, 16], and autonomous vehicles[17−19]. Advances in these technologies have expanded research on vehicular behaviors, with a focus on the microscopic operations of individual vehicles and their interactions with other traffic participants.

      Vehicles equipped with GPS devices provide a continuous record of their tracks, which can be used to monitor traffic and reveal the overall dynamics of vehicular flow. However, because the penetration rate of probe vehicles is limited, these data may not be adequate to capture the detailed interactions between vehicles. Connected vehicle projects such as the Safety Pilot Model Deployment[16] provide comprehensive vehicle operation information (e.g., speed, acceleration, and maneuvers), but are still limited by the penetration rate, which makes it difficult to examine interactions with other traffic agents. Continuous vehicle trajectories on a specific road segment can be recorded and identified using high-resolution bird's-eye view cameras[12−15]. The Next Generation Simulation (NGSIM) trajectory data[12], which was acquired using this method, is widely used in traffic analysis and behavior prediction[20−23].

      The latest advancements in autonomous driving have broadened the scope of data sources for understanding traffic beyond just vehicles, now encompassing both dynamic traffic agents and static environments. Autonomous vehicle industry players such as Waymo and Lyft have released their data to the public to facilitate related research[17,18]. The Waymo dataset provides comprehensive information on vehicle operations and the surrounding environments. Waymo released both perception and motion data from autonomous vehicles, which encompass the trajectories of surrounding traffic agents as well as the geometrics and connections of the road. These data can be used to understand vehicle behaviors and interactions under the constraints of road geometrics.

      Research on trajectory data[24] has demonstrated that traffic agents influence each other while operating on the roads. Human drivers follow a process of observing the surrounding traffic objects and environments, planning a path, and then performing the proper control to maintain safe driving[25]. Autonomous vehicles follow a similar process of operation: perception, localization, planning, and control[23]. They need to identify the potential risks from surrounding vehicles and make the right decisions to avoid collisions. This requires self-driving cars not only to track the positions of surrounding objects but also to forecast their future positions and behaviors.

      Understanding the intentions of surrounding vehicles and predicting their trajectories and behaviors remains a focus of transportation research[26,27]. The prediction of vehicle operations can be roughly categorized into two streams: (1) directly predicting the trajectories[20−22, 28] and (2) predicting the vehicle maneuvers[23, 29−32]. Trajectory prediction uses either kinematic or machine learning methods to forecast the exact position coordinates of the vehicles. However, due to the inherent stochasticity and sparsity of vehicle trajectories, it is difficult to provide reasonable predictions when the future positions associated with different intentions are widely dispersed. Therefore, some researchers sample trajectory proposals from the historical dataset and predict future motions based on drivers' intentions[33−36]. Maneuver prediction first defines a set of vehicle operations (e.g., lane keeping, lane changing) and then uses previous vehicle trajectories and kinematics to infer the future intentions of the vehicles. Among driving maneuvers such as lane keeping, lane changing, and turning, lane changing is a critical behavior that can pose risks for autonomous vehicles. Lane changing maneuvers involve both longitudinal and lateral control of vehicles and require cooperation between the ego vehicle and surrounding vehicles. For self-driving vehicles, accurately predicting the potential lane changing behaviors of other vehicles is a critical task to ensure operational safety.

      This research predicts the lane changing behaviors of vehicles using the Waymo motion dataset. As introduced above, the Waymo dataset provides high-frequency vehicle tracks and various operation scenarios. A processing framework is developed to identify the lane changing behaviors of all detected vehicles and extract features for model training. The concept of Vehicle Operating Space (VOS) is introduced to evaluate the space around the vehicle that is available for possible maneuvers. The VOS is also added to the feature map to test its impact on prediction performance. A Long Short-Term Memory (LSTM) model is developed for predicting lane change behavior. To examine the robustness of the model, a series of sensitivity tests is conducted on the feature inputs, prediction horizons, and training data balancing.

      This study contributes by introducing the concept of the VOS to incorporate the interactions between vehicles. The VOS provides insights into the underlying factors influencing lane changing behaviors. Unlike end-to-end models, the LSTM with crafted features proposed in this research uses domain knowledge to reduce the training cost and improve the interpretability of the model. This study also reveals how the prediction horizon and training dataset balancing affect prediction performance. These findings can help researchers and practitioners optimize model performance and improve the accuracy of lane changing behavior prediction. Overall, this study serves as a reference for future research on autonomous driving and vehicle behavior prediction.

    • Before the rise of deep learning, researchers estimated vehicle trajectories using kinematic or behavioral models. According to the survey by Lefèvre et al.[26], the traditional prediction methods can be classified into three levels, with an increasing degree of abstraction: (1) physics-based motion models, (2) maneuver-based models, and (3) interaction-aware motion models.

      Physics-based motion models depend on the dynamics and kinematics of the vehicles. Following the laws of physics, the vehicle positions in the short-term future can be inferred from the current vehicle kinematics, for instance, the current position, heading, and speed. Based on this idea, Gaussian noise simulation[37, 38] and Monte Carlo simulation models[39] were developed to incorporate the uncertainty of the kinematics in prediction.
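      The idea behind physics-based prediction can be illustrated with a minimal sketch. It assumes a constant-velocity model and independent Gaussian noise on heading and speed; the function names, noise levels, and time step are hypothetical choices for illustration and are not taken from the cited studies.

```python
import math
import random


def predict_constant_velocity(x, y, heading, speed, horizon, dt=0.1):
    """Propagate a position forward, assuming constant heading (rad) and speed (m/s)."""
    steps = int(round(horizon / dt))
    return [
        (x + speed * math.cos(heading) * k * dt,
         y + speed * math.sin(heading) * k * dt)
        for k in range(1, steps + 1)
    ]


def monte_carlo_endpoints(x, y, heading, speed, horizon,
                          sigma_heading=0.02, sigma_speed=0.5,
                          n_samples=1000, seed=0):
    """Sample end positions by perturbing heading and speed with Gaussian noise.

    The noise levels are assumed values, not calibrated parameters.
    """
    rng = random.Random(seed)
    endpoints = []
    for _ in range(n_samples):
        h = rng.gauss(heading, sigma_heading)
        v = max(0.0, rng.gauss(speed, sigma_speed))  # speed cannot be negative
        endpoints.append((x + v * math.cos(h) * horizon,
                          y + v * math.sin(h) * horizon))
    return endpoints


# A vehicle at the origin heading along the x-axis at 10 m/s, predicted 1 s ahead.
path = predict_constant_velocity(0.0, 0.0, 0.0, 10.0, horizon=1.0)
samples = monte_carlo_endpoints(0.0, 0.0, 0.0, 10.0, horizon=1.0)
mean_x = sum(p[0] for p in samples) / len(samples)
```

      The deterministic path gives a single forecast, while the spread of the sampled end positions gives a rough picture of how kinematic uncertainty grows with the prediction horizon.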

      Instead of directly predicting the vehicle trajectories, the maneuver-based models generally follow a two-step process: first, infer the intention of the driver, and second, estimate the trajectories in either a deterministic or a stochastic manner. An intention model based on the vehicle states (position, acceleration, etc.), road network information (geometry and topology of the road, signal control, traffic rules, etc.), and driver behavior (head movements, driving style, etc.) is developed to determine the possible maneuver of the vehicle in the near future. With the inferred intention, the model selects one possible set of trajectories from the prototype trajectories or generates trajectories from Gaussian Processes[40, 41].

      Interaction-aware motion models provide a more comprehensive method for trajectory prediction by considering the interactions between vehicles. There are two major methods for this kind of model, one based on trajectory prototypes[42, 43] and the other based on dynamic Bayesian networks[44, 45].

    • The emergence of machine learning provides researchers and practitioners with a powerful tool to estimate vehicle trajectories. Mahajan et al.[46], Li & Ma[47], and Xue et al.[48] used traditional machine learning methods (e.g., gradient boosting, XGBoost, and support vector machines) for lane changing prediction. Compared with the traditional methodologies, deep learning can capture the tracks of the ego vehicle and its interactions with other road users in complex driving scenarios[27]. The major contributions of current research can be categorized into two parts: (1) innovations in the deep learning backbone and (2) the way contextual information is incorporated. Table 1 summarizes the methodologies and the contextual information considered in these studies.

      Table 1.  Summary of deep learning in trajectory and behavior prediction.

      Reference | Methodology | Contextual information
      Deo et al.[20] | LSTM, CNN, Social Pooling | Surrounding Vehicles
      Hou et al.[22] | LSTM | Surrounding Vehicles
      Kim et al.[28] | LSTM | Surrounding Vehicles
      Liu et al.[52] | Stacked Transformer | HD map, Surrounding Vehicles
      Messaoud et al.[54,55] | Attention, LSTM, Social Pooling | Surrounding Vehicles
      Gao et al.[56] | VectorNet | HD map, Surrounding Vehicles
      Zhao et al.[57] | LSTM, CNN, Social Pooling | Surrounding Vehicles, Satellite Image
      Zhao et al.[57], Gu et al.[58] | VectorNet, Goal-based Prediction | HD map, Surrounding Vehicles
      Choi et al.[59] | Attention, LSTM | -
      Lin et al.[60] | Attention, LSTM | -
    • Trajectory data sequences carry an inherent notion of progression in steps and time. This requires the prediction methodology to have a 'sequential memory' that captures and memorizes sequential patterns. Traditional neural network structures (e.g., the multi-layer perceptron) lack a built-in mechanism for handling sequential data. To address this issue, the Recurrent Neural Network (RNN) was introduced, followed by the Long Short-Term Memory (LSTM) network and the attention mechanism.

      Trajectory prediction is a typical sequence-to-sequence prediction problem. Figure 1 illustrates the general model structures for trajectory prediction. Note that the encoder and decoder of the sequence-to-sequence model can be either RNN layers or LSTM layers, which are shown in Fig. 1a & b. Compared with the RNN, the LSTM is more widely used because of its ability to mitigate vanishing and exploding gradients[49,50]. As shown in Fig. 1, the LSTM has a more complex design that adds memory cells and three types of gates: forget gate, input gate, and output gate. This design gives LSTM networks a key advantage over traditional RNNs: the ability to selectively remember or forget information over extended periods. This is made possible by the forget gate, which allows the network to discard irrelevant information, and the input gate, which enables the selective storage of new information. As a result, LSTM networks can more effectively capture long-term dependencies and prevent the loss of critical information over time.

      Figure 1. 

      Recurrent neural network and long short-term memory cell structure.

      Another innovation in trajectory prediction is the attention mechanism, which was introduced to deal with vanishing gradients in long sequences[51]. The Transformer, which relies on the attention mechanism without any RNN, has also been applied to trajectory prediction[52]. Transformer-based models perform better in long sequence prediction and can deal with missing input observations.

    • Vehicle behaviors and trajectories are strongly tied to contextual features, e.g., road geometrics and traffic flow. One method is to incorporate the relative positions of the surrounding vehicles in a grid map as input features[28]. A popular way of encoding the surrounding environment is social pooling, which was introduced by Alahi et al.[53]. Social pooling refers to a convolutional neural network applied to the bird's-eye view of the environment around the object. Deo & Trivedi[20] introduced social pooling into trajectory prediction by encoding the surroundings into grid cells for LSTM training. Zhao et al.[21] applied a CNN to satellite images to extract the latent features of the environment (e.g., road geometrics) and then used social pooling to cover the contextual information. Messaoud et al.[54] added the attention mechanism to the previous social pooling structure to capture the interactions between all the surrounding vehicles.

      Most of the previous research used the NGSIM data collected from a limited number of freeway segments. The emergence of autonomous driving data such as the Waymo motion dataset expands both the diversity and the magnitude of the available data. Additionally, the road geometrics and traffic rules (e.g., stop signs, signal control) are coded and included in the dataset. This enables the neural network to comprehensively learn and understand the interactions between vehicle behaviors and the surrounding environments. Waymo and its research team have proposed several end-to-end frameworks to learn the vectorization of road geometrics and trajectories. Gao et al.[56] proposed representing the agent dynamics and HD map features in vectorized form and developed a hierarchical graph network to learn the latent relationships. TNT and DenseTNT were proposed based on VectorNet to extract the interactions between the vehicles and the surrounding environments[57, 58].

    • Waymo, as one of the pilot companies in autonomous driving, first released its self-driving car perception data and then followed it with the motion data[61]. In comparison with previous autonomous driving datasets such as Lyft Level 5, NuScenes, and Argoverse, the Waymo motion data provides a larger and more diverse dataset with detailed road geometrics[62]. The Waymo motion data is composed of more than 100,000 segments covering over 1,750 km. For each segment, comprehensive static map features are encoded, including but not limited to the lane centers, boundaries, stop signs, signal control, and the boundary types. In a later release, the lane connection and neighbor information were added, but only for part of the dataset[63]. The object track states are sampled at 10 Hz and each segment contains 20 seconds of tracking. For prediction, the 20-second tracking is further split into 9-second scenarios (1 second of history and 8 seconds of future data) with a 5-second overlap. The object tracks encompass the motion features of all other traffic agents (e.g., vehicles, cyclists) around the ego vehicle.

      The structure of the Waymo motion data is shown in Fig. 2. Note that Waymo has released several versions of the data, and the versions may differ in structure and contain different sets of features. In this research, the subdataset 'uncompressed_scenario_validation_validation' is used, and Fig. 2 introduces the structure of this dataset. The information within one scenario can be categorized into three major parts:

      Figure 2. 

      Structure of the Waymo motion dataset (uncompressed_scenario_validation_validation).

      1. Scenario Features: This category includes the basic information about the current scenario, e.g., the scenario ID, the index of the self-driving car, etc.

      2. HD Map Features: Both the static and dynamic map features are within this category. The dynamic map features include the traffic signal states and the static map features contain comprehensive information about the road geometrics and connection. Waymo provides the explicit coordinates for the lanes, boundaries, crosswalks, etc. For each road segment, the indices of upstream/downstream lanes and the neighbor lanes are recorded in the corresponding attributes.

      3. Tracks: This attribute stores the motions of all observed objects in the scenario. There are a total of four types of objects: vehicles, pedestrians, cyclists, and others. The states of the objects are recorded at a 10 Hz frequency and each state consists of the coordinates of the object (x, y, z), size (width and height), motion (heading, longitudinal and lateral speed), and valid flag.

    • Based on the structure of the Waymo motion dataset, this research develops a conceptual framework to conduct vehicle maneuver prediction from the raw dataset. Figure 3 shows the workflow of the proposed methodology. As shown in the figure, the framework consists of two major sections: (1) data processing and (2) prediction model.

      Figure 3. 

      Conceptual framework.

      The data processing section consists of a series of automated scripts that parse the motion data from the TFRecord files and then extract the metrics used for further modeling. As shown in Fig. 3, this research mainly fetches three sets of motion features: kinematics (position coordinates, speed, and heading), vehicle operating space (VOS), and lane changing behaviors. Figure 4 shows a snapshot of a lane change in the dataset.

      Figure 4. 

      Identification of lane changing behavior.

      Notably, the VOS of a vehicle is captured as the instant-level driving environment. The VOS refers to the immediate object-free space around a vehicle, which influences instantaneous driving decisions such as accelerating and decelerating. Figure 5 conceptualizes the VOS for an ego vehicle, divided into eight zones representing eight directions or dimensions in the vehicle's immediate space: Front, Back, Left, Right, Front Left, Front Right, Back Right, and Back Left. The VOS is measured as an eight-dimensional space, and its shape is determined by the distances between the ego car and its immediate objects (e.g., other road users or roadside units). Figure 5a shows a typical VOS shape that is object-free within the range of an AV's perception system; Figure 5b shows a VOS whose shape is limited by surrounding vehicles.

      Figure 5. 

      Describing the driving buffer around the ego car. (a) Vehicle Operating Space in eight dimensions; (b) Vehicle Operating Space affected by surrounding vehicles.

      As stated above, the VOS of an ego vehicle is defined as an eight-dimensional space; the shape and size of a VOS are determined by the distances between the ego vehicle and its immediate objects in eight dimensions. A safe driving decision may require a driver or ADS to pay attention to all surrounding objects in eight dimensions while the attention may be unequally weighted for different dimensions. For instance, drivers would spend more time focusing on the vehicles in front than those in the back. This study proposes a measure to quantify the VOS by weighting and combining distances between the ego vehicle and its immediate objects in eight dimensions:

      $ {VOS}_{i}=\sum _{j=1}^{8}{w}_{j}{d}_{ij} $ (1)

      Where $ {VOS}_{i} $ represents the VOS measure for an instance $ i $; $ {w}_{j} $ is the weight for the distance on dimension $ j $; $ {d}_{ij} $ is the distance to the nearest object on the dimension $ j $.
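As a minimal sketch of Eq. (1), the weighted sum over the eight zones could be computed as below. The zone names follow Fig. 5; the function name, the dictionary layout, and the weight and distance values are all hypothetical placeholders rather than the values used in this study.

```python
# Zone order follows Fig. 5. All weights and distances below are illustrative only.
ZONES = ("front", "back", "left", "right",
         "front_left", "front_right", "back_right", "back_left")


def vos_measure(distances, weights):
    """Weighted sum of the distance to the nearest object in each zone (Eq. 1)."""
    missing = [z for z in ZONES if z not in distances or z not in weights]
    if missing:
        raise ValueError(f"missing zones: {missing}")
    return sum(weights[z] * distances[z] for z in ZONES)


# Hypothetical example: the front zone is weighted twice as heavily as the others.
example_weights = {z: 1.0 for z in ZONES}
example_weights["front"] = 2.0
example_distances = {z: 10.0 for z in ZONES}
example_vos = vos_measure(example_distances, example_weights)
```

With these placeholder inputs, seven zones contribute 10 each and the front zone contributes 20.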

      Since the Waymo motion data provide comprehensive road geometrics and connections, the lane changing behaviors of the vehicles can be identified by joining the HD map with the vehicle positions. The extracted track information is compiled into tensors for further machine learning modeling. The tensor has the following dimensions.

      $ \mathrm{Tensor}\;(\mathrm{N},\mathrm{T},\mathrm{F}) $ (2)

      Where, $ \mathrm{N} $ indicates the number of tracked traffic objects; $ \mathrm{T} $ indicates the number of consecutive time steps in each object track (91 in total at 10 Hz); $ \mathrm{F} $ represents the number of object features extracted.
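The compilation step can be illustrated with a short sketch that stacks fully valid tracks into a nested list of shape (N, T, F). The input layout (a list of per-step dictionaries with a "valid" flag) and the feature names are assumptions made for illustration; they are not the Waymo parsing API or the authors' scripts.

```python
T_STEPS = 91  # 9.1 s sampled at 10 Hz, as stated in the text


def build_tensor(tracks, feature_names):
    """Stack fully valid tracks into a nested list of shape (N, T, F).

    `tracks` is a hypothetical list of tracks; each track is a list of per-step
    dicts holding a boolean "valid" flag and one value per feature name.
    """
    tensor = []
    for track in tracks:
        # Keep only objects tracked over all T steps without missing data.
        if len(track) != T_STEPS or not all(step["valid"] for step in track):
            continue
        tensor.append([[step[name] for name in feature_names] for step in track])
    return tensor


def shape(tensor):
    """Return (N, T, F) for a non-empty nested-list tensor."""
    return (len(tensor), len(tensor[0]), len(tensor[0][0]))


# Hypothetical example: two complete tracks and one with an invalid step.
features = ["speed_lon", "speed_lat", "heading"]
full = [{"valid": True, "speed_lon": 10.0, "speed_lat": 0.1, "heading": 0.0}
        for _ in range(T_STEPS)]
broken = [dict(step) for step in full]
broken[5]["valid"] = False
demo_shape = shape(build_tensor([full, broken, full], features))
```

In practice the nested list would be converted to a tensor type of the chosen deep learning toolkit before training.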

      To predict vehicle behaviors and trajectories, this research uses the LSTM as the backbone of the prediction model. The LSTM is a deep learning framework that is widely used for sequence-to-sequence prediction, especially in vehicle trajectory prediction[20, 22, 28, 57]. The LSTM is an improvement on the recurrent neural network and can effectively alleviate vanishing gradients[64]. This study uses the vanilla LSTM for lane changing behavior prediction. Figure 1b shows the design of the LSTM cell. As shown in the figure, the LSTM cell consists of a forget gate, an input gate, and an output gate:

      1) Forget gate: The forget gate takes the information from the current input and previous hidden state and uses a sigmoid function to decide the information to be retained.

      2) Input gate: This gate takes the current input and previous hidden state and determines the information to be written onto the internal cell states.

      3) Output gate: The output gate regulates the current hidden state for prediction, which utilizes the previous hidden state and current input.

      The mechanism of the LSTM and gates can be formulated as the following equations. For prediction, the softmax function and linear function are used for behavior and trajectory prediction, respectively.

      $ {f}_{t}=\sigma ({W}_{uf}{u}_{t}+{W}_{hf}{h}_{t-1}+{b}_{f}) $ (3)
      $ {i}_{t}=\sigma ({W}_{ui}{u}_{t}+{W}_{hi}{h}_{t-1}+{b}_{i}) $ (4)
      $ {o}_{t}=\sigma ({W}_{uo}{u}_{t}+{W}_{ho}{h}_{t-1}+{b}_{o}) $ (5)
      $ {c}_{t}={f}_{t}\odot {c}_{t-1}+{i}_{t}\odot \tanh({W}_{uc}{u}_{t}+{W}_{hc}{h}_{t-1}+{b}_{c}) $ (6)
      $ {h}_{t}={o}_{t}\odot \tanh\left({c}_{t}\right) $ (7)

      Where, $ \sigma \left(x\right) $ represents the sigmoid function; $ x\odot y $ represents the element-wise product; $ {u}_{t} $ represents the $ {t}^{th} $ input vector of the sequence; $ W $ is the linear transformation matrix; $ b $ is the bias vector; $ {f}_{t},{i}_{t},{o}_{t} $ represent the outputs of the forget gate, input gate, and output gate, respectively; $ {c}_{t} $ is the cell state vector; $ {h}_{t} $ is the hidden state vector.
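To make Eqs. (3) to (7) concrete, the sketch below performs one LSTM step on plain Python lists. It is an illustrative re-implementation of the equations only, not the PyTorch model trained in this study; the parameter layout (a dictionary keyed by "f", "i", "o", "c") and the helper names are assumptions.

```python
import math


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def affine(W_u, u, W_h, h, b):
    """Compute W_u u + W_h h + b for list-of-lists matrices and list vectors."""
    return [
        sum(w * x for w, x in zip(row_u, u))
        + sum(w * x for w, x in zip(row_h, h))
        + b_k
        for row_u, row_h, b_k in zip(W_u, W_h, b)
    ]


def lstm_step(u_t, h_prev, c_prev, params):
    """One LSTM step; `params` maps "f", "i", "o", "c" to (W_u, W_h, b) triples."""
    pre = {gate: affine(W_u, u_t, W_h, h_prev, b)
           for gate, (W_u, W_h, b) in params.items()}
    f_t = [sigmoid(z) for z in pre["f"]]        # Eq. (3), forget gate
    i_t = [sigmoid(z) for z in pre["i"]]        # Eq. (4), input gate
    o_t = [sigmoid(z) for z in pre["o"]]        # Eq. (5), output gate
    g_t = [math.tanh(z) for z in pre["c"]]      # candidate term inside Eq. (6)
    c_t = [f * c + i * g for f, c, i, g in zip(f_t, c_prev, i_t, g_t)]  # Eq. (6)
    h_t = [o * math.tanh(c) for o, c in zip(o_t, c_t)]                  # Eq. (7)
    return h_t, c_t


def zeros(rows, cols):
    return [[0.0] * cols for _ in range(rows)]


# Hypothetical example: 3 input features, hidden size 2, all parameters zero.
zero_params = {gate: (zeros(2, 3), zeros(2, 2), [0.0, 0.0])
               for gate in ("f", "i", "o", "c")}
h_t, c_t = lstm_step([1.0, 2.0, 3.0], [0.0, 0.0], [1.0, -2.0], zero_params)
```

With all parameters set to zero, every gate outputs 0.5 and the candidate term vanishes, so the cell state is simply halved. This makes the role of the forget gate in Eq. (6) easy to check by hand.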

      The mean binary cross-entropy (BCE) is used for loss calculation and backpropagation. The BCE measures the difference between the predicted probabilities and the actual labels, and penalizes the probabilities based on the difference. The loss function can be formulated as follows:

      $ L=-\frac{1}{N}\sum _{i=1}^{N}\left[{y}_{i}\cdot \log {x}_{i}+(1-{y}_{i})\cdot \log \left(1-{x}_{i}\right)\right] $ (8)

      Where, $ L $ represents the BCE loss for the given batch; $ {y}_{i} $ indicates the actual label of observation $ i $; $ {x}_{i} $ represents the model prediction for observation $ i $.
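A minimal sketch of the mean BCE in Eq. (8) is given below. The clipping constant `eps` is an added safeguard against taking the logarithm of zero and is not part of the equation; the function name is hypothetical.

```python
import math


def mean_bce(labels, probs, eps=1e-12):
    """Mean binary cross-entropy over a batch, following Eq. (8)."""
    if not labels or len(labels) != len(probs):
        raise ValueError("labels and probs must be non-empty and of equal length")
    total = 0.0
    for y, x in zip(labels, probs):
        x = min(max(x, eps), 1.0 - eps)  # assumption: clip to avoid log(0)
        total += y * math.log(x) + (1 - y) * math.log(1.0 - x)
    return -total / len(labels)


# An uninformative prediction of 0.5 costs log(2) per observation,
# while confident correct predictions cost much less.
uninformed = mean_bce([1, 0], [0.5, 0.5])
confident = mean_bce([1, 0], [0.9, 0.1])
```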

    • This study uses the Waymo motion data for training, testing and validation. Limited by computation power and local storage, this research uses only one TFRecord file (File ID: tfrecord-00018-of-00150) for modeling. More data will be employed when stronger computation power becomes available. The dataset used contains a total of 314 scenarios. After data parsing and lane changing identification, 5,503 objects are fully tracked over a 9.1-second period (at 10 Hz) without missing data, and 1,477 lane changing behaviors are observed. The training, testing, and validation datasets are randomly sampled from the original dataset at ratios of 60%, 20%, and 20%, respectively.
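The random 60/20/20 partition can be sketched as follows. The function name and the seed are assumptions; the total of 3,010 observations used in the example is the sum of the set sizes reported in Table 3, not a number stated in this paragraph.

```python
import random


def split_indices(n, ratios=(0.6, 0.2, 0.2), seed=0):
    """Randomly partition range(n) into training, testing, and validation indices."""
    indices = list(range(n))
    random.Random(seed).shuffle(indices)  # seed fixed only for reproducibility
    n_train = round(n * ratios[0])
    n_test = round(n * ratios[1])
    train = indices[:n_train]
    test = indices[n_train:n_train + n_test]
    valid = indices[n_train + n_test:]  # remainder goes to validation
    return train, test, valid


train_idx, test_idx, valid_idx = split_indices(3010)
```

The three index lists are disjoint by construction, so no observation appears in more than one set.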

      The model was trained on the Ubuntu platform with CUDA support. The hyperparameters of the vanilla model can be found in Table 2. Figure 6 shows the changes in training and testing loss during the training process. Both the training and testing loss decrease dramatically in the first five epochs and then keep fluctuating. In most cases, the training loss is lower than the testing loss, which indicates that the model is, to some extent, overfitting. To further examine the model performance, the training and testing accuracy and recall during the training process are recorded in Fig. 7. Note that a threshold of 0.5 is chosen to separate positive from negative predictions. According to Fig. 7, both the training and testing accuracy increase significantly in the first five epochs. The training accuracy varies around 0.825 and the testing accuracy around 0.80. Both accuracies are higher than 0.5, which corresponds to a naïve random guess on the training dataset. The focus of this study is on predicting lane changing behaviors, so recall is an important metric for evaluation. As shown in Fig. 7, both the training and testing recall stabilize at around 0.75, which means 75% of the lane changing behaviors are correctly predicted. Table 3 reports the trained model performance on the training, testing, and validation datasets, respectively. Though the model is to some extent overfitted, the overall performance on the testing dataset supports the robustness of the prediction model.
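For reference, accuracy and recall at the 0.5 threshold can be computed as in the sketch below. Treating a probability exactly equal to the threshold as a positive prediction is an assumption, and the labels and probabilities in the example are made up for illustration.

```python
def accuracy_and_recall(labels, probs, threshold=0.5):
    """Accuracy and recall for binary labels (1 = lane change, 0 = lane keeping)."""
    preds = [1 if p >= threshold else 0 for p in probs]
    tp = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 1)
    tn = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 0)
    fn = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 0)
    accuracy = (tp + tn) / len(labels)
    # Recall is undefined when there are no positive observations.
    recall = tp / (tp + fn) if (tp + fn) else float("nan")
    return accuracy, recall


# Hypothetical batch: four lane changes and four lane-keeping observations.
demo_labels = [1, 1, 1, 1, 0, 0, 0, 0]
demo_probs = [0.9, 0.8, 0.6, 0.4, 0.1, 0.2, 0.3, 0.7]
demo_accuracy, demo_recall = accuracy_and_recall(demo_labels, demo_probs)
```

In this made-up batch, one lane change is missed and one lane-keeping observation is flagged, so three of four lane changes are recovered.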

      Table 2.  Hyperparameter setup for the vanilla model.

      | Hyperparameter | Value |
      |---|---|
      | Learning Rate | 0.005 |
      | Number of Recurrent Layers | 1 |
      | Number of features in hidden state | 64 |
      | Batch Size | 32 |
      | Number of Epochs | 100 |
      | Threshold | 0.5 |
      | Sequence Length | 4 |
      | Selected Features | Longitudinal Speed, Lateral Speed, Heading, VOS |

      Figure 6. 

      Training and testing loss during training process.

      Figure 7. 

      Prediction accuracy and recall for training and testing datasets.

      Table 3.  Model performance on training, testing, and validation datasets.

      | Datasets | Number of observations | Accuracy | Recall |
      |---|---|---|---|
      | Training Set | 1806 | 0.85 | 0.79 |
      | Testing Set | 602 | 0.81 | 0.75 |
      | Validation Set | 602 | 0.79 | 0.74 |
    • The main purpose of developing the LSTM model is to predict the lane changing behaviors of vehicles. Besides the hyperparameters, the prediction accuracy can also be influenced by the model inputs. As shown in Fig. 2, the original Waymo motion data contains comprehensive information about the object kinematics and road environment. However, compiling all of this information into the model can be challenging. In some cases, introducing more information might bias the model results and make it difficult for the model to converge. To test the influence of different inputs, this study conducted a series of sensitivity analyses to examine their potential impacts on the model performance. In this section, three parameters are investigated: (1) selection of the features, (2) prediction horizons, and (3) training dataset balancing.

    • The vanilla model contains four features of the vehicle operation: longitudinal speed, lateral speed, heading direction, and VOS. It can be assumed that significant changes in speed and heading are usually associated with changes in vehicle behavior, while it is uncertain whether VOS plays a role in prediction. This study therefore introduces the coordinates and the VOS separately as input features to examine their impacts on the model performance. Figure 8 shows the loss changes during the training process for each model. The sensitivity test keeps the hyperparameters the same across models and uses the speed and depth of the training loss decay as indicators of the model performance.

      Figure 8. 

      Training loss decay for different feature combinations.

      As shown in Fig. 8, after introducing the vehicle coordinates, both the training and testing loss are larger than those of the other models. The model with coordinates is hard to converge under the given set of hyperparameters. This is possibly because of the large variation and sparsity of the coordinates. Since the Waymo motion data collected vehicle trajectories in various conditions, the coordinate values vary over a large range, from –33,000 to +40,000 (for the TFRecord file used in this study). The sparsity of the trajectories could dominate the training process and make it hard for the model to learn and converge.

      VOS measures the clear space around the vehicle. It is reasonable to expect that a vehicle can only change lanes when the surrounding space allows it. As shown in Fig. 8, introducing the VOS significantly improves the training process and leads to better prediction accuracy and recall (Table 4). The VOS and vehicle trajectories convey information about the surroundings and the ego operation, respectively. Both are assumed to be indicators of lane changing behaviors. However, in end-to-end deep learning models, raw features such as vehicle trajectories may require more training time to extract the latent features. The VOS captures the relative distances to the car of interest. This concept is in line with some prediction practices[56, 65] in which the coordinates are normalized to the last observed position of the target agent for each data sequence. Overall, the VOS, as a manually crafted feature, can reduce the tuning effort and make it easier for the model to converge.
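The normalization practice mentioned above can be sketched as a simple translation of each sequence so that the last observed position of the target agent becomes the origin. This is a simplified illustration (translation only, no rotation to the agent's heading), the function name is hypothetical, and it is not a step applied in this study.

```python
def normalize_to_last_observed(xy_sequence, n_observed):
    """Shift (x, y) points so the last observed point becomes the origin."""
    if not 1 <= n_observed <= len(xy_sequence):
        raise ValueError("n_observed must be between 1 and the sequence length")
    x0, y0 = xy_sequence[n_observed - 1]
    return [(x - x0, y - y0) for x, y in xy_sequence]


# Hypothetical raw map coordinates with a large offset; two steps are observed.
raw = [(33000.0, -40.0), (33001.0, -40.5), (33002.0, -41.0)]
relative = normalize_to_last_observed(raw, n_observed=2)
```

After the shift, the values are on the scale of meters travelled rather than absolute map coordinates, which removes the large offsets discussed for the coordinate-based model.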

      Table 4.  Model performance for different feature combinations

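      | Feature selection | Feature dimension | Datasets | Accuracy | Recall |
      |---|---|---|---|---|
      | Longitudinal Speed, Lateral Speed, Heading | 3 | Training Set | 0.81 | 0.80 |
      | Longitudinal Speed, Lateral Speed, Heading | 3 | Testing Set | 0.79 | 0.79 |
      | Longitudinal Speed, Lateral Speed, Heading | 3 | Validation Set | 0.77 | 0.75 |
      | Longitudinal Speed, Lateral Speed, Heading, Vehicle Coordinates | 5 | Training Set | 0.62 | 0.60 |
      | Longitudinal Speed, Lateral Speed, Heading, Vehicle Coordinates | 5 | Testing Set | 0.64 | 0.63 |
      | Longitudinal Speed, Lateral Speed, Heading, Vehicle Coordinates | 5 | Validation Set | 0.54 | 0.54 |
      | Longitudinal Speed, Lateral Speed, Heading, VOS | 11 | Training Set | 0.85 | 0.79 |
      | Longitudinal Speed, Lateral Speed, Heading, VOS | 11 | Testing Set | 0.81 | 0.75 |
      | Longitudinal Speed, Lateral Speed, Heading, VOS | 11 | Validation Set | 0.79 | 0.74 |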
    • Predictions of vehicle trajectories and behaviors depend on historical information. Long-term trajectory and maneuver prediction remains a challenge. According to current practice in related research[66], the prediction accuracy decays as the prediction horizon increases; that is, when the historical information used for prediction lies far before the decision point, it is hard to make the right prediction. In Xing et al.[66], the prediction horizon varies from 0.5 to 3.5 s. To examine the impact of the horizon on the proposed method, this section conducts a sensitivity analysis on horizons from 0.4 to 7.4 s.

      The analysis results are shown in Fig. 9. Both the testing accuracy and recall show decreasing trends as the prediction horizon increases. It can be expected that as the prediction horizon increases, the model accuracy approaches 0.5, which corresponds to a naïve random guess on a 1:1 balanced dataset.
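One way to set up samples for different prediction horizons is sketched below. It assumes that the input window contains `seq_len` consecutive steps (4 in Table 2) and that its last step lies `horizon_s` seconds before the labelled step; the text does not spell out the exact windowing, so this layout and the function name are assumptions.

```python
SAMPLE_RATE_HZ = 10  # track sampling rate stated in the text


def make_sample(features, labels, label_step, horizon_s, seq_len=4):
    """Return (input_window, label), or None if the window does not fit the track."""
    horizon_steps = round(horizon_s * SAMPLE_RATE_HZ)
    last_input = label_step - horizon_steps   # last step the model may see
    start = last_input - seq_len + 1
    if start < 0 or label_step >= len(labels):
        return None
    return features[start:last_input + 1], labels[label_step]


# Hypothetical 91-step track whose "features" are just the step indices.
steps = list(range(91))
step_labels = [0] * 90 + [1]
short_horizon = make_sample(steps, step_labels, label_step=90, horizon_s=0.4)
long_horizon = make_sample(steps, step_labels, label_step=90, horizon_s=7.4)
```

Under this layout, a longer horizon pushes the input window further back in time from the labelled step, which is the mechanism behind the decay in accuracy discussed above.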

      Figure 9. 

      Testing accuracy and recall for different prediction horizons.

    • Lane changing behaviors are rare vehicle operations compared with lane keeping. As introduced previously, lane changing accounts for only a small part of the vehicle operations captured. In model training, the accuracy can be biased by the true negative observations (lane keeping). One common way to alleviate this issue is to balance the training dataset[67], using over-sampling or under-sampling to obtain a 1:1 training dataset of True Positive (TP) and True Negative (TN) observations. To investigate the possible influence of data balancing, this research composes training datasets with different TN/TP ratios and tests the model performance on a testing set with the same ratio.

      Figure 10 shows the testing accuracy and recall after training the model with the given ratio. The ratio increases from 0.1 to 10 with a step size of 0.1. As the ratio increases, the accuracy drops over the first several steps and increases afterward, while the recall keeps decreasing. Figure 10 also shows the TN/TP ratio curve, which overlaps with the accuracy curve beyond a ratio of about 3. This is caused by the loss function design. Because the BCE loss is used for training, it accounts for the prediction correctness of both TP and TN observations. However, as the number of TN observations increases, the correctness of TN dominates the model training and therefore leads to a low recall.
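The ratio experiment can be illustrated with a small under-sampling sketch. It keeps all lane changing observations and draws lane keeping observations to reach a target TN/TP ratio; the function names, the seed, and the choice of under-sampling (rather than over-sampling) are assumptions. The second helper gives the accuracy of a trivial model that always predicts lane keeping, which is a useful reference when reading an accuracy curve at high ratios.

```python
import random


def sample_by_ratio(positives, negatives, tn_tp_ratio, seed=0):
    """Keep all positives and draw negatives without replacement to hit the ratio."""
    n_neg = round(len(positives) * tn_tp_ratio)
    if n_neg > len(negatives):
        raise ValueError("not enough negative observations for this ratio")
    rng = random.Random(seed)  # seed fixed only for reproducibility
    return list(positives), rng.sample(list(negatives), n_neg)


def majority_baseline_accuracy(tn_tp_ratio):
    """Accuracy of always predicting lane keeping at a given TN/TP ratio."""
    return tn_tp_ratio / (1.0 + tn_tp_ratio)


# Hypothetical pools of 100 lane changes and 1,000 lane-keeping observations.
pos, neg = sample_by_ratio(range(100), range(1000), tn_tp_ratio=3)
baseline_at_3 = majority_baseline_accuracy(3.0)
```

At a ratio of 3, always predicting lane keeping already reaches an accuracy of 0.75 with a recall of zero, which shows why accuracy alone becomes uninformative as the ratio grows.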

      Figure 10. 

      Testing accuracy and recall for different data balancing.

    • The Waymo motion dataset contains comprehensive information about vehicle kinematics and the driving environments. However, due to the limited computational resources available to the authors, this study used only part of the data and information for modeling. This research employed the vanilla LSTM as the model framework, which is not the most up-to-date deep learning backbone for sequential data. Advanced model frameworks such as Transformers and Graph Neural Networks could be used for lane changing behavior prediction. In addition, some of the limitations come from the Waymo dataset. The motion data of the surrounding vehicles are collected by camera and Lidar sensors, which are limited to a range around the ego car. As a result, the tracks of surrounding objects are interrupted when the objects are occluded or out of range. Only a portion of the objects have a complete and consistent track. This leads to sparsity in the valid tracks and, furthermore, in the lane changing behaviors.

    • With the development of autonomous vehicle technology, more real-world autonomous driving test data are becoming available for research. The research of developing methods to predict instantaneous vehicle maneuvers and to anticipate vehicle trajectories or motions is gaining growing interest among both transportation researchers and industry innovators. Predicting vehicle maneuvers such as merges, lane changing and turns requires the understanding of the surrounding static and dynamic environments. With the open motion dataset provided by Waymo, this study proposed a framework to explore autonomous driving data and investigate vehicle maneuvers, specifically lane change behaviors. In this framework, this study develops a Long Short-Term Memory (LSTM) model to predict lane changing behaviors to support the automated driving decision making. A concept of Vehicle Operating Space (VOS) is introduced to measure the possible space for vehicle maneuvers. The features are compiled as tensors for the prediction. This study used the vanilla LSTM as the backbone of the prediction model.

      The proposed model shows fair performance on lane changing prediction. With a prediction horizon of 0.4 s and a balanced training dataset, the model achieves an accuracy of 0.81 and a recall of 0.75 on the testing dataset. To examine the robustness of this model, a series of sensitivity analyses are conducted on three key parameters: (1) feature selection, (2) prediction horizon, and (3) training dataset balancing. As shown in the discussion, introducing the VOS can increase the speed of loss decay and achieve higher accuracy and recall. In comparison, involving the raw vehicle trajectories may make it difficult for the model to converge. Though deep learning is heading toward end-to-end training and prediction, in some cases manually crafted metrics (such as the proposed VOS) may help achieve better model performance. The test on different prediction horizons shows that as the horizon increases, the model performance worsens and approaches the result of a naïve random guess. Balancing the training data is one common way to deal with rare event prediction (e.g., lane changing behavior in this study). This study raises a discussion on the sampling ratio of true positive over true negative. By testing different ratios, it is found that as the ratio decreases, the accuracy is gradually dominated by the majority of the observations, which are lane keeping. The recall keeps decreasing, which indicates that the model is not effective in making the right prediction on lane changing. In other words, inappropriate training data balancing could bias the results.

      This study develops a methodological framework to explore and predict lane changing behaviors using the Waymo motion dataset. This research provides an extended discussion on several critical issues affecting the model performance. However, the design of the proposed sensitivity analysis is not without limitations. Continuing research is needed to improve both the prediction model design and the parameter tests. Future studies are expected to incorporate more dynamics (refined object tracks) and statics (road geometrics) into the model and to conduct training and testing using a larger dataset.

      • The authors are grateful for the support from the Alabama Transportation Institute and Center for Transportation Operations, Planning and Safety at the University of Alabama. The data were obtained from the Waymo Open Dataset. Python, QGIS, and the deep learning toolkit PyTorch were used for the data processing, visualization and modeling. The views expressed in this paper are those of the authors, who are responsible for the facts and accuracy of the information presented herein.

      • The authors declare that they have no conflict of interest.

      • Copyright: © 2023 by the author(s). Published by Maximum Academic Press, Fayetteville, GA. This article is an open access article distributed under Creative Commons Attribution License (CC BY 4.0), visit https://creativecommons.org/licenses/by/4.0/.
  • About this article
    Cite this article
    Fu X, Liu J, Huang Z, Hainen A, Khattak AJ. 2023. LSTM-based lane change prediction using Waymo open motion dataset: The role of vehicle operating space. Digital Transportation and Safety 2(2):112−123 doi: 10.48130/DTS-2023-0009