An evolutionary game theory-based machine learning framework for predicting mandatory lane change decision

Sixuan Xu; Mengyun Li; Wei Zhou; Jiyang Zhang; Chen Wang; Sixuan Xu; Mengyun Li; Wei Zhou; Jiyang Zhang; Chen Wang

doi:10.48130/dts-0024-0011

Mandatory lane change (MLC) is likely to cause traffic oscillations, which have a negative impact on traffic efficiency and safety. There is a rapid increase in research on mandatory lane change decision (MLCD) prediction, which can be categorized into physics-based models and machine-learning models. Both types of models have their advantages and disadvantages. To obtain a more advanced MLCD prediction method, this study proposes a hybrid architecture, which combines the Evolutionary Game Theory (EGT) based model (considering data efficient and interpretable) and the Machine Learning (ML) based model (considering high prediction accuracy) to model the mandatory lane change decision of multi-style drivers (i.e. EGTML framework). Therefore, EGT is utilized to introduce physical information, which can describe the progressive cooperative interactions between drivers and predict the decision-making of multi-style drivers. The generalization of the EGTML method is further validated using four machine learning models: ANN, RF, LightGBM, and XGBoost. The superiority of EGTML is demonstrated using real-world data (i.e., Next Generation SIMulation, NGSIM). The results of sensitivity analysis show that the EGTML model outperforms the general ML model, especially when the data is sparse.

HTML

Introduction

Mandatory lane change (MLC) refers to the behavior that the driver must change the current lane to the expected lane in some places due to traffic regulations or his/her driving needs. MLC usually occurs in expressway weaving areas, on and off ramps, and the entrance to intersections. Compared with discretionary lane changing (DLC, e.g., the lane changing behavior taken by the drivers to improve the current driving environment), MLC is more likely to cause traffic oscillations, which have a negative impact on traffic efficiency and safety^[1,2]. Therefore, analyzing, modeling, and predicting mandatory lane-changing behavior is important for improving road traffic safety and efficiency.

In the past decade, there has been a rapid increase in research on lane change modeling, especially on mandatory lane change decision (MLCD) prediction^[3−5]. MLCD models can be categorized into two types, physics-based models and machine-learning models. Early physics-based MLCD models started from the classic rule-based models (e.g., Gipps^[6], MITSIM^[7], MOBIL^[8]), and utility-based models^[9], which imitated human drivers' activities towards lane-changing. However, challenging function expressions and complicated parameters make these models more difficult to calibrate and validate. The lane-changing process involves dynamic interaction between drivers, that is, one driver pays the cost (e.g., speed, space) and the other driver benefits from it (e.g., acceleration, lane change). Game theory (GT), one of the most frequent applications of simulating the process of human competitive and cooperative behaviors, can better describe the interaction between drivers. Thus, there have been many MLCD models integrated with GT^[10,11], which are at the forefront of MLCD research. Evolutionary Game Theory (EGT) presents the objective of dynamically describing the competition and cooperation between human. MLCD models based on EGT can explain the progressive cooperative interactions of drivers. The parameters in the physics-based models have physical meaning, so the model is highly interpretable. However, the models only include a subset of the significant factors of MLCD and ignore the rest of the potential factors, so the prediction accuracy is low. Machine learning (ML) models focus on learning lane-changing behavior from vehicle-related data (e.g., dynamic and trajectory data). Due to the complexity of influencing factors of MLCD, ML models are gradually being applied to MLCD modeling^[12,13]. In addition, the effect of the driving style on MLCD was also considered in the modeling process^[14]. In general, the prediction accuracy of MLCD by ML models is high, but the models have high requirements on data quality and quantity, and low robustness. Besides, the model lacks interpretability, in other words, the model cannot explain how the driving behavior evolves as traffic environment changes.

Recently, modeling methods that combine physics-based models and machine learning models are gaining popularity in balancing prediction accuracy and the interpretability in the engineering field^[15,16]. In machine learning models' loss functions, physics information is usually encoded as governing equations, physical constraints, or regularity terms. In the field of traffic, the application of this method is not extensive enough, and it is currently limited to traffic state prediction and car-following (CF) behavior modeling. Shi et al. utilize a neural network to encode the traffic flow model for traffic state estimation^[17]. They observed that the proposed Physics-informed Deep learning (PIDL) approach has the capability of making precise and timely TSE even with sparse input. Yuan et al. transformed the physical knowledge in the traditional car-following model into a physical regularize of multivariate Gaussian processes to predict the drivers' car-following behaviors^[18]. The results demonstrated that the proposed method outperforms the previous methods in estimation precision. Mo et al.^[19] designed a physics-informed deep learning car-following model (PIDL-CF) architecture and utilized two neural network models: ANN and LSTM to further validate the generalization of the PIDL method. The results showed the superior performance of physics- informed methods over those without physical information. Masmoudi et al. propose an autonomous vehicle following framework that involves using leading vehicle detecting based on You Look Once version 3 (YOLOv3) and implementing vehicle following using reinforcement learning-based algorithms^[20]. This method, which combines physical models with machine learning, shows considerable advantages in terms of effectiveness. In all, physics-informed methods can overcome the challenges of training data-hungry machine learning models, particularly arising from limited data and imperfect data (e.g., missing data, outliers, noisy data).

To obtain a more predictive and explainable MLCD model that can depict the driving behavior of the interacted drivers with different driving styles, this study is aimed to develop an evolutionary game theory-based machine learning model (EGTML). The model prediction result is output by the machine learning model which is informed by the EGT-based physics model. The main contributions of this paper are as follows:

(1) Design an EGTML architecture to model the mandatory lane change decision of multi-style drivers, which combines the physics-based model (data efficient and interpretable) and the machine learning model (high prediction accuracy).

(2) Demonstrate the generalization of EGTML methods by using four different ML methods: ANN, RF, LightGBM, and XGBoost. The results showed that EGTML holds the potential to maintain high prediction accuracy and enhance the data-efficiency of training by incorporating physical knowledge.

(3) Demonstrate the superiority of EGTML on real-world data. The results showed that the proposed hybrid paradigm outperforms the general machine learning model across various training data, especially when the data is sparse.

Multi-style driver clustering

Since there are significant differences in driving behaviors of drivers with different styles, it is necessary to accurately model the lane-changing behaviors of drivers with different styles. This paper established a multi-style driver clustering model based on the Gaussian mixture model (GMM)^[21].

Preliminary of GMM

Gaussian mixture model (GMM) is a linear combination of multiple single Gaussian models. If the d-dimensional vector x obeys the Gaussian mixture distribution, its probability density function is defined as:

$ f_{M}(x)={\mathop\sum\nolimits_{i=1}^{k}} \alpha_{i} \times f\left(x \mid \mu_{i}, \Sigma_{i}\right) $

(1)

where, $ \alpha_{i} $ is the mixing coefficient, $ f\left(x \mid \mu_{i}, \Sigma_{i}\right) $ is the probability density function of the i-th Gaussian distribution, its equation is as follows:

$ f\left(x \mid \mu_{i}, \Sigma_{i}\right)=\dfrac{1}{(2 \pi)^{\frac{d}{2}}|\Sigma|^{\frac{1}{2}}} E X P\left[-\dfrac{1}{2}\left(x-\mu_{i}\right)^{T} \Sigma_{i}^{-1}(x-\mu i)\right] $

(2)

where, $ \mu_{i} $ is the d-dimensional mean vector, $ \Sigma_{i} $ is the d × d-dimensional covariance matrix. The main parameters of GMM are $ \left\{\left(a_{i}, \mu_{i}, \Sigma_{i}\right) \mid i=1,2, \ldots, k\right\} $. The Expectation-Maximum (EM) algorithm^[22] is the common solution algorithm to obtain the optimal parameters. The EM algorithm continuously updates the parameters in the iterative process until the termination condition is satisfied.

Multi-style driver clustering

During the operation of the vehicle by different styles of drivers, the operating parameters of the vehicle are different, which are intuitively reflected in the changes in parameters such as speed and acceleration^[23]. The vehicle operating parameters can be obtained from the vehicle trajectory data. To consider the impact of the traffic operation state on drivers, define the ratio of vehicle speed to the space average speed as the speed ratio r to replace speed, the calculation formula is as follows:

$ r_{i}=\dfrac{v_{i}}{\overline{v}_{s}} $ (3)

$ \overline{v}_{s}=\dfrac{\sum_{i=1}^{n} v_{i}}{n} $ (4)

where, $ \overline{v}_{b} $ is the space average speed, n is the number of vehicles, and v_i is the speed of i^th vehicle. Based on the speed ratio r and the acceleration a, the driving style feature vector is constructed $ \{E(r), V A R(r), E(a)\} $. The feature vector is brought into the GMM and the EM algorithm is used to obtain the optimal model parameters. Then, the vehicles are divided into k-clusters, corresponding to different driving styles.

Conclusions

This paper develops an evolutionary game theory-based machine learning mandatory lane change decision model (EGTML). The prediction result is output by the machine learning model which is informed by the EGT-based physical model. This modeling framework holds the potential to maintain high prediction accuracy and enhance the data efficiency of training by incorporating physical knowledge. The generalization of the EGTML method is further validated using four machine learning models: ANN, RF, LightGBM, and XGBoost, and the superiority of EGTML is demonstrated on the NGSIM dataset. Applying the best-performing EGT-LightGBM, and LightGBM to test the parameter sensitivity of EGTML, the results show that the EGTML model outperforms the general ML model, especially when the data is sparse.

To the best of our knowledge, this paper is the first-of-its-kind that employs a hybrid paradigm where a physics-based model is encoded into a machine learning model for mandatory lane-changing decision prediction. Thus, there are still a lot of unresolved research questions. This work will be extended in several directions. (1) More advanced physics-based MLCD models will be encoded into ML models, which may hold the potential to capture more complex lane-changing behaviors. (2) A systematic simulation procedure should be developed for testing the proposed EGTML model and identifying the best physics-based models by deriving some key metrics (e.g., collision rate, conflicting distribution).

Author contributions

The authors confirm contribution to the paper as follows: conceptualization, methodology, draft manuscript preparation: Xu S; software: Xu S, Li M; data curation: Li M; visualization, investigation: Li M, Zhou W, Zhang J; supervision, project administration, funding acquisition: Wang C. All authors reviewed the results and approved the final version of the manuscript.

[1]	Ali Y, Zheng Z, Haque MM, Wang M. 2019. A game theory-based approach for modelling mandatory lane-changing behaviour in a connected environment. Transportation Research Part C: Emerging Technologies 106:220−42 doi: 10.1016/j.trc.2019.07.011 CrossRef Google Scholar
[2]	Fu X, Liu J, Huang Z, Hainen A, Khattak AJ. 2023. LSTM-based lane change prediction using Waymo open motion dataset: the role of vehicle operating space. Digital Transportation and Safety 2:112−23 doi: 10.48130/dts-2023-0009 CrossRef Google Scholar
[3]	Mullakkal-Babu FA, Wang M, van Arem B, Happee R. 2020. Empirics and models of fragmented lane changes. IEEE Open Journal of Intelligent Transportation Systems 1:187−200 doi: 10.1109/ojits.2020.3029056 CrossRef Google Scholar
[4]	Wang Z, Guan M, Lan J, Yang B, Kaizuka T, et al. 2022. Classification of automated lane-change styles by modeling and analyzing truck driver behavior: a driving simulator study. IEEE Open Journal of Intelligent Transportation Systems 3:772−85 doi: 10.1109/OJITS.2022.3222442 CrossRef Google Scholar
[5]	An G, Bae JH, Talebpour A. 2023. An optimized car-following behavior in response to a lane-changing vehicle: a Bézier curve-based approach. IEEE Open Journal of Intelligent Transportation Systems 4:682−89 doi: 10.1109/OJITS.2023.3291177 CrossRef Google Scholar
[6]	Gipps PG. 1986. A model for the structure of lane-changing decisions. Transportation Research Part B: Methodological 20:403−14 doi: 10.1016/0191-2615(86)90012-3 CrossRef Google Scholar
[7]	Yang Q, Koutsopoulos HN. 1996. A Microscopic Traffic Simulator for evaluation of dynamic traffic management systems. Transportation Research Part C: Emerging Technologies 4:113−29 doi: 10.1016/s0968-090x(96)00006-x CrossRef Google Scholar
[8]	Kesting A, Treiber M, Helbing D. 2007. General lane-changing model MOBIL for car-following models. Transportation Research Record: Journal of the Transportation Research Board 1999:86−94 doi: 10.3141/1999-10 CrossRef Google Scholar
[9]	Toledo T, Koutsopoulos HN, Ben-Akiva ME. 2003. Modeling integrated lane-changing behavior. Transportation Research Record: Journal of the Transportation Research Board 1857:30−38 doi: 10.3141/1857-04 CrossRef Google Scholar
[10]	Kita H. 1999. A merging–giveway interaction model of cars in a merging section: a game theoretic analysis. Transportation Research Part A: Policy and Practice 33:305−12 doi: 10.1016/s0965-8564(98)00039-1 CrossRef Google Scholar
[11]	Liu HX, Xin W, Adams ZM, Ban J. 2007. A game theoretical approach for modelling merging and yielding behavior at freeway on-ramp sections. pp. 1−15.
[12]	Hou Y, Edara P, Sun C. 2014. Modeling mandatory lane changing using Bayes classifier and decision trees. IEEE Transactions on Intelligent Transportation Systems 15:647−55 doi: 10.1109/TITS.2013.2285337 CrossRef Google Scholar
[13]	Dou Y, Yan F, Feng D. 2016. Lane changing prediction at highway lane drops using support vector machine and artificial neural network classifiers. 2016 IEEE International Conference on Advanced Intelligent Mechatronics (AIM), Banff, AB, Canada, 12−15 July 2016. USA: IEEE. pp. 901−6. DOI: 10.1109/AIM.2016.7576883
[14]	Li X, Wang W, Roetting M. 2019. Estimating driver's lane-change intent considering driving style and contextual traffic. IEEE Transactions on Intelligent Transportation Systems 20:3258−71 doi: 10.1109/TITS.2018.2873595 CrossRef Google Scholar
[15]	Yang Y, Perdikaris P. 2019. Adversarial uncertainty quantification in physics-informed neural networks. Journal of Computational Physics 394:136−52 doi: 10.1016/j.jcp.2019.05.027 CrossRef Google Scholar
[16]	Raissi M, Wang Z, Triantafyllou MS, Karniadakis GE. 2019. Deep learning of vortex-induced vibrations. Journal of Fluid Mechanics 861:119−37 doi: 10.1017/jfm.2018.872 CrossRef Google Scholar
[17]	Shi R, Mo Z, Huang K, Di X, Du Q. 2021. Physics-informed deep learning for traffic state estimation. arXiv Preprint:2101.06580 doi: 10.48550/arXiv.2101.06580 CrossRef Google Scholar
[18]	Yuan Y, Wang Q, Yang XT. 2020. Modeling stochastic microscopic traffic behaviors: a physics regularized Gaussian process approach. arXiv Preprint:2007.10109 doi: 10.48550/arXiv.2007.10109 CrossRef Google Scholar
[19]	Mo Z, Shi R, Di X. 2020. A physics-informed deep learning paradigm for car-following models. Transportation Research Part C: Emerging Technologies 130:103240 doi: 10.1016/j.trc.2021.103240 CrossRef Google Scholar
[20]	Masmoudi M, Friji H, Ghazzai H, Massoud Y. 2021. A reinforcement learning framework for video frame-based autonomous car-following. IEEE Open Journal of Intelligent Transportation Systems 2:111−27 doi: 10.1109/OJITS.2021.3083201 CrossRef Google Scholar
[21]	Dempster AP, Laird NM, Rubin DB. 1977. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society Series B: Statistical Methodology 39:1−22 doi: 10.1111/j.2517-6161.1977.tb01600.x CrossRef Google Scholar
[22]	Yang MS, Lai CY, Lin CY. 2012. A robust EM clustering algorithm for Gaussian mixture models. Pattern Recognition 45:3950−61 doi: 10.1016/j.patcog.2012.04.031 CrossRef Google Scholar
[23]	Sagberg F, Selpi, Bianchi Piccinini GF, Engström J. 2015. A review of research on driving styles and road safety. Human Factors 57:1248−75 doi: 10.1177/0018720815591313 CrossRef Google Scholar
[24]	Taylor PD, Jonker LB. 1978. Evolutionary stable strategies and game dynamics. Mathematical Biosciences 40:145−56 doi: 10.1016/0025-5564(78)90077-9 CrossRef Google Scholar
[25]	Hayward J. 1972. Near-miss determination through use of a scale of danger. Highway Research Record 1:1−2 Google Scholar
[26]	Zheng Y, Han L, Yu J, Yu R. 2023. Driving risk assessment under the connected vehicle environment: a CNN-LSTM modeling approach. Digital Transportation and Safety 2:211−19 doi: 10.48130/dts-2023-0017 CrossRef Google Scholar
[27]	Alexiadis V, Colyar J, Halkias J, Hranac R, McHale G. 2004. The next generation simulation program. ITE Journal 74:22−26 Google Scholar
[28]	Ossen S, Hoogendoorn SP. 2008. Validity of trajectory-based calibration approach of car-following models in presence of measurement errors. Transportation Research Record: Journal of the Transportation Research Board 2088:117−25 doi: 10.3141/2088-13 CrossRef Google Scholar
[29]	Wang Q, Li Z, Li L. 2014. Investigation of discretionary lane-change characteristics using next-generation simulation data sets. Journal of Intelligent Transportation Systems 18:246−53 doi: 10.1080/15472450.2013.810994 CrossRef Google Scholar
[30]	McCulloch WS, Pitts W. 1943. A logical calculus of the ideas immanent in nervous activity. The bulletin of mathematical biophysics 5:115−33 doi: 10.1016/S0092-8240(05)80006-0 CrossRef Google Scholar
[31]	Breiman L. 2001. Random forests. Machine Learning 45:5−32 doi: 10.1023/A:1010933404324 CrossRef Google Scholar
[32]	Ke G, Meng Q, Finley T, Wang T, Chen W, et al. 2017. LightGBM: A Highly Efficient Gradient Boosting Decision Tree. 31 ^st Conference on Neural Information Processing Systems (NiPs 2017), Long Beach, California, USA, 4−9 December, 2017. pp. 3149−57. DOI: 10.5555/3294996.3295074
[33]	Chen T, Guestrin C. 2016. XGBoost: A Scalable Tree Boosting System. Proceedings of the 22 ^nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. San Francisco California USA. USA: Association for Computing Machinery (ACM). pp. 785–94. https://doi.org/10.1145/2939672.2939785

Game players		SV
Game players		Lane change	No lane change
TB	Yield	P₁₁ : α₁TTC + β₁L	P₂₁ : −β₁L
	Yield	Q₁₁ : α₂TTC − β₂Δv	Q₂₁ : −β₂Δv
	No yield	P₁₂ : −α₁TTC	P₂₂ : −β₁L
	No yield	Q₁₂ : β₂Δv − α₂TTC	Q₂₂ : β₂Δv

(x₁, x₂)	$ F_{C V}^{\prime}\left(x^{*}\right) $	$ f_{T B}^{\prime}\left(x^{*}\right) $	Stability
(0, 0)	C-G	F-H	Determined by the payoff matrix
(0,1)	A-E	H-F
(1,0)	G-C	B-D
(1,1)	E-A	D-B
$\left( \dfrac{H-F}{B+H-D-F}, \dfrac{G-C}{A+G-C-E}\right) $	0	0	Unstable solution

Symbol	Meaning	Unit
V_OV, V_OF, V_OB, V_TP, V_TH	The speed of the vehicle	m/s
A_OV, A_GF, A_CB, A_TF, A_TB	The acceleration of the vehicle	m/s²
ΔV_CF, ΔV_CB, ΔV_TF, ΔV_TB	The speed difference between vehicles	m/s
G_CF, G_CB, G_TF, G_TB	The gap between vehicles	m
TTC_CF, TTC_CB, TTC_TF, TTC_TB	The TTC between vehicles	s
L	The distance of SV to the end of MLC	m
$ \overline{v}_{s} $	Space average speed	m/s

Category	1	2	3	4
α₁	0.98	0.99	0.96	0.97
β₁	0.02	0.01	0.04	0.03
α₂	0.8	0.9	0.8	0.85
β₂	0.2	0.1	0.2	0.15
$ T T C _{TF}^{\min} $	6.25	6.25	6.25	6.25
$ T T C _{TB}^{\min}$	6.25	6.25	6.25	6.25

{{lists.name}}

An evolutionary game theory-based machine learning framework for predicting mandatory lane change decision

Abstract

Rights and permissions

References

About this article

Cite this article

Article Metrics

Access History

Other Articles By Authors