Driving risk assessment under the connected vehicle environment: a CNN-LSTM modeling approach

Figures (7) Tables (6)

Figure 1.
A drone recording the motion of vehicles along a 420-meter stretch of highway from an overhead perspective^[38].
Figure 2.
The illustration of spatial range from a top view.
Figure 3.
The detailed and simplified top views for each moment with the subject vehicle in the center. (a) is an illustration of the spatial range, (b) is a zoomed-in version of the study area, and (c) is a simplified version as input to the model.
Figure 4.
The time series top view set describing an event.
Figure 5.
The structure of CNN model.
Figure 6.
The basic unit of LSTM model^[43].
Figure 7.
The proposed CNN-LSTM model.

Modeling process	Parameters with values
Input of CNN	75 top views with both the front and rear of the subject vehicle: 360 × 30 (or 75 top views with only the front of the subject vehicle: 360 × 60)
Convolution layer	No. of layers: 4 No. of kernels: 32, 64,128, and 256 Kernel size: (5 × 5), (3 × 3), (3 × 3), and (3 × 3) Stride: (2,2), (2,2), (2,2), and (2,2) Padding: (0,0), (0,0), (0,0), and (0,0) Activating function: ReLU
Pooling layer	No. of layers: 1 Kernel size: (2 × 2) Stride: (2,2) Padding: (0,0)
Fully connected layer	No. of layers: 2 Hidden neurons: 512 and 256 Activating function: ReLU
Output of CNN model/ Input of LSTM model	No. of features: 64 (for each top view) 75 top views for each event
LSTM	No. of layers: 3 Hidden neurons: 512, 512, 512
Fully connected layer	No. of layers: 1 Hidden neurons: 256 Activating function: ReLU
Output of LSTM model	Binary classification result: high-risk or non-high-risk
Training process	Backpropagation Learning rate: StepLR (lr = 1e-3, γ = 0.3) Loss function: Cross-entropy Mini-batch size: 128 Epochs: 50

Table 1.

Parameters with values in the CNN-LSTM modeling process.

	Spatial range	Temporal range
1	100 m in both the front and rear of the subject vehicle	5 s to 2 s before the zero time
2		5 s to 3 s before the zero time
3		4 s to 2 s before the zero time
4	Only 100 m in the front of the subject vehicle	5 s to 2 s before the zero time
5		5 s to 3 s before the zero time
6		4 s to 2 s before the zero time

Table 3.

Descriptions of the six experiment designs.

Model	Variable dimension	Time-series variability consideration	Sensitivity	False alarm rate	AUC
Random Forest^[44]	Cross-section	No	0.517	0.088	0.827
RPLR model^[40]	Single vehicle	Yes	0.980	0.060	0.960
CNN-LSTM model (this study)	Single vehicle	Yes	0.996	0.065	0.997

Table 6.

Comparation results of modeling performance based on testing data.