ARTICLE   Open Access

Short-term inbound rail transit passenger flow prediction based on BILSTM model and influence factor analysis
References

[1] Jiang W, Zhang H, Long Y, Chen J, Sui Y, et al. 2021. GPS data in urban online ride-hailing: The technical potential analysis of demand prediction model. Journal of Cleaner Production 279:123706. doi: 10.1016/j.jclepro.2020.123706
[2] Ke J, Feng S, Zhu Z, Yang H, Ye J. 2021. Joint predictions of multi-modal ride-hailing demands: A deep multi-task multi-graph learning-based approach. Transportation Research Part C: Emerging Technologies 127:103063. doi: 10.1016/j.trc.2021.103063
[3] Rahman MH, Rifaat SM. 2021. Using spatio-temporal deep learning for forecasting demand and supply-demand gap in ride-hailing system with anonymised spatial adjacency information. IET Intelligent Transport Systems 15:941−57. doi: 10.1049/itr2.12073
[4] Zhang D, Xiao F, Shen M, Zhong S. 2021. DNEAT: A novel dynamic node-edge attention network for origin-destination demand prediction. Transportation Research Part C: Emerging Technologies 122:102851. doi: 10.1016/j.trc.2020.102851
[5] Elman JL. 1991. Distributed representations, simple recurrent networks, and grammatical structure. Machine Learning 7:195−225. doi: 10.1007/BF00114844
[6] Rumelhart DE, Hinton GE, Williams RJ. 1986. Learning representations by back-propagating errors. Nature 323:533−36. doi: 10.1038/323533a0
[7] Schmidhuber J. 2015. Deep learning in neural networks: An overview. Neural Networks 61:85−117. doi: 10.1016/j.neunet.2014.09.003
[8] Yang D, Chen K, Yang M, Zhao X. 2019. Urban rail transit passenger flow forecast based on LSTM with enhanced long-term features. IET Intelligent Transport Systems 10:1475−82. doi: 10.1049/iet-its.2018.5511
[9] Zhang J, Chen F, Shen Q. 2019. Cluster-based LSTM network for short-term passenger flow forecasting in urban rail transit. IEEE Access 7:147653−71. doi: 10.1109/ACCESS.2019.2941987
[10] Yang X, Xue Q, Ding M, Wu J, Gao Z. 2021. Short-term prediction of passenger volume for urban rail systems: A deep learning approach based on smart-card data. International Journal of Production Economics 231:107920. doi: 10.1016/j.ijpe.2020.107920
[11] Ibrahim A, Hall F. 1994. Effect of adverse weather conditions on speed-flow-occupancy relationships. Transportation Research Record 1994:184−91
[12] Brilon W, Ponzlet M. 1996. Variability of speed-flow relationships on German autobahns. Transportation Research Record 1555:91−98. doi: 10.1177/0361198196155500112
[13] Agarwal M, Maze T, Souleyrette R. 2005. Impacts of weather on urban freeway traffic flow characteristics and facility capacity. Proceedings of the 2005 Mid-Continent Transportation Research Symposium, Ames, Iowa, August 2005. pp. 1121−34
[14] Zhang D, Kabuka MR. 2018. Combining weather condition data to predict traffic flow: a GRU-based deep learning approach. IET Intelligent Transport Systems 12:578−85. doi: 10.1049/iet-its.2017.0313
[15] Li G, Yang Y, Qu X. 2020. Deep learning approaches on pedestrian detection in hazy weather. IEEE Transactions on Industrial Electronics 67:8889−99. doi: 10.1109/TIE.2019.2945295
[16] Liu L, Chen RC. 2017. A novel passenger flow prediction model using deep learning methods. Transportation Research Part C: Emerging Technologies 84:74−91. doi: 10.1016/j.trc.2017.08.001
[17] Hou Y, Deng Z, Cui H. 2021. Short-term traffic flow prediction with weather conditions: Based on deep learning algorithms and data fusion. Complexity 2021:6662959. doi: 10.1155/2021/6662959
[18] Liu L, Chen R, Zhu S. 2020. Impacts of weather on short-term metro passenger flow forecasting using a deep LSTM neural network. Applied Sciences 10:2962. doi: 10.3390/app10082962
[19] Zhang S, Zhang J, Yang L, Yin J, Gao Z. 2022. Spatial-temporal attention fusion network for short-term passenger flow prediction on holidays in urban rail transit systems. arXiv:2203.00007
[20] Yang J, Liu T, Li C, Tong W, Zhu Y, et al. 2021. MGSTCN: A multi-graph spatio-temporal convolutional network for metro passenger flow prediction. 2021 7th International Conference on Big Data Computing and Communications (BigCom), Deqing, China, 2021. pp. 164−71. USA: IEEE. doi: 10.1109/BigCom53800.2021.00050
[21] Zhu H, Yang X, Wang Y. 2018. Prediction of daily entrance and exit passenger flow of rail transit stations by deep learning method. Journal of Advanced Transportation 2018:6142724. doi: 10.1155/2018/6142724
[22] Ling X, Huang Z, Wang C, Zhang F, Wang P. 2018. Predicting subway passenger flows under different traffic conditions. PLoS One 13:e0202707. doi: 10.1371/journal.pone.0202707
[23] Zhu K, Xun P, Li W, Li Z, Zhou R. 2019. Prediction of passenger flow in urban rail transit based on big data analysis and deep learning. IEEE Access 7:142272−79. doi: 10.1109/ACCESS.2019.2944744
[24] Guo J, Xie Z, Qin Y, Jia L, Wang Y. 2019. Short-term abnormal passenger flow prediction based on the fusion of SVR and LSTM. IEEE Access 7:42946−55. doi: 10.1109/ACCESS.2019.2907739
[25] Guo Z, Zhao X, Chen Y, Wu W, Yang J. 2019. Short-term passenger flow forecast of urban rail transit based on GPR and KRR. IET Intelligent Transport Systems 13:1374−82. doi: 10.1049/iet-its.2018.5530
[26] Li D, Cao J, Li R, Wu L. 2020. A spatio-temporal structured LSTM model for short-term prediction of origin-destination matrix in rail transit with multisource data. IEEE Access 8:84000−19. doi: 10.1109/ACCESS.2020.2991982
[27] Xue F, Yao E, Huan N, Li B, Liu S. 2020. Prediction of urban rail transit ridership under rainfall weather conditions. Journal of Transportation Engineering, Part A: Systems 146:4020061. doi: 10.1061/jtepbs.0000383
[28] Liu Q, Guo Q, Wang W, Zhang Y, Kang Q. 2021. An automatic detection algorithm of metro passenger boarding and alighting based on deep learning and optical flow. IEEE Transactions on Instrumentation and Measurement 70:5006613. doi: 10.1109/TIM.2021.3054627
[29] Jing Y, Hu H, Guo S, Wang X, Chen F. 2021. Short-term prediction of urban rail transit passenger flow in external passenger transport hub based on LSTM-LGB-DRS. IEEE Transactions on Intelligent Transportation Systems 22:4611−21. doi: 10.1109/TITS.2020.3017109
[30] Liu D, Wu Z, Sun S. 2022. Study on subway passenger flow prediction based on deep recurrent neural network. Multimedia Tools and Applications 81:18979−92. doi: 10.1007/s11042-020-09088-x
[31] He Y, Li L, Zhu X, Tsui KL. 2022. Multi-graph convolutional-recurrent neural network (MGC-RNN) for short-term forecasting of transit passenger flow. IEEE Transactions on Intelligent Transportation Systems 23:8155−74. doi: 10.1109/TITS.2022.3150600
[32] Mudashiru RB, Sabtu N, Abdullah R, Saleh A, Abustan I. 2022. A comparison of three multi-criteria decision-making models in mapping flood hazard areas of Northeast Penang, Malaysia. Natural Hazards 112:1903−39. doi: 10.1007/s11069-022-05250-w
[33] Wang F, Huang GH, Fan Y, Li YP. 2020. Robust subsampling ANOVA methods for sensitivity analysis of water resource and environmental models. Water Resources Management 34:3199−17. doi: 10.1007/s11269-020-02608-2
[34] Yang G, Xu H. 2020. A residual BiLSTM model for named entity recognition. IEEE Access 8:227710−18. doi: 10.1109/ACCESS.2020.3046253
[35] Moayedi H, Osouli A, Nguyen H, Rashid ASA. 2021. A novel Harris hawks' optimization and k-fold cross-validation predicting slope stability. Engineering with Computers 37:369−79. doi: 10.1007/s00366-019-00828-8
[36] Vabalas A, Gowen E, Poliakoff E, Casson A. 2019. Machine learning algorithm validation with a limited sample size. PLoS One 14:e0224365. doi: 10.1371/journal.pone.0224365
[37] Xiong Z, Cui Y, Liu Z, Zhao Y, Hu M, et al. 2020. Evaluating explorative prediction power of machine learning algorithms for materials discovery using k-fold forward cross-validation. Computational Materials Science 171:109203. doi: 10.1016/j.commatsci.2019.109203
[38] Wu W, Liu R, Jin W, Ma C. 2019. Stochastic bus schedule coordination considering demand assignment and rerouting of passengers. Transportation Research Part B: Methodological 121:275−303. doi: 10.1016/j.trb.2019.01.010
[39] Cheng R, Ge H, Wang J. 2017. An extended continuum model accounting for the driver's timid and aggressive attributions. Physics Letters A 381:1302−12. doi: 10.1016/j.physleta.2017.02.018
[40] Sun Y, Ge H, Cheng R. 2019. An extended car-following model considering driver's memory and average speed of preceding vehicles with control strategy. Physica A: Statistical Mechanics and Its Applications 521:752−61. doi: 10.1016/j.physa.2019.01.092
[41] Jiang C, Ge H, Cheng R. 2019. Mean-field flow difference model with consideration of on-ramp and off-ramp. Physica A: Statistical Mechanics and Its Applications 513:465−67. doi: 10.1016/j.physa.2018.09.026
[42] Ma C, Dai G, Zhou J. 2022. Short-term traffic flow prediction for urban road sections based on time series analysis and LSTM_BILSTM method. IEEE Transactions on Intelligent Transportation Systems 23:5615−24. doi: 10.1109/TITS.2021.3055258
[43] Li L, Yang Y, Yuan Z, Chen Z. 2021. A spatial-temporal approach for traffic status analysis and prediction based on Bi-LSTM structure. Modern Physics Letters 35:2150481. doi: 10.1142/s0217984921504819
[44] Yang Y, Yuan Z, Meng R. 2022. Exploring traffic crash occurrence mechanism toward cross-area freeways via an improved data mining approach. Journal of Transportation Engineering, Part A: Systems 148:04022052. doi: 10.1061/jtepbs.0000698

Abstract: Accurate and real-time passenger flow prediction for rail transit is an important part of intelligent transportation systems (ITS). Previous studies show that a single model predicts poorly on datasets whose passenger flow characteristics change sharply, while deep learning models that incorporate influencing factors achieve better accuracy. In order to provide persuasive passenger flow forecasts for ITS, a deep learning model that considers influencing factors is proposed in this paper. Since earlier work selected influencing factors without objective analysis, this paper uses the analytic hierarchy process (AHP) and one-way ANOVA to select the time characteristic factor scientifically, and classifies and weights the hourly passenger flow through the Duncan test. Combining the time weights, a BILSTM-based model that considers hourly travel characteristics is proposed. The model performance is verified on the inbound passenger flow of Ningbo rail transit. The proposed model is compared with several mainstream deep learning algorithms, validating the effectiveness of the BILSTM model that considers influencing factors. Comparison across multiple evaluation indicators and other deep learning models shows that the R2 score of the proposed model reaches 0.968, and its MAE is 45.61% lower than that of the BILSTM model without influencing factors.

    • With rapid economic development, metro penetration is increasing; as the population of a city grows, so does its metro passenger flow[1−4]. With the development of intelligent transportation systems (ITS), passenger flow forecasting has become an important link in rail transit. Rail transit passenger flow prediction can provide scientific data for rail transit operators and related departments, support the effective allocation of resources and manpower, and improve the safety, comfort and economic benefits of the entire transportation system. It can also provide reliable data for relevant departments when handling emergencies. Through wide publicity and reporting of passenger flow forecasts, passengers can choose a public transport mode scientifically and reasonably and avoid peak periods where possible, which improves travel efficiency and relieves pressure on the rail transit system.

      Deep learning is a relatively new research direction in machine learning and is now widely applied in transportation. Before the rise of deep learning, many scholars relied on statistical models such as the Autoregressive Integrated Moving Average (ARIMA) model. Early studies of Recurrent Neural Networks (RNN)[5,6] revealed fairly fundamental reasons why training them is difficult, in particular the problem of long-term dependencies. Long Short-Term Memory (LSTM) neural networks, a special form of RNN reviewed by Schmidhuber[7], address this problem and can learn long-term dependencies. Many further deep learning models followed, such as the Gate Recurrent Unit (GRU) and the Bi-directional LSTM (BILSTM) neural network. With the development of deep learning, studies on rail transit passenger flow prediction have increased in recent years. Some scholars have optimized models to improve prediction accuracy. Yang et al.[8] proposed an enhanced long-term feature LSTM model (ELF-LSTM), which makes full use of the LSTM neural network's ability to process time series and overcomes the limitation on learning long-term dependencies caused by time delay. Zhang et al.[9] proposed a new two-step K-means clustering model that captures both the trend and the characteristics of passenger flow, and then proposed the CB-LSTM model for short-term passenger flow prediction. Yang et al.[10] established an improved spatiotemporal LSTM model (SP-LSTM) for short-term outbound passenger flow prediction at urban rail transit stations; it predicts outbound volume from historical spatiotemporal passenger volume data, the origin-destination (OD) matrix and rail network operation data. Meanwhile, some scholars have proposed deep learning models that consider influencing factors to predict rail transit passenger flow and road traffic flow[11−16]. Hou et al.[17] considered weather factors and proposed a traffic flow prediction framework combining Stacked Auto-Encoders (SAE) and Radial Basis Function (RBF) neural networks, which effectively captures the temporal correlation and periodicity of traffic flow data as well as the interference of weather. Liu et al.[18] combined weather factors (including wind speed) with the LSTM model to predict short-term metro passenger flow; the results show that weather variables have a significant impact on passenger flow. Other scholars[19,20] have used Graph Convolutional Network (GCN) models to predict station passenger flow by considering the spatial topology of the rail network, and the results show that GCN models perform well on spatially structured data.

      Based on the summary of the main literature on rail transit passenger flow prediction in Table 1, it can be seen that few scholars have considered influencing factors when predicting rail transit passenger flow, and the existing studies that do consider influencing factors did not analyze them. This paper uses the analytic hierarchy process, one-way Analysis of Variance (ANOVA) and the Duncan method to analyze the influencing factors. Based on a study and comparison of deep learning models, a BILSTM model that incorporates the influencing factors is used to predict rail transit passenger flow.

      Table 1.  Literature on passenger flow forecast of rail transit.

      Author | Year | Model | Considering factors | Analyzing influencing factors
      Zhu et al.[21] | 2018 | Adam | No | No
      Ling et al.[22] | 2018 | DBSCAN | No | No
      Zhang et al.[9] | 2019 | CB-LSTM | No | No
      Zhu et al.[23] | 2019 | DBN-SVM | No | No
      Guo et al.[24] | 2019 | SVR-LSTM | No | No
      Guo et al.[25] | 2019 | KRR and GPR | No | No
      Li et al.[26] | 2020 | STLSTM | No | No
      Zhang & Kabuka[14] | 2020 | LSTM | Yes | No
      Xue et al.[27] | 2020 | SVR | Yes | No
      Liu et al.[28] | 2021 | MPD | No | No
      Jing et al.[29] | 2021 | LGB-LSTM-DRS | No | No
      Liu et al.[30] | 2022 | DRNN | No | No
      He et al.[31] | 2022 | MGC-RNN | No | No
      This paper | 2022 | BILSTM | Yes | Yes

      The main contributions of this paper are as follows:

      (i) The influencing factors of rail transit inbound passenger flow are analyzed, the factor with the highest weight is selected objectively, and the related weights are set through scientific methods.

      (ii) By comparing the predictions of each deep learning model, the deep learning model with the best predictive performance is selected.

      (iii) A BILSTM model considering the influencing factors is proposed to predict rail transit passenger flow, and its parameters are tuned scientifically through K-fold cross-validation. The actual prediction results verify that the proposed model has good prediction performance and that adding influencing factors improves the prediction accuracy.

    • The data source of this paper is the IC card swiping data of Ningbo rail transit Line 1, Line 2 and Line 3. The data cover the period from September 16, 2019 to September 20, 2019, with more than one million swipes. As shown in Fig. 1, the red line is Ningbo rail transit Line 2, the blue line is Line 1, and the green line is Line 3. Line 2 and Line 3 run north-south as a whole, and Line 1 runs east-west. These lines carry most of the rail transit passenger flow in Ningbo.

      Figure 1. 

      Line map of Ningbo Rail Transit Line 1, Line 2 and Line 3.

      The original IC card swiping data of Ningbo rail transit obtained in this paper are shown in Table 2.

      Table 2.  Data sample table of rail transit IC cards from Ningbo rail transit.

      Card no | Time | Swipe type | Busline | Fee | Stop no
      1057471b906d1eb7 | 2019-09-16 08:16:32 | 0 | 1 | 1.9 | 115
      c5360d16cf44a600 | 2019-09-16 08:12:11 | 1 | 1 | 0 | 125
      04b0dbf510ce4d68 | 2019-09-16 08:14:33 | 1 | 1 | 0 | 119
      8e043c1fcd22c414 | 2019-09-16 08:17:25 | 1 | 1 | 0 | 119
      987b102a48b2cb3a | 2019-09-16 08:16:08 | 0 | 1 | 2.85 | 115
      The first column 'Card no' is the rail transit IC card number; the second column 'Time' is the card swipe time; the third column 'Swipe type' is the type of card swipe; the fourth column 'Busline' is the rail transit line; the fifth column 'Fee' is the rail transit fee; and the last column 'Stop no' is the rail transit stop number.

      The raw data in Table 2 were filtered and analyzed using SQL Server. The specific flow chart is shown in Fig. 2.

      Figure 2. 

      Data analysis flow chart.

      Firstly, we cleaned the data in the SQL database: we re-examined and verified the data, deleted duplicate information, removed invalid records (such as time-confused and missing data), corrected existing errors, and ensured data consistency.

      Secondly, we classified the complete rail transit IC card dataset, selected the rail transit records, and split them by day using the same method to obtain daily datasets. After processing, we analyzed the valid data and, using SQL, aggregated the rail transit IC card records into passenger flow counts for every 5 min, as sketched below.
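A minimal sketch of this aggregation step is shown below, assuming the cleaned swipe records have been exported from SQL Server to a CSV file with the Table 2 columns; the file name and the convention that swipe type 1 marks a station entry are assumptions.

```python
# Sketch of the 5-min aggregation step on the cleaned swipe records.
# Column names follow Table 2; file name and the inbound convention are assumed.
import pandas as pd

records = pd.read_csv("ningbo_swipes_clean.csv",
                      parse_dates=["Time"])          # hypothetical export file

inbound = records[records["Swipe type"] == 1]        # assumed: 1 = station entry

# Count inbound swipes in every 5-minute interval
flow_5min = (inbound.set_index("Time")
                    .resample("5min")
                    .size()
                    .rename("inbound_flow"))

flow_5min.to_csv("inbound_flow_5min.csv")
```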

      Figure 3 shows the total passenger flow over the five working days. The daily totals are basically the same, and the daily travel characteristics are also similar. Figure 4 shows the 5-min passenger flow characteristics of the five-day inbound passenger flow. The characteristics from September 16 to September 19 (Monday to Thursday) are basically similar, but the passenger flow on September 20 (Friday) differs from the previous four days: the Friday evening peak is larger than the morning peak, and the overall passenger flow is smaller than on the previous four days. Therefore, Friday's passenger flow needs to be forecast by specific means.

      Figure 3. 

      Five-day total passenger flow histogram of rail transit through Ningbo.

      Figure 4. 

      Characteristic diagram of passenger flow.

      The data screened using the SQL database are then passed to the next stage for analysis.

    • Because the influencing factors selected in this paper cannot be quantified directly and there is no numerical support for weighting each factor, the analytic hierarchy process is chosen to analyze the influencing factors. The Analytic Hierarchy Process (AHP)[32] is a systematic, hierarchical analysis method combining qualitative and quantitative analysis. Its feature is to use relatively little quantitative information to mathematicise the decision-making thought process, based on in-depth study of the nature of a complex decision problem, its influencing factors and their internal relationships, thereby providing a simple decision-making approach for complex problems with multiple objectives, multiple criteria or no structural characteristics that are difficult to quantify completely.

      When determining the weights among the factors at each level, purely qualitative results are often not easily accepted by others. Therefore, scholars have proposed the consistent matrix method, as follows:

      (1) Factors are compared two at a time rather than all together;

      (2) Relative scales are used to minimize the difficulty of comparing factors of different natures, so as to improve accuracy.

      A pairwise comparison matrix compares the relative importance of all factors in a layer against one factor (criterion or objective) in the layer above. Element $a_{ij}$ of the pairwise comparison matrix represents the result of comparing the i-th factor with the j-th factor. As shown in Table 3, these values are assigned in this paper using the 1−5 scale method.

      Table 3.  Impact classification table[32].

      Scale | Meaning
      1 | Indicates that the two factors are of equal importance
      3 | Indicates that one factor is obviously more important than the other
      5 | Indicates that one factor is strongly more important than the other
      2 and 4 | The median value of the two adjacent judgments above

      First, we draw all the scores as a matrix.

      $ A = \left[ {\begin{array}{*{20}{c}} {{a_{11}}}& \cdots &{{a_{1j}}} \\ \vdots & \ddots & \vdots \\ {{a_{i1}}}& \cdots &{{a_{ij}}} \end{array}} \right] $ (1)

      The unique non-zero eigenvalue of an n-order consistent matrix is n. For an n-order positive reciprocal matrix $A$ (${a_{ij}} > 0$, ${a_{ij}} = \frac{1}{{{a_{ji}}}}$, ${a_{ii}} = 1$) with maximum eigenvalue $\lambda $, A is consistent only if $\lambda = n$.

      Because $\lambda $ depends continuously on ${a_{ij}}$, the more $\lambda $ exceeds $n$, the more inconsistent A is. When the eigenvector corresponding to the maximum eigenvalue is used as the weight vector describing how the compared factors affect the upper-level factor, a greater degree of inconsistency leads to a greater judgment error. Thus, the inconsistency of A can be measured by the value of $\lambda - n$.

      Then, construct the consistency index, and the formula is as follows:

      $ CI = \frac{{\lambda - n}}{{n - 1}} $ (2)

      In order to measure the size of CI, the random consistency index RI is introduced. RI is obtained by randomly constructing 500 pairwise comparison matrices and averaging their consistency indices:

      $ RI = \frac{{C{I_1} + C{I_2} + \cdots + C{I_{500}}}}{{500}} = \frac{{\dfrac{{{\lambda _1} + {\lambda _2} + \cdots + {\lambda _{500}}}}{{500}} - n}}{{n - 1}} $ (3)

      Then, the consistency ratio $CR$ is defined as follows:

      $ CR = \frac{{CI}}{{RI}} $ (4)

      Generally, when $CR$ is less than 0.1, the degree of inconsistency of A is considered to be within the allowable range and the matrix has satisfactory consistency; its normalized eigenvector can then be used as the weight vector W. Otherwise, matrix A must be reconstructed by adjusting ${a_{ij}}$.

      The formula for calculating the weight vector $W$ is as follows:

      $ AW = \lambda W $ (5)
      $ W = {\left[ {\begin{array}{*{20}{c}} {{w_1}}&{{w_2}}&{ \cdots }&{{w_n}} \end{array}} \right]^T} $ (6)

      Among them, ${w_1},{w_2}, \cdots ,{w_n}$ represent the weights of the factors.
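As an illustration, the following sketch reproduces the consistency check and weight calculation of Eqs. (1)−(6) in Python for the expert-1 matrix of Table 4; the RI value of 1.41 for n = 8 is the commonly tabulated value and is an assumption, since the paper derives RI by random simulation.

```python
# Sketch of the AHP consistency check and weight calculation (Eqs. 1-6),
# using the first expert's pairwise matrix from Table 4.
import numpy as np

A = np.array([
    [1,   3,   5,   4,   4,   4,   5,   4  ],
    [1/3, 1,   4,   1/3, 1/3, 1/3, 2,   3  ],
    [1/5, 1/4, 1,   1/2, 1/3, 1/3, 1,   1  ],
    [1/4, 3,   2,   1,   1,   2,   3,   3  ],
    [1/4, 3,   3,   1,   1,   1,   4,   3  ],
    [1/4, 3,   3,   1/2, 1,   1,   3,   2  ],
    [1/5, 1/2, 1,   1/3, 1/4, 1/3, 1,   1/2],
    [1/4, 1/3, 1,   1/3, 1/3, 1/2, 2,   1  ],
])

eigvals, eigvecs = np.linalg.eig(A)
k = np.argmax(eigvals.real)
lam = eigvals.real[k]                       # maximum eigenvalue
w = np.abs(eigvecs[:, k].real)
w = w / w.sum()                             # normalized weight vector (Eq. 6)

n = A.shape[0]
CI = (lam - n) / (n - 1)                    # Eq. (2)
RI = 1.41                                   # tabulated value for n = 8 (assumption)
CR = CI / RI                                # Eq. (4)
print(lam, CI, CR)                          # roughly 8.57, 0.082, 0.058 (cf. Table 7, Result 1)
print(np.round(w, 4))                       # the first factor receives the largest weight
```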

      Based on the survey, this paper summarizes eight factors that will affect the inbound passenger flow of rail transit. From Fig. 5, the eight factors are divided into three categories: human factors, environmental factors and policy factors. Human factors include commuting travel, entertainment travel and family income. Environmental factors include weather conditions, road congestion and the distance from OD (Origin and Destination) to the subway station. Policy factors include publicity policies and preferential ticket policies.

      Figure 5. 

      Influencing factors of rail transit passenger flow.

      Based on the above AHP method, this paper invites three experts to rate the eight summarized factors using the expert survey method. The scoring results are shown in Tables 4−6.

      Table 4.  Expert scoring table.

      Factor | A | B | C | D | E | F | G | H
      A | 1 | 3 | 5 | 4 | 4 | 4 | 5 | 4
      B | 1/3 | 1 | 4 | 1/3 | 1/3 | 1/3 | 2 | 3
      C | 1/5 | 1/4 | 1 | 1/2 | 1/3 | 1/3 | 1 | 1
      D | 1/4 | 3 | 2 | 1 | 1 | 2 | 3 | 3
      E | 1/4 | 3 | 3 | 1 | 1 | 1 | 4 | 3
      F | 1/4 | 3 | 3 | 1/2 | 1 | 1 | 3 | 2
      G | 1/5 | 1/2 | 1 | 1/3 | 1/4 | 1/3 | 1 | 1/2
      H | 1/4 | 1/3 | 1 | 1/3 | 1/3 | 1/2 | 2 | 1

      Table 5.  Expert scoring table.

      Factor | A | B | C | D | E | F | G | H
      A | 1 | 3 | 4 | 3 | 2 | 2 | 5 | 5
      B | 1/3 | 1 | 4 | 1 | 1/2 | 1/2 | 3 | 3
      C | 1/4 | 1/4 | 1 | 1/2 | 1/3 | 1/3 | 1 | 1
      D | 1/3 | 1 | 2 | 1 | 1/2 | 1 | 3 | 3
      E | 1/4 | 2 | 3 | 2 | 1 | 1 | 5 | 4
      F | 1/3 | 2 | 3 | 1 | 1 | 1 | 5 | 3
      G | 1/5 | 1/3 | 1 | 1/3 | 1/5 | 1/5 | 1 | 1
      H | 1/5 | 1/3 | 1 | 1/3 | 1/4 | 1/3 | 1 | 1

      Table 6.  Expert scoring table.

      Factor | A | B | C | D | E | F | G | H
      A | 1 | 4 | 4 | 3 | 3 | 2 | 4 | 3
      B | 1/4 | 1 | 3 | 1/2 | 1/3 | 1/2 | 5 | 4
      C | 1/4 | 1/3 | 1 | 1/3 | 1/3 | 1/3 | 1 | 1
      D | 1/3 | 2 | 3 | 1 | 1/2 | 1 | 4 | 3
      E | 1/3 | 3 | 3 | 2 | 1 | 1 | 4 | 4
      F | 1/2 | 2 | 3 | 1 | 1 | 1 | 4 | 4
      G | 1/4 | 1/5 | 1 | 1/4 | 1/4 | 1/4 | 1 | 1
      H | 1/3 | 1/4 | 1 | 1/3 | 1/4 | 1/4 | 1 | 1

      On the basis of scoring by three experts, we analyze the weight of each factor according to the AHP analysis method through Matlab programming. The results are shown in Table 7.

      Table 7.  AHP analysis results table.

      Factor | Result 1 | Result 2 | Result 3 | Average value
      Commuting travel | 0.345438 | 0.282882 | 0.279956 | 0.3027
      Entertainment travel | 0.090772 | 0.117853 | 0.109588 | 0.1061
      Family income | 0.046612 | 0.050254 | 0.047959 | 0.0483
      Weather conditions | 0.150039 | 0.115545 | 0.136994 | 0.1342
      Road congestion | 0.145680 | 0.182969 | 0.182856 | 0.1705
      Distance from OD to subway station | 0.125766 | 0.163270 | 0.158760 | 0.1493
      Publicity policies | 0.041781 | 0.041723 | 0.041390 | 0.0416
      Preferential ticket policies | 0.053911 | 0.045504 | 0.042497 | 0.0473
      Maximum eigenvalue | 8.572622 | 8.175512 | 8.296744 | 8.3483
      Consistency index | 0.081803 | 0.025073 | 0.042392 | 0.0497
      Consistency ratio | 0.058016 | 0.017782 | 0.030065 | 0.0353

      From the results in Table 7, the $CR$ values of the three results are all less than 0.1 and their average is 0.0353, so the judgments pass the consistency test. The average weight of commuting travel is 0.3027, the highest of the eight factors, and commuting travel also has the highest weight in each of the three expert ratings. Therefore, this paper chooses the commuting travel factor for further analysis.

    • In this paper, the one-way Analysis of Variance (ANOVA)[33] method is adopted to analyze how strongly the time factor influences rail transit passenger flow. One-way ANOVA, also known as the F test, is a statistical inference method that analyzes the variation in the data to infer whether the population means represented by two or more sample means differ. Put simply, it tests whether different levels of the same influencing factor have a significant effect on the quantity of interest. The Duncan test is then used to show clearly how the hourly groups relate to one another.

      Following this method, as shown in Fig. 6, we treat each hour of the day as one group, giving 24 groups; each group contains the rail transit passenger flow of that hour on each of the five days. The significance of each hour's impact on passenger flow is then analyzed, and each hour is weighted according to its impact.

      Figure 6. 

      SPSS analysis flow chart.
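The ANOVA itself was run in SPSS (Fig. 6); an equivalent check can be written in a few lines of Python, assuming the hourly passenger flow has been tabulated with day, hour and flow columns (the file and column names here are assumptions):

```python
# Sketch of the one-way ANOVA over the 24 hourly groups (5 daily observations
# per group). The input table layout is an assumption.
import pandas as pd
from scipy import stats

hourly = pd.read_csv("inbound_flow_hourly.csv")   # hypothetical: columns day, hour, flow

# One sample of 5 values (Monday to Friday) per hour of the day
groups = [g["flow"].values for _, g in hourly.groupby("hour")]

f_stat, p_value = stats.f_oneway(*groups)
print(f"F = {f_stat:.3f}, p = {p_value:.4f}")     # the paper reports F = 199.685, Sig = 0.000
```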

      From Table 8, the value of F is significantly greater than 1: the between-group mean square (MSB) is far larger than the within-group mean square error (MSE), indicating a significant difference in hourly passenger flow between the groups. The Sig value of 0.000 is well below 0.01, confirming that the group data differ significantly and that the time factor has a significant impact on the size of rail transit passenger flow. Therefore, the data set is further analyzed with the Duncan test.

      Table 8.  ANOVA analysis results table.

      Source | Sum of squares | Degrees of freedom | Mean square | F | Sig
      Between groups | 2430713412 | 23 | 390252218.9 | 199.685 | 0.000
      Within groups | 50808048.009 | 96 | 441782.306 |  |
      Total | 2481521460 | 119 |  |  |

      According to Duncan analysis, hourly passenger flow data were grouped using the criterion of inter-group significance P < 0.05. The grouping results are shown in Table 9.

      Table 9.  Hourly passenger flow grouping.

      Group | Number of cases | Average value of passenger flow | Intra-group significance
      00:00 | 5 | 0.00 | 0.930
      01:00 | 5 | 0.00 |
      02:00 | 5 | 0.00 |
      03:00 | 5 | 0.00 |
      04:00 | 5 | 0.00 |
      05:00 | 5 | 6.20 |
      23:00 | 5 | 47.80 |
      22:00 | 5 | 1781.40 | 1.000
      06:00 | 5 | 2741.40 | 1.000
      21:00 | 5 | 4496.00 | 0.688
      11:00 | 5 | 4863.20 |
      20:00 | 5 | 4889.20 |
      12:00 | 5 | 4939.60 |
      10:00 | 5 | 5333.60 |
      14:00 | 5 | 5387.20 |
      13:00 | 5 | 5514.80 |
      19:00 | 5 | 5808.00 |
      15:00 | 5 | 6490.60 | 0.237
      09:00 | 5 | 7002.60 |
      16:00 | 5 | 7577.40 |
      07:00 | 5 | 10478.80 | 1.000
      17:00 | 5 | 12819.20 | 0.208
      18:00 | 5 | 13402.00 |
      08:00 | 5 | 15960.60 | 1.000

      According to the Duncan analysis shown in Table 9, the 24 h of the day are divided into eight groups, such that the intra-group significance is greater than 0.05 and the inter-group significance is less than 0.05. The weight settings in this paper therefore follow the results of this analysis, as shown in Fig. 7.

      Figure 7. 

      Hourly weight setting diagram.
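The exact weight values assigned to each group are those plotted in Fig. 7 and are not reproduced in the text, so the sketch below only illustrates the idea: each hour is mapped to its Duncan group from Table 9, and the group index is scaled into [0, 1] as a stand-in for the actual weights.

```python
# Hypothetical hourly weight W_h derived from the Duncan grouping of Table 9.
# The grouping follows Table 9; the linear scaling of the group index is an
# assumption standing in for the actual Fig. 7 weights.
duncan_group = {
    **{h: 1 for h in (0, 1, 2, 3, 4, 5, 23)},      # near-zero overnight flow
    22: 2,
    6: 3,
    **{h: 4 for h in (10, 11, 12, 13, 14, 19, 20, 21)},
    **{h: 5 for h in (9, 15, 16)},
    7: 6,                                           # morning sub-peak
    **{h: 7 for h in (17, 18)},                     # evening peak
    8: 8,                                           # morning peak, highest flow
}

def hour_weight(hour: int) -> float:
    """Map an hour of the day to a weight in [0, 1] based on its Duncan group."""
    return (duncan_group[hour] - 1) / 7.0

print(hour_weight(8), hour_weight(3))               # 1.0 for the morning peak, 0.0 overnight
```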

    • The LSTM model was first proposed by Hochreiter and Schmidhuber in 1997[7]. LSTM is a recurrent neural network designed specifically to solve the long-term dependency problem of the general Recurrent Neural Network (RNN). BILSTM[34] is the abbreviation of Bi-directional Long Short-Term Memory, a combination of a forward LSTM and a backward LSTM; a detailed description of LSTM and BILSTM is given in the Appendix. BILSTM has the advantage of capturing long-range dependencies. Based on the applicability of the commonly used deep learning models to nonlinear data, the BILSTM model is selected and a specific weight is given to the hourly inbound passenger flow. An improved BILSTM model considering the commuter travel time factor, named the 'BILSTM+' model, is then proposed in this paper. The model structure is shown in Fig. 8. The proposed model is trained on the rail transit passenger flow data at 5-min intervals from Monday to Thursday and then predicts the 5-min passenger flow on Friday. Although the passenger flow characteristics from Monday to Thursday differ from those on Friday, the proposed model is designed to handle this and to obtain better prediction accuracy.

      Figure 8. 

      Improved model structure.

      For the improved model, the whole prediction process of the model is shown in Fig. 9, which can also be described in detail in the following five steps:

      Figure 9. 

      Model training process.

      Step 1: The data sets are built as time series. The BILSTM model is a time series model, which requires the data set to form a stable time series; otherwise the time series model cannot be established. The data span September 16, 2019 to September 20, 2019, and the daily travel data are aggregated as passenger flow counts for every 5 min.

      Step 2: The factor weights are set. In this paper, the weight of every hour is set according to the data analysis above. The factor weight and the 5-min rail transit passenger flow data are input into the BILSTM model together, and the formula is as follows:

      $ {x_t} = \left( {\begin{array}{*{20}{c}} {{D_m}}&{{W_h}} \end{array}} \right) $ (7)

      In Eq. (7), ${x_t}$ is the model input, and ${D_m}$ and ${W_h}$ represent the rail transit passenger flow for each 5-min interval and the time weight of the corresponding hour of the day, respectively.
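A sketch of this input construction is given below; it reuses flow_5min and hour_weight from the earlier sketches and assumes a look-back window equal to the 'Backstep' value of 24 listed later in Table 10.

```python
# Sketch of Step 2: pairing each 5-min flow value D_m with its hourly weight W_h
# (Eq. 7) and building look-back sequences for the BILSTM.
import numpy as np
import pandas as pd

flow_5min = pd.read_csv("inbound_flow_5min.csv", index_col=0, parse_dates=True)

D = flow_5min["inbound_flow"].values.astype(float)
W = np.array([hour_weight(ts.hour) for ts in flow_5min.index])  # from the Duncan sketch

X_t = np.column_stack([D, W])            # each row is x_t = (D_m, W_h)

def make_sequences(features, backstep=24):
    """Build (samples, backstep, 2) inputs and the next-step flow as the target."""
    xs, ys = [], []
    for i in range(len(features) - backstep):
        xs.append(features[i:i + backstep])
        ys.append(features[i + backstep, 0])
    return np.array(xs), np.array(ys)

X, y = make_sequences(X_t, backstep=24)
```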

      Step 3: The data sets are divided. The training set is the learning sample data set and is mainly used to train the model. The validation set is used to tune the parameters of the trained model, such as the number of hidden units in the neural network, and to determine the network structure or the parameters that control model complexity. The test set is used to assess the generalization ability (recognition rate, etc.) of the trained model. In this paper, the original data set is divided into a training set and a test set at a ratio of 4:1, and 10% of the training set is used as the validation set.

      Step 4: The data are normalized. Normalization keeps the original distribution of the data while scaling values into the range 0 to 1; this preserves the original features of the data but reduces the magnitude of the values, which greatly helps model training and prediction. In this paper, the Min-Max normalization method is used, with the following formula:

      $ x' = \frac{{x - \min \left( x \right)}}{{\max \left( x \right) - \min \left( x \right)}} $ (8)

      In Eq. (8), $x'$ is the normalized value, $x$ is the real data value, and $\min(x)$ and $\max(x)$ are the minimum and maximum of the series.
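A minimal sketch of Eq. (8) applied column-wise to the model inputs:

```python
# Min-max normalization of Eq. (8), applied to each feature column.
import numpy as np

def min_max_normalize(x: np.ndarray) -> np.ndarray:
    """Scale each column of x into the range [0, 1]."""
    x_min = x.min(axis=0)
    x_max = x.max(axis=0)
    return (x - x_min) / (x_max - x_min)
```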

      Step 5: The training set is input into the model for training. According to the training results, the model parameters and the weight of the model input are adjusted iteratively. In this paper, the K-fold cross-validation method is used[35−37] with K = 10 and no repeated sampling: the training set is randomly divided into 10 pieces, one piece is selected as the validation set each time, and the remaining nine pieces are used for training. The average of the 10 test results is taken as the estimate of model accuracy and as the performance index of the model under the current K-fold cross-validation. The schematic diagram of K-fold cross-validation is shown in Fig. 10. If the model error is too large, all parameters are reset and the training set is re-trained. Through continuous training and debugging, the optimal parameters are finally determined.

      Figure 10. 

      K-fold cross-validation schematic diagram.
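A sketch of this 10-fold procedure using scikit-learn's KFold is shown below; build_model stands for the BILSTM construction sketched after Table 10 and is assumed to compile with an MAE metric.

```python
# Sketch of the 10-fold cross-validation of Step 5.
import numpy as np
from sklearn.model_selection import KFold

def cross_validate(X, y, build_model, epochs=50):
    """Return the mean validation MAE over 10 folds without repeated sampling."""
    kf = KFold(n_splits=10, shuffle=True, random_state=0)
    scores = []
    for train_idx, val_idx in kf.split(X):
        model = build_model()                       # fresh model for every fold
        model.fit(X[train_idx], y[train_idx], epochs=epochs, verbose=0)
        _, fold_mae = model.evaluate(X[val_idx], y[val_idx], verbose=0)
        scores.append(fold_mae)
    return float(np.mean(scores))
```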

      Through the training of the training set and the adjustment of the model in the above steps, the final parameters of the model in this paper are shown in Table 10.

      Table 10.  Detailed description of parameters.

      Parameter | Value
      Number of hidden layers | 2
      Number of neurons in each hidden layer | 15
      Training times | 50
      Activation function of hidden recurrent layers | tanh
      Learning rate | 0.02
      Backstep | 24
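A sketch of a network matching these settings is given below, written with TensorFlow/Keras; the exact layer arrangement, loss and optimizer used by the authors are not stated, so those choices here are assumptions.

```python
# Sketch of a 'BILSTM+' network following Table 10: two bidirectional hidden
# layers of 15 units with tanh activation, learning rate 0.02, 50 training
# epochs, and an input window of 24 past steps (backstep) with 2 features
# (5-min flow D_m and hourly weight W_h).
from tensorflow.keras import layers, models, optimizers

def build_model(backstep=24, n_features=2):
    model = models.Sequential([
        layers.Bidirectional(layers.LSTM(15, activation="tanh", return_sequences=True),
                             input_shape=(backstep, n_features)),
        layers.Bidirectional(layers.LSTM(15, activation="tanh")),
        layers.Dense(1),                 # next 5-min inbound passenger flow
    ])
    model.compile(optimizer=optimizers.Adam(learning_rate=0.02),
                  loss="mse", metrics=["mae"])
    return model

# model = build_model()
# model.fit(X_train, y_train, epochs=50, validation_split=0.1, verbose=0)
```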

      We then evaluated the proposed model using the data from the five working days.

    • In this paper, the Root Mean Squared Error (RMSE), Mean Absolute Error (MAE) and the coefficient of determination (R2 score) are selected as the accuracy evaluation indexes for comparing the prediction algorithms[38−41]. Their formulas are shown in Table 11.

      Table 11.  Predictive evaluation index formula.

      Metric | Formula
      RMSE | ${\rm{RMSE}}=\sqrt {\dfrac{1}{n}\displaystyle\sum\limits^n_{i=1}(y_p-y)^2} $
      MAE | ${\rm{MAE}}=\dfrac{1}{n}\displaystyle\sum\limits^n_{i=1}\left|y_p-y\right| $
      R2 score | ${\rm{R2}}=1-\dfrac{\displaystyle \sum\nolimits^{n}_{i=1}\left(y_p-y\right)^2}{\displaystyle\sum\nolimits^{n}_{i=1}\left(y-\bar y\right)^2} $
      $ n $ represents the number of data samples, $ y_{p} $ is the forecast value, $ y $ is the actual value, and $ \bar y $ is the mean of the actual values.

      The Root Mean Square Error (RMSE) is the square root of the ratio of the square of the deviation between the predicted value and the true value to the number of observations n. In actual measurements, n is always finite, and the true value can only be replaced by the most reliable (best) value. RMSE ranges from 0 to infinity, and is equal to 0 when the predicted value is in perfect agreement with the real value. The larger the error, the greater the value.

      Mean Absolute Error (MAE) is another commonly used regression loss function. It is the mean of the absolute sum of the difference between the target value and the predicted value. It represents the mean error margin of the predicted value, regardless of the direction of the error, and ranges from 0 to infinity. When the predicted value is exactly consistent with the true value, it is equal to 0, which is the perfect model. The larger the error, the greater the value. The advantage of MAE is that it is less sensitive to outliers and more stable as a loss function.

      R2 refers to the goodness of fit, i.e., the degree to which the regression fits the observed values; in deep learning it is usually called the R2 score. Colloquially, the R2 score uses the mean as an error baseline and checks whether the prediction error is greater or smaller than that baseline. Its value ranges from 0 to 1. An R2 score of 1 means that the predicted values exactly match the true values, i.e., the model perfectly fits all the real data and is the best possible model. In practice a model always has some error; when the error is small, the numerator is much smaller than the denominator and the score approaches 1, which still indicates a good model, whereas as the error grows the R2 score moves further away from 1. An R2 score of 0 means that every predicted value equals the mean, i.e., the model is no better than the mean-value model. An R2 score below 0 indicates that the constructed model is worse than this benchmark and should be rebuilt.
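For reference, the three indicators of Table 11 can be computed directly with NumPy:

```python
# Evaluation indicators of Table 11.
import numpy as np

def rmse(y_true, y_pred):
    return float(np.sqrt(np.mean((y_pred - y_true) ** 2)))

def mae(y_true, y_pred):
    return float(np.mean(np.abs(y_pred - y_true)))

def r2_score(y_true, y_pred):
    ss_res = np.sum((y_true - y_pred) ** 2)          # residual sum of squares
    ss_tot = np.sum((y_true - np.mean(y_true)) ** 2)  # total sum of squares
    return float(1.0 - ss_res / ss_tot)
```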

    • First, we use the general BILSTM model and other deep learning models to predict the inbound passenger flow under the same parameter settings. The results are shown in Fig. 11.

      Figure 11. 

      Prediction results of each model.

      Figure 11 shows the prediction of the 5-min rail transit inbound passenger flow on Friday, based on the inbound passenger flow data from Monday to Thursday. The abscissa in Fig. 11 is the index of the test-set samples, arranged in sequence as a 5-min time series, and the ordinate is the passenger flow. In Fig. 11, the black polyline is the real data and the green polyline is the data predicted by the BILSTM model. Compared with the other four deep learning models, the BILSTM model performs best, while the GCN model performs slightly worse than the BILSTM model on simple one-dimensional time series data.

      Subsequently, we compared the prediction results of the BILSTM model considering the hourly travel characteristics factor, as proposed in this paper, with those of the general BILSTM model. This is shown in Fig. 12.

      Figure 12. 

      Comparison chart of prediction results of the BILSTM model.

      In the figure, the blue polyline is the BILSTM model proposed in this paper, which considers the hourly travel characteristics factor, and the red polyline is the baseline BILSTM model. Compared with the general BILSTM model, the proposed model is more accurate during the morning and evening rush hours and during metro non-operation hours.

      Then we use the evaluation index selected in this paper to evaluate each model more accurately.

      Each model is run five times, the evaluation index values of each run are recorded, and the averages of the five results are compared. Every evaluation index shows that the BILSTM model proposed in this paper, which considers the peak hour factor, achieves the best results.

      Table 12 shows the evaluation index values for each model. The RMSE of the proposed 'BILSTM+' model is 30.32% lower than that of the original BILSTM model, its MAE is 45.61% lower, and its R2 value is 6.14% higher.

      Table 12.  Model evaluation index value.

      Metric | RNN | GRU | LSTM | BILSTM | GCN | BILSTM+
      RMSE | 168.382 | 146.560 | 113.062 | 90.894 | 152.308 | 62.503
       | 170.353 | 141.397 | 119.892 | 91.684 | 148.965 | 60.165
       | 165.936 | 139.872 | 110.674 | 87.541 | 146.589 | 63.847
       | 175.364 | 149.681 | 107.985 | 91.983 | 151.862 | 65.734
       | 169.872 | 150.219 | 115.621 | 89.548 | 145.962 | 62.476
      Average | 169.981 | 145.546 | 113.447 | 90.330 | 149.137 | 62.945
      MAE | 137.713 | 120.513 | 94.716 | 76.353 | 130.112 | 41.621
       | 138.761 | 115.561 | 99.061 | 77.872 | 128.962 | 39.592
       | 134.823 | 113.872 | 91.691 | 74.184 | 131.621 | 41.932
       | 143.652 | 123.641 | 89.549 | 77.297 | 127.923 | 43.291
       | 138.982 | 124.034 | 96.082 | 75.832 | 125.982 | 41.095
      Average | 138.786 | 119.524 | 94.220 | 76.308 | 128.920 | 41.506
      R2 | 0.772 | 0.827 | 0.897 | 0.913 | 0.789 | 0.969
       | 0.769 | 0.815 | 0.881 | 0.910 | 0.802 | 0.973
       | 0.780 | 0.811 | 0.902 | 0.915 | 0.793 | 0.968
       | 0.786 | 0.832 | 0.909 | 0.908 | 0.810 | 0.961
       | 0.774 | 0.833 | 0.885 | 0.912 | 0.793 | 0.970
      Average | 0.776 | 0.824 | 0.895 | 0.912 | 0.797 | 0.968
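The percentage improvements quoted above follow directly from the Table 12 averages:

$ \dfrac{90.330 - 62.945}{90.330} \times 100\% \approx 30.32\%,\quad \dfrac{76.308 - 41.506}{76.308} \times 100\% \approx 45.61\%,\quad \dfrac{0.968 - 0.912}{0.912} \times 100\% \approx 6.14\% $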

      Then we compared the prediction results of the BILSTM model considering only the peak hour factor (only the weight feature vector is given to the morning and evening peak at 7:00−9:00 and 16:00−18:00) and the BILSTM model considering the hourly travel factor.

      As shown in Fig. 13, the prediction result of the BILSTM model considering only peak hours is better than that of the original BILSTM model, but the 'BILSTM+' model proposed in this paper still achieves the best prediction accuracy.

      Figure 13. 

      Comparison of prediction results of different models considering time factors.

      In summary, the prediction accuracy of the BILSTM model considering the factors of commuter travel in peak hours proposed in this paper is better than that of the model without factors.

    • Rail transit passenger flow forecasting can provide a reasonable basis for rail operation and serve as a reference when major events or activities occur, and it is of great significance for the future development of rail transit. This paper studies the forecasting of weekday inbound rail transit passenger flow. Because the passenger flow characteristics differ across days, time characteristic factors are introduced to predict the inbound passenger flow more accurately. The AHP method, ANOVA and the Duncan test are used to analyze the influencing factors thoroughly, from which the travel time characteristic is selected. A BILSTM model considering the travel time characteristic, namely the 'BILSTM+' model, is proposed. The model parameters were carefully tuned by K-fold cross-validation, yielding a 'BILSTM+' model with good prediction ability on the data set in this paper. The proposed model provides a useful reference for the prediction of rail transit inbound passenger flow and for the intellectualization of rail transit.

      However, some aspects can still be improved. In the future, multi-factor ANOVA and Duncan analysis could be used to establish a prediction model considering multiple factors. The structure of the BILSTM model itself could also be changed by inserting a 'weight gate' into the model, which would weight the output data directly instead of feeding weights in as feature vectors. We will also study better weight calculation methods for designing the model weights, or use better combined models to predict traffic flow[42−44].

      • This work is supported by the Program of Humanities and Social Science of Education Ministry of China (Grant No. 20YJA630008), the Ningbo Natural Science Foundation of China (Grant No. 202003N4142), the Natural Science Foundation of Zhejiang Province, China (Grant No. LY20G010004), and the K.C. Wong Magna Fund in Ningbo University, China.

      • Rongjun Cheng is the Editorial Board member of Journal Digital Transportation and Safety. He is blinded from reviewing or making decisions on the manuscript. The article was subject to the journal's standard procedures, with peer-review handled independently of this Editorial Board member and his research groups.

      • Appendix Long Short-Term and Bi-directional Long Short-Term Memory Neural Network.
      • Copyright: © 2023 by the author(s). Published by Maximum Academic Press, Fayetteville, GA. This article is an open access article distributed under Creative Commons Attribution License (CC BY 4.0), visit https://creativecommons.org/licenses/by/4.0/.
  • About this article
    Cite this article
    Qi Q, Cheng R, Ge H. 2023. Short-term inbound rail transit passenger flow prediction based on BILSTM model and influence factor analysis. Digital Transportation and Safety 2(1):12−22 doi: 10.48130/DTS-2023-0002
