Taxi origin and destination demand prediction based on deep learning: a review

Dan Peng; Mingxia Huang; Zhibo Xing

doi:10.48130/DTS-2023-0014

2023 Volume 2

1. Department of Transportation and Geomatics Engineering, Shenyang Jianzhu University, Shenyang 110168, China

Corresponding author: mingxia@sjzu.edu.cn

Received: 21 April 2023
Accepted: 22 August 2023
Published online: 28 September 2023
Digital Transportation and Safety 2023, 2(3): 176−189

Abstract

Taxi demand prediction is a crucial component of intelligent transportation system research. Compared to region-based demand prediction, origin-destination (OD) demand prediction has a wide range of potential applications, including real-time matching, idle vehicle allocation, ride-sharing services, and dynamic pricing. However, because OD demand involves complex spatiotemporal dependence, research in this area has been limited thus far. In this paper, we first review existing research from four perspectives: topology construction, temporal feature processing, spatial feature processing, and other relevant factors. We then elaborate on the advantages and limitations of OD prediction methods in terms of the deep learning architectures they build on. Next, we discuss ongoing challenges in OD prediction, such as dynamics, spatiotemporal dependence, semantic differentiation, time window selection, and data sparsity, and summarize and compare potential solutions to each challenge. These findings offer valuable insights for model selection in OD demand prediction. Finally, we provide public datasets and open-source code, along with suggestions for future research directions.

Keywords

- Deep learning
- Taxi demand prediction
- Taxi OD demand prediction
- Spatiotemporal data mining
- Dynamic graph

Rights and permissions
Copyright: © 2023 by the author(s). Published by Maximum Academic Press, Fayetteville, GA. This article is an open access article distributed under Creative Commons Attribution License (CC BY 4.0), visit https://creativecommons.org/licenses/by/4.0/.

References

[1]	Tebaldi C, West M. 1998. Bayesian inference on network traffic using link count data. Journal of the American Statistical Association 93:557−73 doi: 10.1080/01621459.1998.10473707
[2]	Carvalho L. 2014. A Bayesian statistical approach for inference on static origin–destination matrices in transportation studies. Technometrics 56:225−37 doi: 10.1080/00401706.2013.826144
[3]	Spiess H. 1987. A maximum likelihood model for estimating origin-destination matrices. Transportation Research Part B: Methodological 21:395−412 doi: 10.1016/0191-2615(87)90037-3
[4]	Chang GL, Tao X. 1999. An integrated model for estimating time-varying network origin-destination distributions. Transportation Research Part A: Policy and Practice 33:381−99 doi: 10.1016/S0965-8564(98)00038-X
[5]	Chen Y, Ordónez F, Palmer K. 2006. Confidence intervals for OD demand estimation. USC-ISE Working Paper 2006:1
[6]	Hazelton ML. 2008. Statistical inference for time varying origin-destination matrices. Transportation Research Part B: Methodological 42:542−52 doi: 10.1016/j.trb.2007.11.003
[7]	Djukic T, Flötteröd G, van Lint H, Hoogendoorn S. 2012. Efficient real time OD matrix estimation based on Principal Component Analysis. 2012 15th International IEEE Conference on Intelligent Transportation Systems, Anchorage, AK, USA, 2012. USA: IEEE. pp. 115−21. https://doi.org/10.1109/ITSC.2012.6338720
[8]	Shao H, Lam WHK, Sumalee A, Chen A, Hazelton ML. 2014. Estimation of mean and covariance of peak hour origin-destination demands from day-to-day traffic counts. Transportation Research Part B: Methodological 68:52−75 doi: 10.1016/j.trb.2014.06.002
[9]	Lu S, Wang J, Xue Z, Liu X. 2016. Traffic analysis and OD travel time matrix based on two-fluid model. Journal of Highway and Transportation Research and Development (English Edition) 10:78−84 doi: 10.1061/jhtrcq.0000522
[10]	Zhu X, Guo D. 2017. Urban event detection with big data of taxi OD trips: a time series decomposition approach. Transactions in GIS 21:560−74 doi: 10.1111/tgis.12288
[11]	Ren J, Xie Q. 2017. Efficient OD trip matrix prediction based on tensor decomposition. 2017 18th IEEE International Conference on Mobile Data Management (MDM), Daejeon, Korea (South), 2017. USA: IEEE. pp. 180−85. https://doi.org/10.1109/MDM.2017.32
[12]	Li X, Kurths J, Gao C, Zhang J, Wang Z, et al. 2017. A hybrid algorithm for estimating origin-destination flows. IEEE Access 6:677−87 doi: 10.1109/ACCESS.2017.2774449
[13]	Li J, Wen H, Lin L, Qi W. 2018. Demand prediction model of E-hailing based on QPSO_RBF neural network. Journal of Guangxi University (Natural Science Edition) 43(2):700−9 doi: 10.13624/j.cnki.issn.1001-7445.2018.0700
[14]	Lu Y, Li S. 2014. An empirical study of with-in day OD prediction using taxi GPS data in Singapore. Report. No. 14-5074.
[15]	Hong WC. 2011. Traffic flow forecasting by seasonal SVR with chaotic simulated annealing algorithm. Neurocomputing 74(12–13):2096−107 doi: 10.1016/j.neucom.2010.12.032
[16]	Tong Y, Chen Y, Zhou Z, Chen L, Wang J, et al. 2017. The simpler the better: a unified approach to predicting original taxi demands based on large-scale online platforms. KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, 2017. New York, United States: Association for Computing Machinery. pp. 1653−62. https://doi.org/10.1145/3097983.3098018
[17]	Skarding J, Gabrys B, Musial K. 2021. Foundations and modeling of dynamic networks using dynamic graph neural networks: a survey. IEEE Access 9:79143−68 doi: 10.1109/ACCESS.2021.3082932
[18]	Huang H, Fang Z, Wang X, Miao Y, Jin H. 2020. Motif-Preserving Temporal Network Embedding. Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, Yokohama, Japan, 2020. California: International Joint Conferences on Artificial Intelligence Organization. pp. 1237−43. https://doi.org/10.24963/ijcai.2020/172
[19]	Trivedi R, Farajtabar M, Biswal P, et al. 2019. Dyrep: Learning representations over dynamic graphs. International Conference on Learning Representations.
[20]	Kumar S, Zhang X, Leskovec J. 2019. Predicting dynamic embedding trajectory in temporal interaction networks. KDD '19: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, AK, USA, 2019. New York, United States: Association for Computing Machinery. pp. 1269−78. https://doi.org/10.1145/3292500.3330895
[21]	Lv Y, Duan Y, Kang W, Li Z, Wang FY. 2015. Traffic flow prediction with big data: a deep learning approach. IEEE Transactions on Intelligent Transportation Systems 16:865−73 doi: 10.1109/TITS.2014.2345663
[22]	Krupski J, Graniszewski W, Iwanowski M. 2021. Data transformation schemes for CNN-based network traffic analysis: a survey. Electronics 10:2042 doi: 10.3390/electronics10162042
[23]	Ranjan N, Bhandari S, Zhao HP, Kim H, Khan P. 2020. City-wide traffic congestion prediction based on CNN, LSTM and transpose CNN. IEEE Access 8:81606−20 doi: 10.1109/ACCESS.2020.2991462
[24]	Li X, Zhao Z, Wang Q. 2022. ABSSNet: attention-based spatial segmentation network for traffic scene understanding. IEEE Transactions on Cybernetics 52:9352−62 doi: 10.1109/TCYB.2021.3050558
[25]	Baheti B, Gajre S, Talbar S. 2019. Semantic scene understanding in unstructured environment with deep convolutional neural network. TENCON 2019 - 2019 IEEE Region 10 Conference (TENCON), 2019, Kochi, India, 2019. USA: IEEE. pp. 790−95. https://doi.org/10.1109/TENCON.2019.8929376
[26]	Haque WA, Arefin S, Shihavuddin ASM, Hasan MA. 2021. DeepThin: a novel lightweight CNN architecture for traffic sign recognition without GPU requirements. Expert Systems with Applications 168:114481 doi: 10.1016/j.eswa.2020.114481
[27]	Zhang J, Xie Z, Sun J, Zou X, Wang J. 2020. A cascaded R-CNN with multiscale attention and imbalanced samples for traffic sign detection. IEEE Access 8:29742−54 doi: 10.1109/ACCESS.2020.2972338
[28]	Bogaerts T, Masegosa AD, Angarita-Zapata JS, Onieva E, Hellinckx P. 2020. A graph CNN-LSTM neural network for short and long-term traffic forecasting based on trajectory data. Transportation Research Part C: Emerging Technologies 112:62−77 doi: 10.1016/j.trc.2020.01.010
[29]	Zhou Z, Qin Y, Luo H. 2021. Deep spatio-temporal convolutional neural network for city traffic flow prediction. 2021 2nd International Conference on Computing and Data Science (CDS), Stanford, CA, USA, 2021. USA: IEEE. pp. 171−75. https://doi.org/10.1109/CDS52072.2021.00037
[30]	Guo S, Lin Y, Li S, Chen Z, Wan H. 2019. Deep spatial–temporal 3D convolutional neural networks for traffic data forecasting. IEEE Transactions on Intelligent Transportation Systems 20:3913−26 doi: 10.1109/TITS.2019.2906365
[31]	Ma X, Dai Z, He Z, Ma J, Wang Y, et al. 2017. Learning traffic as images: a deep convolutional neural network for large-scale transportation network speed prediction. Sensors 17:818 doi: 10.3390/s17040818
[32]	Ran J, Chen Y, Li S. 2019. Three-dimensional convolutional neural network based traffic classification for wireless communications. 2018 IEEE Global Conference on Signal and Information Processing (GlobalSIP), Anaheim, CA, USA, 2018. USA: IEEE. pp. 624−27. https://doi.org/10.1109/GlobalSIP.2018.8646659
[33]	Zhu J, Wang Q, Tao C, Deng H, Zhao L, et al. 2021. AST-GCN: attribute-augmented spatiotemporal graph convolutional network for traffic forecasting. IEEE Access 9:35973−83 doi: 10.1109/ACCESS.2021.3062114
[34]	Li Z, Xiong G, Chen Y, Lv Y, Hu B, et al. 2019. A hybrid deep learning approach with GCN and LSTM for traffic flow prediction. 2019 IEEE Intelligent Transportation Systems Conference (ITSC), Auckland, New Zealand, 2019. USA: IEEE. pp. 1929−33. https://doi.org/10.1109/ITSC.2019.8916778
[35]	Diao Z, Xie G, Wang X, Ren R, Meng X, et al. 2023. EC-GCN: a encrypted traffic classification framework based on multi-scale graph convolution networks. Computer Networks 224:109614 doi: 10.1016/j.comnet.2023.109614
[36]	Guo K, Hu Y, Sun Y, Qian S, Gao J, et al. 2021. Hierarchical graph convolution network for traffic forecasting. Proceedings of the AAAI Conference on Artificial Intelligence 35:151−59 doi: 10.1609/aaai.v35i1.16088
[37]	Dong X, Thanou D, Rabbat M, Frossard P. 2019. Learning graphs from data: a signal representation perspective. IEEE Signal Processing Magazine 36:44−63 doi: 10.1109/MSP.2018.2887284
[38]	Geng X, Li Y, Wang L, Zhang L, Yang Q, et al. 2019. Spatiotemporal multi-graph convolution network for ride-hailing demand forecasting. Proceedings of the AAAI Conference on Artificial Intelligence 33:3656−63 doi: 10.1609/aaai.v33i01.33013656
[39]	Cui Z, Henrickson K, Ke R, Wang Y. 2020. Traffic graph convolutional recurrent neural network: a deep learning framework for network-scale traffic learning and forecasting. IEEE Transactions on Intelligent Transportation Systems 21:4883−94 doi: 10.1109/TITS.2019.2950416
[40]	Ali A, Zhu Y, Chen Q, Yu J, Cai H. 2020. Leveraging spatio-temporal patterns for predicting citywide traffic crowd flows using deep hybrid neural networks. 2019 IEEE 25th International Conference on Parallel and Distributed Systems (ICPADS), Tianjin, China, 2019. USA: IEEE. pp. 125−32. https://doi.org/10.1109/ICPADS47876.2019.00025
[41]	Yu L, Du B, Hu X, Sun L, Han L, et al. 2021. Deep spatio-temporal graph convolutional network for traffic accident prediction. Neurocomputing 423:135−47 doi: 10.1016/j.neucom.2020.09.043
[42]	Li M, Zhu Z. 2021. Spatial-temporal fusion graph neural networks for traffic flow forecasting. Proceedings of the AAAI Conference on Artificial Intelligence 35:4189−96 doi: 10.1609/aaai.v35i5.16542
[43]	Wang X, Ma Y, Wang Y, Jin W, Wang X, et al. 2020. Traffic flow prediction via spatial temporal graph neural network. WWW '20: Proceedings of The Web Conference 2020, Taipei, Taiwan, 2020. New York, United States: Association for Computing Machinery. pp. 1082−92. https://doi.org/10.1145/3366423.3380186
[44]	Zhang Q, Yu K, Guo Z, Garg S, Rodrigues JJPC, et al. 2021. Graph neural network-driven traffic forecasting for the connected internet of vehicles. IEEE Transactions on Network Science and Engineering 9(5):3015−27 doi: 10.1109/TNSE.2021.3126830
[45]	Liu T, Wu W, Zhu Y, Tong W. 2020. Predicting taxi demands via an attention-based convolutional recurrent neural network. Knowledge-Based Systems 206:106294 doi: 10.1016/j.knosys.2020.106294
[46]	Rossi A, Barlacchi G, Bianchini M, Lepri B. 2020. Modelling taxi drivers’ behaviour for the next destination prediction. IEEE Transactions on Intelligent Transportation Systems 21:2980−89 doi: 10.1109/TITS.2019.2922002
[47]	Tian Y, Pan L. 2016. Predicting short-term traffic flow by long short-term memory recurrent neural network. 2015 IEEE International Conference on Smart City/SocialCom/SustainCom (SmartCity), Chengdu, China 2015. USA: IEEE. pp. 153−58. https://doi.org/10.1109/SmartCity.2015.63
[48]	Fukuda S, Uchida H, Fujii H, Yamada T. 2020. Short-term prediction of traffic flow under incident conditions using graph convolutional recurrent neural network and traffic simulation. IET Intelligent Transport Systems 14:936−46 doi: 10.1049/iet-its.2019.0778
[49]	Kim K, Lee JH, Lim HK, Oh S, Han YH. 2022. Deep RNN-based network traffic classification scheme in edge computing system. Computer Science and Information Systems 19:165−84 doi: 10.2298/csis200424038k
[50]	Paul A, Mitra S. 2021. Management of traffic signals using deep reinforcement learning in bidirectional recurrent neural network in ITS. ISMSI '21: Proceedings of the 2021 5th International Conference on Intelligent Systems, Metaheuristics & Swarm Intelligence, Victoria, Seychelles, 2021. New York, United States: Association for Computing Machinery. pp. 60−64. https://doi.org/10.1145/3461598.3461608
[51]	Li M, Wang Y, Wang Z, Zheng H. 2020. A deep learning method based on an attention mechanism for wireless network traffic prediction. Ad Hoc Networks 107:102258 doi: 10.1016/j.adhoc.2020.102258
[52]	Lai Y, Zhang K, Lin J, Yang F, Fan Y. 2020. Taxi demand prediction with LSTM-based combination model. 2019 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom), Xiamen, China, 2019. USA: IEEE. pp. 944−50. https://doi.org/10.1109/ISPA-BDCloud-SustainCom-SocialCom48970.2019.00137
[53]	Nihale S, Sharma S, Parashar L, Singh U. 2020. Network traffic prediction using long short-term memory. 2020 International Conference on Electronics and Sustainable Communication Systems (ICESC), Coimbatore, India, 2020. USA: IEEE. pp. 338−43. https://doi.org/10.1109/ICESC48915.2020.9156045
[54]	Zeng C, Ma C, Wang K, Cui Z. 2022. Predicting vacant parking space availability: a DWT-Bi-LSTM model. Physica A: Statistical Mechanics and Its Applications 599:127498 doi: 10.1016/j.physa.2022.127498
[55]	Fu R, Zhang Z, Li L. 2017. Using LSTM and GRU neural network methods for traffic flow prediction. 2016 31st Youth Academic Annual Conference of Chinese Association of Automation (YAC), Wuhan, China, 2016. USA: IEEE. pp. 324−28. https://doi.org/10.1109/YAC.2016.7804912
[56]	Zhao J, Kong W, Zhou M, Zhou T, Xu Y, et al. 2022. Prediction of urban taxi travel demand by using hybrid dynamic graph convolutional network model. Sensors 22:5982 doi: 10.3390/s22165982
[57]	Abideen ZU, Sun H, Yang Z, Ahmad RZ, Iftekhar A, et al. 2020. Deep wide spatial-temporal based transformer networks modeling for the next destination according to the taxi driver behavior prediction. Applied Sciences 11:17 doi: 10.3390/app11010017
[58]	Tsiligkaridis A, Zhang J, Taguchi H, Nikovski D. 2020. Personalized destination prediction using transformers in a contextless data setting. 2020 International Joint Conference on Neural Networks (IJCNN), Glasgow, UK, 2020. USA: IEEE. pp. 1−7. https://doi.org/10.1109/IJCNN48605.2020.9207514
[59]	Li D, Lin C, Gao W, Chen Z, Wang Z, et al. 2020. Capsules TCN network for urban computing and intelligence in urban traffic prediction. Wireless Communications and Mobile Computing 2020:6896579 doi: 10.1155/2020/6896579
[60]	Wang Y, Li J, Zhao A, Lv Z, Lu G. 2021. Temporal attention-based graph convolution network for taxi demand prediction in functional areas. WASA 2021: Wireless Algorithms, Systems, and Applications, Nanjing, China, 2021. Switzerland: Springer, Cham. pp. 203−14. https://doi.org/10.1007/978-3-030-85928-2_16
[61]	Xu J, Rahmatizadeh R, Bölöni L, Turgut D. 2018. Real-time prediction of taxi demand using recurrent neural networks. IEEE Transactions on Intelligent Transportation Systems 19:2572−81 doi: 10.1109/TITS.2017.2755684
[62]	Chang HW, Tai YC, Hsu JYJ. 2010. Context-aware taxi demand hotspots prediction. International Journal of Business Intelligence and Data Mining 5:3−18 doi: 10.1504/IJBIDM.2010.030296
[63]	Tong Y, Chen Y, Zhou Z, Chen L, Wang J, et al. 2017. The simpler the better: a unified approach to predicting original taxi demands based on large-scale online platforms. KDD '17: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, 2017. New York, United States: Association for Computing Machinery. pp. 1653−62. https://doi.org/10.1145/3097983.3098018
[64]	Vanichrujee U, Horanont T, Pattara-atikom W, Theeramunkong T, Shinozaki T. 2018. Taxi demand prediction using ensemble model based on RNNs and XGBOOST. 2018 International Conference on Embedded Systems and Intelligent Technology & International Conference on Information and Communication Technology for Embedded Systems (ICESIT-ICICTES), Khon Kaen, Thailand. USA: IEEE. pp. 1−6. https://doi.org/10.1109/ICESIT-ICICTES.2018.8442063
[65]	Xu Y, Li D. 2019. Incorporating graph attention and recurrent architectures for city-wide taxi demand prediction. ISPRS International Journal of Geo-Information 8:414 doi: 10.3390/ijgi8090414
[66]	Liu Y, Liu Z, Lyu C, Ye J. 2020. Attention-based deep ensemble net for large-scale online taxi-hailing demand prediction. IEEE Transactions on Intelligent Transportation Systems 21:4798−807 doi: 10.1109/TITS.2019.2947145
[67]	Kuang L, Yan X, Tan X, Li S, Yang X. 2019. Predicting taxi demand based on 3D convolutional neural network and multi-task learning. Remote Sensing 11:1265 doi: 10.3390/rs11111265
[68]	Duan ZT, Zhang K, Yang Y, Ni YY, Saurab B. 2018. Taxi demand prediction based on CNN-LSTM-ResNet hybrid depth learning model. Journal of Transportation Systems Engineering and Information Technology 18(4):215−23 doi: 10.16097/j.cnki.1009-6744.2018.04.032
[69]	Zhang C, Zhu F, Wang X, Sun L, Tang H, et al. 2022. Taxi demand prediction using parallel multi-task learning model. IEEE Transactions on Intelligent Transportation Systems 23:794−803 doi: 10.1109/TITS.2020.3015542
[70]	Chen Z, Zhao B, Wang Y, Duan Z, Zhao X. 2020. Multitask learning and GCN-based taxi demand prediction for a traffic road network. Sensors 20:3776 doi: 10.3390/s20133776
[71]	Liu L, Qiu Z, Li G, Wang Q, Ouyang W, et al. 2019. Contextualized spatial–temporal network for taxi origin-destination demand prediction. IEEE Transactions on Intelligent Transportation Systems 20:3875−87 doi: 10.1109/TITS.2019.2915525
[72]	Duan Z, Zhang K, Chen Z, Liu Z, Tang L, et al. 2019. Prediction of city-scale dynamic taxi origin-destination flows using a hybrid deep neural network combined with travel time. IEEE Access 7:127816−32 doi: 10.1109/ACCESS.2019.2939902
[73]	Chu KF, Lam AYS, Li VOK. 2020. Deep multi-scale convolutional LSTM network for travel demand and origin-destination predictions. IEEE Transactions on Intelligent Transportation Systems 21:3219−32 doi: 10.1109/TITS.2019.2924971
[74]	Wang Y, Yin H, Chen H, Wo T, Xu J, et al. 2019. Origin-destination matrix prediction via graph convolution: a new perspective of passenger demand modeling. Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. August 4 - 8, 2019, Anchorage, AK, USA. ACM: 1227−35
[75]	Xiong X, Ozbay K, Jin L, Feng C. 2020. Dynamic origin–destination matrix prediction with line graph neural networks and Kalman filter. Transportation Research Record: Journal of the Transportation Research Board 2674:491−503 doi: 10.1177/0361198120919399
[76]	Zhang J, Che H, Chen F, Ma W, He Z. 2020. Short-term origin-destination demand prediction in urban rail transit systems: a channel-wise attentive split-convolutional neural network method. arXiv In press doi: 10.48550/arXiv.2008.08036
[77]	Shi H, Yao Q, Guo Q, Li Y, Zhang L, et al. 2020. Predicting origin-destination flow via multi-perspective graph convolutional network. 2020 IEEE 36th International Conference on Data Engineering (ICDE), Dallas, TX, USA, 2020. USA: IEEE. pp. 1818−21. https://doi.org/10.1109/ICDE48307.2020.00178
[78]	Chen P, Fu X, Wang X. 2022. A graph convolutional stacked bidirectional unidirectional-LSTM neural network for metro ridership prediction. IEEE Transactions on Intelligent Transportation Systems 23:6950−62 doi: 10.1109/TITS.2021.3065404
[79]	Ke J, Qin X, Yang H, Zheng Z, Zhu Z, et al. 2021. Predicting origin-destination ride-sourcing demand with a spatio-temporal encoder-decoder residual multi-graph convolutional network. Transportation Research Part C: Emerging Technologies 122:102858 doi: 10.1016/j.trc.2020.102858
[80]	Zhang D, Xiao F, Shen M, Zhong S. 2021. DNEAT: a novel dynamic node-edge attention network for origin-destination demand prediction. Transportation Research Part C: Emerging Technologies 122:102851 doi: 10.1016/j.trc.2020.102851
[81]	Chen D, Wang J, Xiong C. 2021. Research on origin-destination travel demand prediction method of inter-regional online taxi based on SpatialOD-BiConvLSTM. IET Intelligent Transport Systems 15:1533−47 doi: 10.1049/itr2.12119
[82]	Han L, Ma X, Sun L, Du B, Fu Y, et al. 2022. Continuous-time and multi-level graph representation learning for origin-destination demand prediction. KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington DC, USA, 2022. New York, United States: Association for Computing Machinery. pp. 516−24. https://doi.org/10.1145/3534678.3539273
[83]	Zhang R, Han L, Liu B, Zeng J, Sun L. 2022. Dynamic graph learning based on hierarchical memory for origin-destination demand prediction. arXiv In press doi: 10.48550/arXiv.2205.14593
[84]	Zhuang D, Wang S, Koutsopoulos HN, et al. 2022. Uncertainty quantification of sparse travel demand prediction with spatial-temporal graph neural networks. Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '22), Washington DC, USA, 2022. New York, United States: Association for Computing Machinery. pp. 4639−47. https://doi.org/10.1145/3534678.3539093
[85]	Hu J, Yang B, Guo C, Jensen CS, Xiong H. 2020. Stochastic origin-destination matrix forecasting using dual-stage graph convolutional, recurrent neural networks. 2020 IEEE 36th International Conference on Data Engineering (ICDE), Dallas, TX, USA, 2020. USA: IEEE. pp. 1417−28. https://doi.org/10.1109/ICDE48307.2020.00126
[86]	Huang B, Ruan K, Yu W, Xiao J, Xie R, et al. 2023. ODformer: spatial–temporal transformers for long sequence Origin–Destination matrix forecasting against cross application scenario. Expert Systems with Applications 222:119835 doi: 10.1016/j.eswa.2023.119835
[87]	Yao X, Gao Y, Zhu D, Manley E, Wang J, et al. 2021. Spatial origin-destination flow imputation using graph convolutional networks. IEEE Transactions on Intelligent Transportation Systems 22:7474−84 doi: 10.1109/TITS.2020.3003310
[88]	Zou X, Zhang S, Zhang C, Yu JJQ, Chung E. 2022. Long-term origin-destination demand prediction with graph deep learning. IEEE Transactions on Big Data 8:1481−95 doi: 10.1109/TBDATA.2021.3063553
[89]	Wang N, Zheng L, Shen H, Li S. 2023. Ride-hailing origin-destination demand prediction with spatiotemporal information fusion. Transportation Safety and Environment Accepted paper:tdad026 doi: 10.1093/tse/tdad026
[90]	Huang Z, Zhang W, Wang D, Yin Y. 2022. A GAN framework-based dynamic multi-graph convolutional network for origin-destination-based ride-hailing demand prediction. Information Sciences 601:129−46 doi: 10.1016/j.ins.2022.04.024
[91]	Yang Y, Zhang S, Zhang C, Yu JJQ. 2021. Origin-destination matrix prediction via hexagon-based generated graph. 2021 IEEE International Intelligent Transportation Systems Conference (ITSC), Indianapolis, IN, USA, 2021. USA: IEEE. pp. 1399−404. https://doi.org/10.1109/ITSC48978.2021.9564718
[92]	Li D, Wang W, Zhao D. 2023. Designing a novel two-stage fusion framework to predict short-term origin–destination flow. Journal of Transportation Engineering-Part A: Systems 149(5):04023032 doi: 10.1061/JTEPBS.TEENG-7573
[93]	Peng Z, Wu G, Xia F. 2021. Clustering shift graph convolutional network for taxi origin-destination demand prediction. 2021 IEEE 33rd International Conference on Tools with Artificial Intelligence (ICTAI), Washington, DC, USA, 2021. USA: IEEE. pp. 268−72. https://doi.org/10.1109/ICTAI52525.2021.00044
[94]	Bhanu M, Kumar R, Roy S, Mendes-Moreira J, Chandra J. 2022. Graph multi-head convolution for spatio-temporal attention in origin destination tensor prediction. In PAKDD 2022: Advances in Knowledge Discovery and Data Mining, eds. Gama J, Li T, Yu Y, Chen E, Zheng Y, et al. Switzerland: Springer Cham. pp. 459−71. https://doi.org/10.1007/978-3-031-05933-9_36
[95]	Chen T, Nie L, Pan J, Tu L, Zheng B, et al. 2023. Origin-destination traffic prediction based on hybrid spatio-temporal network. 2022 IEEE International Conference on Data Mining (ICDM), Orlando, FL, USA, 2022. USA: IEEE. pp. 879−84. https://doi.org/10.1109/ICDM54844.2022.00101
[96]	Cao Y, Liu L, Dong Y. 2023. Convolutional long short-term memory two-dimensional bidirectional graph convolutional network for taxi demand prediction. Sustainability 15:7903 doi: 10.3390/su15107903
[97]	Shuai C, Zhang X, Wang Y, He M, Yang F, et al. 2023. Online car-hailing origin-destination forecast based on a temporal graph convolutional network. IEEE Intelligent Transportation Systems Magazine 15:121−36 doi: 10.1109/MITS.2023.3244935

About this article

Cite this article

Peng D, Huang M, Xing Z. 2023. Taxi origin and destination demand prediction based on deep learning: a review. Digital Transportation and Safety 2(3):176−189 doi: 10.48130/DTS-2023-0014

Introduction

As urban populations grow and motorization rates increase, the daily transportation needs of city residents have become more significant. However, the widespread use of private cars exacerbates traffic congestion, and large-scale public transportation systems are often limited in their ability to meet individualized travel needs due to issues such as coverage, operating hours, and route limitations. As a result, demand-responsive public transportation services such as taxis and ride-hailing have emerged as preferred modes of travel for city dwellers due to their high accessibility, all-day operation, and comfortable, quick services.

Since the issuance of the National Informatization Plan's '13th Five-Year Plan' by the State Council in 2016, intelligent transportation construction has become a significant focus in China's smart city development. Shared travel, with taxis and ride-hailing services playing a crucial role, has emerged as an important direction for this effort. Governments at provincial and municipal levels have released relevant planning documents to guide and support the development of various operating models, including new energy, ride-hailing, and cruise taxis, among others, with a focus on deep integration and intelligent services.

In recent years, problems associated with taxi services such as difficulties in accessing a taxi, long wait times, traffic congestion, and wastage of resources have become increasingly prominent. Accurate prediction of taxi demand can aid in rebalancing the spatial and temporal distribution of vehicle resources and alleviate the spatial and temporal discrepancies between the supply and demand of taxis.

Taxi demand prediction comprises two tasks: node forecasting and edge forecasting. Node forecasts predict the total number of trips for each region, while edge forecasts predict the travel demand between pairs of regions.
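
The distinction can be made concrete with a small sketch. It assumes a hypothetical three-region OD matrix whose values are invented purely for illustration: edge-level forecasting targets every entry of the matrix, whereas node-level forecasting targets only aggregates such as the row sums (trips departing from each region).

```python
# Hypothetical OD matrix for three regions in one time interval.
# od[i][j] = number of taxi trips from region i to region j (made-up values).
od = [
    [0, 12, 5],
    [8, 0, 3],
    [4, 7, 0],
]


def origin_demand(od_matrix):
    """Node-level target: total trips departing from each region (row sums)."""
    return [sum(row) for row in od_matrix]


def destination_demand(od_matrix):
    """Node-level target: total trips arriving in each region (column sums)."""
    return [sum(col) for col in zip(*od_matrix)]


def edge_demand(od_matrix):
    """Edge-level target: one value per ordered (origin, destination) pair."""
    n = len(od_matrix)
    return {
        (i, j): od_matrix[i][j]
        for i in range(n)
        for j in range(n)
        if i != j
    }


pickups = origin_demand(od)        # one number per region
dropoffs = destination_demand(od)  # one number per region
edges = edge_demand(od)            # one number per ordered region pair
```

With n regions, a node forecast has n targets per interval while an edge forecast has n(n − 1), many of which may be zero in a given interval.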

Currently, most research on taxi demand prediction focuses on forecasting the total passenger demand in a particular target area for a specific time frame. Deep learning techniques such as Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN), and Graph Convolutional Networks (GCN), and their variations have been widely employed to extract temporal and spatial features for accurate predictions. In addition, several studies explore the incorporation of external factors, such as weather and points of interest (POI) in urban areas, to enhance prediction accuracy. Furthermore, researchers have utilized attention mechanisms, multi-task learning, residual networks, and other methods to further improve forecast accuracy.

Accurately predicting origin-destination (OD) demand is crucial for taxi platforms to make optimal real-time decisions regarding vehicle matching, idle vehicle reallocation, ride-sharing services, dynamic pricing, and other operational strategies. OD prediction involves forecasting the travel demand between origin and destination regions for a given period. It is more complex than region-level demand forecasting because of its intricate spatial and temporal dependencies. Nevertheless, given the current need to serve as many passengers as possible with limited taxi resources, OD demand prediction has a wide range of practical applications. Despite this, existing research on OD demand prediction is limited. To address this gap, this paper aims to make the following contributions:

This paper provides a systematic summary of existing research on taxi OD demand prediction, including methods used, challenges faced, and future research directions. The findings presented in this paper are intended to assist researchers in identifying areas for further investigation, as well as expanding existing research. Moreover, the practical applications of this research, which employs deep learning methods to enhance OD demand prediction, make this study highly relevant and timely. In conclusion, this paper aims to promote the application and development of OD demand prediction based on deep learning methods.

This paper provides a comprehensive review of the existing research on OD demand prediction that utilizes deep learning methods to process temporal and spatial features. The review delves into not only the theoretical aspects but also the advantages and limitations of these methods, aiming to inspire subsequent researchers to develop more novel models.

Furthermore, this paper discusses some of the key challenges that are faced by most OD demand prediction models. For each challenge, several existing solutions are summarized and compared, providing useful insights for selecting appropriate models in different contexts. Finally, based on the review and analysis conducted, this paper proposes future research directions for OD demand prediction.

This paper aims to facilitate baseline experiments in the field of transportation by collecting open-source datasets and code from the relevant literature. Additionally, this study proposes future research directions in the field.

Mathematical statistical methods

Statistical prediction methods are based on historical data and time series, and they are parametric methods. Commonly used models include the historical average model, the moving average (MA) model, the auto-regressive moving average (ARMA) model, the auto-regressive integrated moving average (ARIMA) model, and the Kalman filtering model.
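As a minimal sketch of the simplest of these baselines, the following code computes a historical-average forecast and a rolling-mean forecast for a single OD pair. The demand series and function names are illustrative assumptions; note that the rolling mean shown here is the common smoothing baseline, not the MA(q) error model used inside ARMA and ARIMA.

```python
def historical_average(history, period, target_index):
    """Average the past values observed at the same slot of each period."""
    same_slot = history[target_index % period::period]
    return sum(same_slot) / len(same_slot)

def rolling_mean(history, window):
    """Average the most recent `window` observations."""
    return sum(history[-window:]) / window

# Hypothetical demand for one OD pair: two days, four time slots per day.
history = [10, 30, 20, 5, 14, 34, 22, 7]

# Forecast the first slot of day three (index 8 in the series).
print(historical_average(history, period=4, target_index=8))  # 12.0
print(rolling_mean(history, window=3))                        # 21.0
```

The two forecasts differ sharply because the historical average respects the daily cycle while the rolling mean only tracks recent values; neither uses any spatial information.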

Tebaldi & West^[1] used a Bayesian model to analyze the flow intensity between directed origin-destination (OD) pairs. Carvalho^[2] used a hierarchical Bayesian statistical model to address the problem of reconstructing static OD matrices. Spiess^[3] used a maximum likelihood model to estimate the mean values of the OD matrix. Chang & Tao^[4] proposed a two-stage method for parallel computation that decomposed the network into multiple subnets in the first stage and designed updated parameters for dynamic OD estimation in large-scale networks. They further developed a dynamic traffic assignment model for estimating time-varying network OD distributions. Chen et al.^[5] divided the uncertainty in estimating the OD matrix into two types, statistical uncertainty and the existence of multiple feasible OD demands on the same link, and determined confidence intervals to improve prediction accuracy. Hazelton^[6] proposed a Gaussian model based on the lower-level over-dispersed process and developed a Markov chain Monte Carlo algorithm for OD matrix prediction. Djukic et al.^[7] applied PCA to transform high-dimensional OD matrices into a low-dimensional space and estimated real-time OD demand. Shao et al.^[8] proposed a heuristic iterative estimation allocation algorithm to optimize the path selection behavior for OD demand changes based on weighted least squares predictions of the mean and covariance matrix of OD demand. Lu et al.^[9] proposed a dual-fluid curve analysis method and an iterative matrix for dynamic OD route guidance. They calculated the dwell time from the iterative matrix and used it for dynamic OD route guidance. Zhu & Guo^[10] proposed a LOESS-based method that detects urban events in OD big data through time series decomposition and location-specific anomaly detection. In the same year, Ren & Xie^[11] proposed a fourth-order tensor model whose dimensions are origin, destination, vehicle type, and time. By decomposing the tensor and extracting the time factor matrices, they predicted future OD flow. However, the issue of data loss in high-dimensional data analysis remains unsolved. Li et al.^[12] proposed the NMF-AR method for OD matrix prediction, which combines non-negative matrix factorization (NMF) with autoregressive (AR) modeling.

Although statistical prediction methods have made some progress, they extract only time-related features, so the information they model is too limited. Taxi data are typical spatiotemporal data, and these methods cannot capture spatial effects, which limits their predictive performance.

Traditional machine learning methods

Machine learning-based prediction methods are non-parametric, data-driven methods that can capture feature relationships in complex data. Commonly used methods include regression analysis (represented by linear regression), the support vector machine (SVM), decision tree algorithms, random forest (RF), and artificial neural networks (ANN).

The support vector machine (SVM) is a learning method based on the structural risk minimization principle, and it can be used for non-linear regression. When samples are not linearly separable in the original space, SVM can use kernel functions to map them to a high-dimensional space in which they become linearly separable. SVM can extract decisive features from small samples and is less prone to overfitting than many other machine learning algorithms. However, when the sample size or the number of dimensions is large, training becomes slow.
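The idea of mapping samples to a higher-dimensional space can be shown with a toy sketch. The data, the explicit feature map x → (x, x²), and the hand-picked weights are all illustrative assumptions; a real SVM applies the mapping implicitly through a kernel and learns the separating hyperplane from data.

```python
# Four 1-D samples: the positive class sits at both extremes,
# so no single threshold on x can separate the classes.
points = [-2.0, -1.0, 1.0, 2.0]
labels = [1, -1, -1, 1]

def separable_by_threshold(xs, ys):
    """Check whether any threshold on x, in either orientation, separates the classes."""
    ordered = sorted(xs)
    thresholds = [ordered[0] - 1.0]
    thresholds += [(a + b) / 2 for a, b in zip(ordered, ordered[1:])]
    thresholds += [ordered[-1] + 1.0]
    for t in thresholds:
        for sign in (1, -1):
            if all((1 if sign * (x - t) > 0 else -1) == y for x, y in zip(xs, ys)):
                return True
    return False

def feature_map(x):
    """Explicit map to a 2-D space (illustrative stand-in for a kernel)."""
    return (x, x * x)

def linear_classifier(features, weights, bias):
    score = sum(w * f for w, f in zip(weights, features)) + bias
    return 1 if score > 0 else -1

# Hand-picked hyperplane in the mapped space: x^2 - 2.5 = 0.
weights, bias = (0.0, 1.0), -2.5
predictions = [linear_classifier(feature_map(x), weights, bias) for x in points]

print(separable_by_threshold(points, labels))  # False
print(predictions == labels)                   # True
```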

Random forest (RF) is composed of decision trees, which are common classification and regression algorithms. Decision trees consist of nodes and directed edges. At each internal node of each tree, the optimal feature for splitting is chosen according to a certain criterion, and the dataset is recursively divided into subsets. To avoid overly complex decision trees, pruning operations are performed. Random forest uses two forms of randomness, bootstrap sampling of the training set (bagging) and random feature subspaces, to train multiple decision trees. For regression problems such as taxi demand prediction, it averages the outputs of the individual trees. Random forest does not require pruning and is less likely to overfit, and it offers good computational efficiency, robustness, and noise resistance.
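The bagging-and-averaging step can be sketched as follows. This is a deliberately reduced illustration under stated assumptions: each tree is a one-split regression stump on a single feature, so the random feature subspace is not shown, and the hour-versus-demand data are made up.

```python
import random

def fit_stump(xs, ys):
    """Fit a one-split regression tree on 1-D inputs by minimising squared error."""
    best = None
    for threshold in sorted(set(xs)):
        left = [y for x, y in zip(xs, ys) if x <= threshold]
        right = [y for x, y in zip(xs, ys) if x > threshold]
        if not left or not right:
            continue
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        error = sum((y - left_mean) ** 2 for y in left)
        error += sum((y - right_mean) ** 2 for y in right)
        if best is None or error < best[0]:
            best = (error, threshold, left_mean, right_mean)
    if best is None:
        # Degenerate bootstrap sample (all x identical): predict the mean everywhere.
        mean = sum(ys) / len(ys)
        return (float("inf"), mean, mean)
    return best[1:]

def predict_stump(stump, x):
    threshold, left_mean, right_mean = stump
    return left_mean if x <= threshold else right_mean

def fit_bagged_stumps(xs, ys, n_trees, rng):
    """Train each stump on a bootstrap sample drawn with replacement."""
    n = len(xs)
    trees = []
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]
        trees.append(fit_stump([xs[i] for i in idx], [ys[i] for i in idx]))
    return trees

def predict_forest(trees, x):
    """Regression output is the average of the individual tree outputs."""
    return sum(predict_stump(tree, x) for tree in trees) / len(trees)

# Hypothetical data: hour of day versus pickups in one region.
hours = [1, 2, 3, 4, 7, 8, 9, 10]
pickups = [5, 6, 5, 4, 20, 22, 21, 19]

forest = fit_bagged_stumps(hours, pickups, n_trees=25, rng=random.Random(0))
night, morning = predict_forest(forest, 2), predict_forest(forest, 9)
print(night < morning)
```

Because every stump predicts a mean of observed values, the averaged forecast always stays within the range of the training targets.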

Li et al.^[13] used the Quantum Particle Swarm Optimization (QPSO) algorithm to optimize the Radial Basis Function (RBF) neural network and established the QPSO_RBF neural network model to predict the demand for ride-hailing services in urban mixed areas, using passenger boarding demand, weather conditions, and road congestion ratio as input feature variables. Lu & Li^[14] compared historical averages, the ARIMA model, the KNN method, and the ANN model using Singapore taxi GPS data and verified the superiority of ANN in long-term forecasting. Hong^[15] used a Support Vector Regression (SVR) model to predict future traffic flow and employed the Chaos Simulated Annealing (CSA) algorithm and seasonal index calculation method to measure the impact of periodic changes on future traffic flow. Tong et al.^[16] proposed the Lin-UOTD model, which aims to quickly adapt to changing application scenarios by using a simple machine learning model to predict future taxi demand.

Feature selection directly affects the accuracy of machine learning-based prediction models. Compared with time series methods, machine learning-based methods improve accuracy and generalization ability. However, they handle high-dimensional data poorly and cannot effectively model the nonlinear correlations in complex multidimensional data.

Challenges

Challenge 1: Representation of dynamic correlations in OD flow

The paired attraction relationship between two regions is subject to dynamic changes over time, typically exhibiting stronger intensity during peak periods and weaker intensity during non-peak periods. Static graphs cannot represent the dynamic trend of OD flow. Therefore, capturing these relationships dynamically is crucial for node representation (Table 1).

Table 1. Summary of deep learning models in taxi origin-destination prediction.

Model	Spatial topology construction	Spatial dependency	Temporal dependency	Data set	Other factors
CSTN^[71]	Raster	3DCNN	Conv LSTM	NYC-TOD	Local spatial context, meteorological information, globally relevant context
MultiConvLSTM^[72]	Raster	MultiConv	ConvLSTM	NYC taxi	None
CLTS^[73]	Raster	Conv2D	ConvLSTM	Beijing Taxi	None
GEML^[74]	Raster (Geographic/ semantic nodes)	SGCN (Grid embedding)	LSTM	NYC-taxi /DiDi ChengDu	Multi-task learning
FL-GCN^[75]	Graph	Graph convolution (nodes, edges)	Kalman filtering	New Jersey Highway	None
CAS-CNN^[76]	Raster	Split CNN	−	URT	Channel-wise attention
MPGCN^[77]	Graph	2D-GCN	LSTM	DIDI Beijing /DiDi shanghai	None
GCN-SBULSTM^[78]	Graph	GCN	Stacking bidirectional unidirectional LSTMs	SZ Metro	None
ST-ED-RMGC^[79]	Graph	Multi-graph convolutional networks	LSTM	NYC taxi	Encoder decoder
DNEAT^[80]	Dynamic node topology	GCN (k-TNEAT)	k-hop temporal encoder	DiDi ChengDu/ NYC taxi	None
Spatial OD-BiConvLSTM^[81]	Raster	Conv2D	BiLSTM	NYC taxi	None
CMOD^[82]	Graph (Event)	Graph representation learning	CTDG (Continuous-time evolution representation)	BJ Subway/NYC-Taxi	Multi-head attention
HMOD^[83]	Graph (Event)	Graph embedding/Random walk	GRU/CTDG	NYC-Taxi/Beijing Metro	None
STZINB-GNN^[84]	Graph	GNN	TCN	CDP dataset	None
ODformer^[86]	Graph (Event)	2DGCN	ODformer	NYC taxi	OD attention
SI-GCN^[87]	Graph (Event)	GCN (graph embedding)	Encoder-decoder	DIDI Beijing	a mapping function
STGDL^[88]	Graph (road)	S-GCN	ResNet-based block ST-Conv CNN	NYC taxi/DIDI Haikou	both short-term and long-term OD predictions
CWGAN-div^[89]	Graph (road network)	GAN	ResNet	NYC taxi	network-wide OD demand
DMGC-GAN^[90]	Graph (neighbor/ mutual attraction/ passengers' mobility association mode)	GCN	TMGCN	NYC taxi	None
Hex D-GCN^[91]	Graph (hexagon-based path)	GCN	CNN	Taxi Shanghai	None
OD-TGAT^[65]	Graph (grid map)	GAT	GRU	NYC Taxi	None
TFF^[92]	Graph	GCN	ST-Attention block	Chongqing	A modified Kalman filter (KF)
CSGCN^[93]	Graph	GCN	CNN	Taxi Beijing	Shifted Graph Clustering
gHMC-STA^[94]	Graph	GCN	Multi-Head Convolution	Taxi Beijing	Graph multi-head convolution for spatio-temporal aggregation
HSTN^[95]	Raster	Separable 2D-CNN	ResNet	Taxi Shanghai	None
CTBGCN^[96]	Graph	2DGCN	Conv-LSTM	NYC Taxi	None
CT-GCN^[97]	Graph	GCN	ST block	DIDI Haikou	None

Most early studies of taxi OD demand prediction were based on static networks. Liu et al.^[72] constructed local spatial context (LSC) and global correlation context (GCC) modules based on Euclidean spatial grid data. The former learned the local spatial dependence of order demand from the origin and destination perspectives, while the latter modeled the correlation between different regions. Wang et al.^[75] used grid embedding on grid data to construct geographic and semantic neighbors, modeling passenger spatial flow patterns and the adjacency relationships of different regions. The former measured the intrinsic closeness between grids and their neighbors, while the latter modeled the semantic intensity of traffic flow between origins and destinations in the grid network. Chen et al.^[79] constructed the OD demand between regions in a single period based on regional grids. Then, they reduced the three-dimensional tensor to a two-dimensional matrix through matrix cascading, considering the spatiotemporal properties in chronological order. Ke et al.^[80] encoded the context-aware spatial dependence of OD pairs by designing a residual multi-graph convolutional (RMGC) network over multiple OD graphs. Each node in a graph corresponded to an OD pair, and adjacency matrices were established to represent the neighborhood, distance, functional similarity, and historical demand correlation of OD pairs. The above studies represent the dependencies between regions with static networks and ignore dependencies that change over time.

Shi et al.^[78] constructed static and dynamic graphs simultaneously to capture complex dynamic spatial dependencies and used an averaging strategy to obtain the final OD flow prediction. Zhang et al.^[81] proposed a dynamic node topology representation method to jointly represent the static and dynamic structural information of OD graphs. They introduced the k-TNEAT layer to adaptively adjust the relationship between each OD pair at different time intervals and learn the representations of nodes and edges, thus capturing the dynamic demand patterns of the time-varying OD graph. This method applies to both Euclidean and non-Euclidean datasets. Han et al.^[83] constructed a continuous-time dynamic graph representation learning framework based on event updates, maintaining a dynamic state vector for each transportation node and representing multi-level spatiotemporal dependencies by sharing information among virtual cluster-level and regional-level nodes. Zhang et al.^[84] constructed dynamic graphs based on time updates by treating the origin and destination as two different semantic entities, proposing an embedding module for the origin-destination pair and aggregating neighbor information through random walks. Huang et al.^[91] developed a TMGCN layer to capture spatiotemporal correlations in dynamic OD graphs, which include a static neighborhood relationship graph, an origin-destination mutual attraction dynamic graph, and a passengers’ mobility association mode dynamic graph. This layer can learn relationships across different time intervals in all types of OD graphs. Together, these studies advance from OD prediction on a static network to capturing spatiotemporal dynamic correlations by constructing dynamic graphs.

Challenge 2: Spatial-temporal correlation

Spatial-temporal data possess both correlation and heterogeneity. Spatial-temporal correlation is manifested in the fact that each node can influence adjacent nodes at the next time step. Spatial-temporal heterogeneity is manifested in the different distributions of OD flow under conditions such as morning peak, evening peak, city center, and city edge. Current models that chain two independent components, one for temporal dependencies and one for spatial dependencies, often fail to capture spatial-temporal correlation and heterogeneity.

Zhang et al.^[84] and Han et al.^[83] applied node embedding on dynamic graphs, extending time into the spatial domain as the second dimension, which can simultaneously capture the structural relationship between nodes and their evolutionary relationship over time. Huang et al.^[87] proposed an OD attention mechanism to capture the unique spatial dependency between OD pairs with identical origins or destinations.

Challenge 3: Differentiation of different semantics of origin and destination

In a complex and irregular transportation network, the passenger demand of different OD pairs can be geographically and semantically correlated, and the correlations can be directed or bidirectional. However, modeling demand separately for origins and destinations to learn local features around each grid discards the flow relationship between OD pairs, which limits its practical value. Conversely, if only the distance and flow between any two grids are considered without distinguishing origin from destination, the directedness of OD flow is ignored, and so is the way the attraction between origin and destination varies over time.
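A small sketch makes the loss of directedness concrete. The two-region morning-peak matrix is made-up data, and `symmetrize` and `net_flow` are hypothetical helper names; the point is only that an undirected representation cannot distinguish commuting direction.

```python
# Hypothetical morning-peak OD matrix for two regions:
# index 0 = residential area, index 1 = business district.
od_morning = [
    [0, 40],  # residential -> business: 40 trips
    [5, 0],   # business -> residential: 5 trips
]

def symmetrize(od_matrix):
    """Undirected view: total flow between i and j regardless of direction."""
    n = len(od_matrix)
    return [[od_matrix[i][j] + od_matrix[j][i] for j in range(n)] for i in range(n)]

def net_flow(od_matrix, i, j):
    """Trips from i to j minus trips from j to i."""
    return od_matrix[i][j] - od_matrix[j][i]

undirected = symmetrize(od_morning)

print(undirected)                    # [[0, 45], [45, 0]]
print(net_flow(od_morning, 0, 1))    # 35
print(net_flow(undirected, 0, 1))    # 0
```

The directed matrix shows a strong residential-to-business imbalance, while the undirected view reports the same value in both directions, so the evening reversal of this pattern would look identical to the morning one.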

Liu et al.^[72] constructed an LSC module that used two convolutional neural networks to learn the local spatial context of taxi demand from the origin and destination views. However, this model does not take into account the flow relationship between the two regions or their different semantics. Wang et al.^[75] considered the combination of different origin-destination pairs and the number of passenger demands for each origin to predict the number of taxi orders from one area to another in a given time period, but considered only the flow relationship between two regions and ignored directedness, semantic differences, and bidirectionality. Shi et al.^[78] constructed a multi-perspective graph convolutional network and proposed bidirectional correlations for OD flows when the origins are the same or similar and the destinations are the same or similar. Zhang et al.^[81] defined a weighted bidirectional graph and learned dynamic demand patterns from both the demand generation and attraction aspects while incorporating the dynamic and bidirectional structural characteristics of edges. Zhang et al.^[84] proposed an origin-destination embedding module that treats the origin and destination as different semantic entities and uses the parity of sampling to obtain semantic entities with different starting and ending points, thereby distinguishing different semantic information. Chen et al.^[79] proposed a BiConvLSTM method that processes input data in both forward and reverse directions through two ConvLSTMs, while maintaining hidden states and memory cell states in both directions.

Challenge 4: Time window selection

Currently, the predominant approach to predicting OD flows is still the discrete dynamic graph method for node prediction, which aggregates historical transactions into demand snapshots. Each snapshot contains the demand within a fixed time window, so OD flows in different snapshots are disconnected. Time is a continuous feature of OD flows, and processing it with fixed windows is intuitive but lacks rigor. The choice of time granularity biases prediction accuracy: too small a granularity generates a large amount of noise, while too large a granularity obscures important information. Prediction based on a continuous-time dynamic graph, in turn, maintains a dynamic state vector for each traffic node. With a large number of OD pairs, updating and maintaining representations over continuous time becomes challenging.
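The effect of the window size can be seen in a minimal sketch of snapshot aggregation. The trip records and the `snapshots` helper are illustrative assumptions, not taken from any of the reviewed models.

```python
from collections import Counter

# Hypothetical trip records: (pickup time in minutes since midnight, origin, destination).
trips = [
    (481, "A", "B"), (484, "A", "B"), (499, "B", "A"),
    (505, "A", "B"), (512, "A", "C"), (538, "A", "B"),
]

def snapshots(trip_records, window_minutes):
    """Aggregate trips into per-window OD counts: {window_index: Counter({(o, d): n})}."""
    result = {}
    for minute, origin, destination in trip_records:
        window = minute // window_minutes
        result.setdefault(window, Counter())[(origin, destination)] += 1
    return result

fine = snapshots(trips, 15)    # 15-minute windows
coarse = snapshots(trips, 60)  # 60-minute windows

print(len(fine), len(coarse))  # 4 1
print(fine[32][("A", "B")])    # 2
print(coarse[8][("A", "B")])   # 4
```

With 15-minute windows the same six trips are spread over four snapshots with counts of one or two, which is noisy and sparse; with a 60-minute window they collapse into one snapshot that hides when within the hour the A-to-B demand occurred.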

Earlier approaches to OD demand forecasting aggregated taxi OD demand into demand snapshots, each containing the total demand within a fixed time window. Zhang et al.^[81] designed a spatiotemporal attention network with a k-hop temporal node-edge attention layer to capture the time-evolving node topology in dynamic OD graphs and used different time granularities to explore complex temporal patterns, yet this approach is still a discrete dynamic graph method. Zhang et al.^[84] designed a layered memory storage technique that integrates discrete and continuous-time information of OD demand, extending the learning of traffic node representations to a continuous-time dynamic graph view. Han et al.^[83] constructed a framework for learning continuous-time dynamic graph representations, maintaining a dynamic state vector for each traffic node to store historical transaction information and continuously update it, lifting prediction from discrete time slices to a continuous-time dynamic graph.

Challenge 5: Data sparsity

Each OD pair has its own time series, which makes the spatial dependencies more complex. Discrete dynamic graph-based prediction methods inevitably suffer from information loss and produce a large number of zero values. Continuous-time dynamic graph-based prediction methods also encounter sparse data for some OD pairs, and because the number of OD pairs to predict grows quadratically with the number of regions, the sparsity problem is exacerbated.
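The scale of the problem can be sketched with two small helpers. The 4×4 matrix is made-up data and the function names are hypothetical; the sketch only shows how sparsity is measured and how fast the number of OD pairs grows.

```python
def num_od_pairs(num_regions):
    """Every region can be both an origin and a destination."""
    return num_regions * num_regions

def zero_fraction(od_matrix):
    """Fraction of OD pairs with no observed trips in the window."""
    cells = [value for row in od_matrix for value in row]
    return sum(1 for value in cells if value == 0) / len(cells)

# Hypothetical OD matrix for four regions in one short time window.
od = [
    [0, 3, 0, 0],
    [0, 0, 0, 1],
    [0, 0, 0, 0],
    [2, 0, 0, 0],
]

print(zero_fraction(od))   # 0.8125
print(num_od_pairs(10))    # 100
print(num_od_pairs(100))   # 10000
```

A tenfold increase in the number of regions gives a hundredfold increase in OD pairs, while the number of trips in a window stays the same, so the zero fraction rises accordingly.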

Wang et al.^[75] proposed a pre-weighted aggregator that combines the perception of data sparsity and range at different granularity levels based on grid embedding to mitigate the impact of sparse data. Hu et al.^[86] designed three modules, namely factorization, prediction, and restoration, to handle data sparsity in the matrix factorization and restoration steps. Zhang et al.^[77] introduced a split CNN to transform sparse OD data into dense features and verified the effectiveness of a masking loss function for tackling data dimensionality and sparsity issues. Zhang et al.^[81] designed a regression loss $ {\mathcal{L}}_{reg} $, a commonly used squared-error loss, and an auxiliary masking loss $ {\mathcal{L}}_{mask} $ that prioritizes harder-to-predict edges, to combat the adverse effects of high sparsity. Zhuang et al.^[85] proposed the STZINB-GNN model to quantify the uncertainty of sparse travel demand, using the zero-inflated negative binomial (ZINB) distribution to capture the large number of zeros in sparse OD matrices and the negative binomial (NB) distribution for each non-zero entry. The model also introduced a spatiotemporal embedding with an additional parameter $ \pi $ to learn the likelihood that an input is zero. Han et al.^[83] mitigated sparse data issues by proposing a layered message passing module, allowing virtual cluster-level nodes and region-level nodes to share information, and designing a loss function that focuses more on non-zero demand. Yao et al.^[88] applied two gating mechanisms to the vanilla convolution operation to alleviate the error accumulation issue of typical recurrent forecasting in long-term OD prediction. Zou et al.^[89] proposed using residual blocks and refined loss functions to enhance model training stability. Huang et al.^[91] used a GAN structure to address the problem of zero-valued elements dominating the OD demand matrix. Yang et al.^[65] used a method based on filling the lower triangular matrix, considering only OD pairs with travel volumes greater than zero and handling the remaining OD pairs with a method similar to interpolation. Li et al.^[93] proposed a hybrid framework to predict short-term OD demand, in which a two-step design predicts trip generation/attraction in the first stage and estimates trip distribution in the second stage. This framework not only takes advantage of robust spatio-temporal predictors but also avoids under- or over-estimation of short-term OD matrices due to high sparsity. These five challenges and the corresponding solutions are summarized in Table 2.
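Two of the ideas above, a zero-inflated count distribution and a loss that emphasizes non-zero demand, can be sketched in a few lines. This is an illustration under stated assumptions rather than the formulation of any reviewed model: the NB parameterization (n successes, success probability p), the parameter values, and the weighting scheme in `nonzero_weighted_mse` are all chosen here for demonstration.

```python
import math

def nb_pmf(k, n, p):
    """Negative binomial pmf: probability of k failures before the n-th success."""
    log_coeff = math.lgamma(k + n) - math.lgamma(k + 1) - math.lgamma(n)
    return math.exp(log_coeff + n * math.log(p) + k * math.log(1.0 - p))

def zinb_pmf(k, n, p, pi):
    """Zero-inflated NB: with probability pi the count is a structural zero."""
    base = (1.0 - pi) * nb_pmf(k, n, p)
    return pi + base if k == 0 else base

def nonzero_weighted_mse(predicted, observed, nonzero_weight):
    """Squared error in which entries with non-zero observed demand count more."""
    total = 0.0
    for p_value, y_value in zip(predicted, observed):
        weight = nonzero_weight if y_value != 0 else 1.0
        total += weight * (p_value - y_value) ** 2
    return total / len(observed)

# Assumed parameters: n = 2, p = 0.5, zero-inflation probability pi = 0.6.
print(nb_pmf(0, 2.0, 0.5))   # 0.25
print(zinb_pmf(0, 2.0, 0.5, 0.6))
print(nonzero_weighted_mse([1.0, 0.0], [3.0, 0.0], nonzero_weight=2.0))  # 4.0
```

With these assumed parameters the plain NB assigns probability 0.25 to a zero count, whereas the zero-inflated version assigns 0.6 + 0.4 × 0.25 = 0.7, which is how the extra parameter absorbs the excess zeros of a sparse OD matrix.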

Table 2. Summary of problem solving in taxi origin-destination prediction.

Model	Dynamic/Static	Directed/Undirected	Time window	Sparse data	Spatial-temporal correlation
GEML^[74]	Static	Flow relationship but undirected	Discrete-time snapshots with the same time granularity	Multi-granularity grid embedding/pre-weighted aggregator	None
MultiConvLSTM^[72]	Dynamic and Static	Undirected	Discrete-time snapshots with the same time granularity	Self-attention	None
CLTS^[73]	Dynamic	Undirected	Discrete-time snapshots with the same time granularity	None	None
CSTN^[71]	Static	Undirected	Discrete-time snapshots with the same time granularity	None	None
CAS-CNN^[76]	Static	Undirected	Discrete-time snapshots with the same time granularity	Split the CNN/masking loss function	None
MPGCN^[77]	Two static adjacency matrices and one dynamic adjacency	Undirected	Discrete-time snapshots with the same time granularity	None	Spatiotemporal dynamic adjacency matrix
GCN-SBULSTM^[78]	Static	Undirected	Discrete-time snapshots with the same time granularity	None	None
ST-ED-RMGC^[79]	Static	Undirected	Discrete-time snapshots with the same time granularity	None	Parallel prediction
DNEAT^[80]	Dynamic and Static	Directed	Discrete-time snapshots of different time granularities	The loss function $ \mathcal{L} $ jointly minimizes $ {\mathcal{L}}_{reg} $ and $ {\mathcal{L}}_{mask} $	None
Spatial OD-BiConvLSTM^[81]	Dynamic	Undirected	Discrete-time snapshots (sliding window)	The loss function $ \mathcal{L} $	None
CMOD^[82]	Dynamic	Undirected	Continuous dynamic time	The loss function focuses on non-zero demand	Node embedding
HMOD^[83]	Dynamic and Static	Directed and semantically differentiated	Continuous dynamic time	None	Node embedding
STZINB-GNN^[84]	Static	Undirected	Discrete-time snapshots with the same time granularity	Zero-inflated negative binomial distribution/additional parameter $ \pi $ learning the probability that an input is zero	None
ODformer^[86]	Static	Directed	Long sequence time window	Transformer	Spatial-Temporal Transformers
SI-GCN^[87]	Dynamic	Directed	Continuous dynamic time	Negative sampling	Data imputation
STGDL^[88]	Static	Directed	Discrete-time snapshots (sliding window)	Two gate mechanisms	ST-GDL model
CWGAN-div^[89]	Dynamic and Static	Undirected	Discrete-time snapshots (moving average)	Residual blocks	Interpretable conditional information
DMGC-GAN^[90]	Dynamic and Static	Directed	Discrete-time snapshots (sliding window)	GAN	TMGCN
Hex D-GCN^[91]	Dynamic	Undirected	Discrete-time snapshots (moving average)	Filling the lower triangle matrix	None
OD-TGAT^[65]	Static	Directed	Discrete-time snapshots (sliding window)	GAT	None
TFF^[92]	Dynamic	Directed	Discrete-time snapshots (moving average)	Two-step design	None
CSGCN^[93]	Static	Directed	Discrete-time snapshots (sliding window)	None	None
gHMC-STA^[94]	Static	Directed	Discrete-time snapshots (sliding window)	None	None
HSTN^[95]	Static	Directed	Discrete-time snapshots (moving average)	None	None
CTBGCN^[96]	Static	Directed	Discrete-time snapshots (sliding window)	None	None
CT-GCN^[97]	Static	Directed	Discrete-time snapshots (sliding window)	None	None

Public datasets and open-source codes

The public datasets and open-source code collected from the literature are listed in Tables 3 and 4, respectively.

Table 3. Open taxi datasets.

Dataset	Links
NYC-Taxi	https://www.nyc.gov/site/tlc/about/tlc-trip-record-data.page
Porto	−
T-drive (Beijing)	https://www.microsoft.com/en-us/research/publication/t-drive-trajectory-data-sample/
Taxi-Shanghai	−
Taxi-Shenzhen	https://opendata.sz.gov.cn/
Taxi-Chengdu	https://tianchi.aliyun.com/dataset/39384

Table 4. Open-source codes.

Model	Github
GEML	https://github.com/Zekun-Cai/GEML-Origin-Destination-Matrix-Prediction-via-Graph-Convolution
CSTN	https://github.com/liulingbo918/CSTN
FL-GCN	https://github.com/alzmxx/OD_Prediction
MPGCN	https://github.com/underdoc-wang/MPGCN
ST-ED-RMGC	https://github.com/kejintao/ST-ED-RMGC/tree/main/od_prediction
CMOD	https://github.com/liangzhehan/cmod
HMOD	https://github.com/Rising0321/HMOD
STZINB-GNN	https://github.com/zhuangdingyi/stzinb
