REVIEW   Open Access    

A comprehensive review of traffic flow prediction: from traditional models to deep learning architectures

  • Supplementary Table S1: Data used in this study.
  • [1] Xu W, Liu J, Yan J, Yang J, Liu H, et al. 2024. Dynamic spatiotemporal graph wavelet network for traffic flow prediction. IEEE Internet of Things Journal 11(5):8019−29 doi: 10.1109/JIOT.2023.3317190

    [2] Abadi M, Barham P, Chen J, Chen Z, Davis A, et al. 2016. TensorFlow: a system for large-scale machine learning. Proc. 12th USENIX symposium on operating systems design and implementation (OSDI 16), Savannah, GA, USA, November 2–4, 2016. USA: USENIX Association. pp. 265−83 www.usenix.org/system/files/conference/osdi16/osdi16-abadi.pdf
    [3] Sayed SA, Abdel-Hamid Y, Hefny HA. 2023. Artificial intelligence-based traffic flow prediction: a comprehensive review. Journal of Electrical Systems and Information Technology 10:13 doi: 10.1186/s43067-023-00081-6

    [4] Cai D, Chen K, Lin Z, Li D, Zhou T, et al. 2024. JointSTNet: Joint Pre-Training for Spatial-Temporal Traffic Forecasting. IEEE Transactions on Consumer Electronics 71(2):6239−52 doi: 10.1109/TCE.2024.3476129

    [5] Wang F, Liang Y, Lin Z, Zhou J, Zhou T. 2024. SSA-ELM: a hybrid learning model for short-term traffic flow forecasting. Mathematics 12:1895 doi: 10.3390/math12121895

    [6] Chai W, Zhang L, Lin Z, Zhou J, Zhou T. 2024. GSA-KELM-KF: a hybrid model for short-term traffic flow forecasting. Mathematics 12:103 doi: 10.3390/math12010103

    [7] Wen Y, Xu P, Li Z, Xu W, Wang X. 2023. RPConvformer: a novel Transformer-based deep neural networks for traffic flow prediction. Expert Systems with Applications 218:119587 doi: 10.1016/j.eswa.2023.119587

    [8] Cui Z, Huang B, Dou H, Tan G, Zheng S, et al. 2022. GSA-ELM: a hybrid learning model for short-term traffic flow forecasting. IET Intelligent Transport Systems 16:41−52 doi: 10.1049/itr2.12127

    [9] Abdullah SM, Periyasamy M, Kamaludeen NA, Towfek SK, Marappan R, et al. 2023. Optimizing traffic flow in smart cities: soft GRU-based recurrent neural networks for enhanced congestion prediction using deep learning. Sustainability 15:5949 doi: 10.3390/su15075949

    [10] Wu K, Xu C, Yan J, Wang F, Lin Z, et al. 2023. Error-distribution-free kernel extreme learning machine for traffic flow forecasting. Engineering Applications of Artificial Intelligence 123:106411 doi: 10.1016/j.engappai.2023.106411

    [11] Chai W, Luo Q, Lin Z, Yan J, Zhou J, et al. 2024. Spatiotemporal dynamic multi-hop network for traffic flow forecasting. Sustainability 16:5860 doi: 10.3390/su16145860

    [12] Xing Z, Huang M, Peng D. 2023. Overview of machine learning-based traffic flow prediction. Digital Transportation and Safety 2:164−75 doi: 10.48130/DTS-2023-0013

    [13] Smith BL, Demetsky MJ. 1997. Traffic flow forecasting: comparison of modeling approaches. Journal of Transportation Engineering 123:261−66 doi: 10.1061/(ASCE)0733-947X(1997)123:4(261)

    [14] Williams BM, Hoel LA. 2003. Modeling and forecasting vehicular traffic flow as a seasonal ARIMA process: Theoretical basis and empirical results. Journal of transportation engineering 129:664−72 doi: 10.1061/(ASCE)0733-947X(2003)129:6(664)

    [15] Schimbinschi F, Moreira-Matias L, Nguyen VX, Bailey J. 2017. Topology-regularized universal vector autoregression for traffic forecasting in large urban areas. Expert Systems with Applications 82:301−16 doi: 10.1016/j.eswa.2017.04.015

    [16] Tan G, Zhou T, Huang B, Dou H, Song Y, et al. 2024. A noise-immune and attention-based multi-modal framework for short-term traffic flow forecasting. Soft Computing 28:4775−90 doi: 10.1007/s00500-023-09173-x

    [17] Lu H, Ge Z, Song Y, Jiang D, Zhou T, et al. 2021. A temporal-aware LSTM enhanced by loss-switch mechanism for traffic flow forecasting. Neurocomputing 427:169−78 doi: 10.1016/j.neucom.2020.11.026

    [18] Liu M, Liu G, Sun L. 2023. Spatial–temporal dependence and similarity aware traffic flow forecasting. Information Sciences 625:81−96 doi: 10.1016/j.ins.2022.12.107

    [19] Yang S, Li H, Luo Y, Li J, Song Y, et al. 2022. Spatiotemporal adaptive fusion graph network for short-term traffic flow forecasting. Mathematics 10:1594 doi: 10.3390/math10091594

    [20] Lin Z, Wang D, Cao C, Xie H, Zhou T, et al. 2025. GSA-KAN: a hybrid model for short-term traffic forecasting. Mathematics 13:1158 doi: 10.3390/math13071158

    [21] Fernandes B, Silva F, Alaiz-Moretón H, Novais P, Analide C, et al. 2019. Traffic flow forecasting on data-scarce environments using ARIMA and LSTM networks. In New Knowledge in Information Systems and Technologies. WorldCIST'19 2019. Advances in Intelligent Systems and Computing, eds. Rocha Á, Adeli H, Reis L, Costanzo S. Vol 930. Cham: Springer. pp. 273−82 doi: 10.1007/978-3-030-16181-1_26
    [22] Zhao Z, Chen W, Wu X, Chen PCY, Liu J. 2017. LSTM network: a deep learning approach for short-term traffic forecast. IET intelligent transport systems 11:68−75 doi: 10.1049/iet-its.2016.0208

    [23] Fu R, Zhang Z, Li L. 2016. Using LSTM and GRU neural network methods for traffic flow prediction. 2016 31st Youth Academic Annual Conference of Chinese Association of Automation (YAC), Wuhan, China, 11−13 November 2016. USA: IEEE. pp. 324−28 doi: 10.1109/YAC.2016.7804912
    [24] Ma X, Tao Z, Wang Y, Yu H, Wang Y. 2015. Long short-term memory neural network for traffic speed prediction using remote microwave sensor data. Transportation Research Part C: Emerging Technologies 54:187−97 doi: 10.1016/j.trc.2015.03.014

    [25] Fang W, Li X, Lin Z, Zhou J, Zhou T. 2024. Mixture correntropy with variable center LSTM network for traffic flow forecasting. Digital Transportation and Safety 3:264−70 doi: 10.48130/dts-0024-0023

    [26] Qi Q, Cheng R, Ge H. 2023. Short-term inbound rail transit passenger flow prediction based on BILSTM model and influence factor analysis. Digital Transportation and Safety 2:12−22 doi: 10.48130/DTS-2023-0002

    [27] Yang D, Li S, Peng Z, Wang P, Wang J, et al. 2019. MF-CNN: traffic flow prediction using convolutional neural network and multi-features fusion. IEICE Transactions on Information and Systems E102.D:1526−36 doi: 10.1587/transinf.2018edp7330

    [28] Méndez M, Merayo MG, Núñez M. 2023. Long-term traffic flow forecasting using a hybrid CNN-BiLSTM model. Engineering Applications of Artificial Intelligence 121:106041 doi: 10.1016/j.engappai.2023.106041

    [29] Zhang W, Yu Y, Qi Y, Shu F, Wang Y. 2019. Short-term traffic flow prediction based on spatio-temporal analysis and CNN deep learning. Transportmetrica A: Transport Science 15:1688−711 doi: 10.1080/23249935.2019.1637966

    [30] Narmadha S, Vijayakumar V. 2023. Spatio-Temporal vehicle traffic flow prediction using multivariate CNN and LSTM model. Materials Today: Proceedings 81:826−33 doi: 10.1016/j.matpr.2021.04.249

    [31] Huang B, Dou H, Luo Y, Li J, Wang J, et al. 2022. Adaptive spatiotemporal transformer graph network for traffic flow forecasting by iot loop detectors. IEEE Internet of Things Journal 10:1642−53 doi: 10.1109/JIOT.2022.3209523

    [32] Jiang W, Xiao Y, Liu Y, Liu Q, Li Z. 2022. Bi‐GRCN: a spatio‐temporal traffic flow prediction model based on graph neural network. Journal of Advanced Transportation 2022:5221362 doi: 10.1155/2022/5221362

    [33] Li Z, Zhou J, Lin Z, Zhou T. 2024. Dynamic spatial aware graph transformer for spatiotemporal traffic flow forecasting. Knowledge-based systems 297:111946 doi: 10.1016/j.knosys.2024.111946

    [34] Zhang H, Lin Z, Xie H, Zhou J, Song Y, et al. 2025. Two-way heterogeneity model for dynamic spatiotemporal traffic flow prediction. Knowledge-Based Systems 320:113635 doi: 10.1016/j.knosys.2025.113635

    [35] Chai W, Zheng Y, Tian L, Qin J, Zhou T. 2023. GA-KELM: genetic-algorithm-improved kernel extreme learning machine for traffic flow forecasting. Mathematics 11:3574 doi: 10.3390/math11163574

    [36] Cui Z, Huang B, Dou H, Cheng Y, Guan J, et al. 2022. A two-stage hybrid extreme learning model for short-term traffic flow forecasting. Mathematics 10:2087 doi: 10.3390/math10122087

    [37] Ou J, Li J, Wang C, Wang Y, Nie Q. 2024. Building trust for traffic flow forecasting components in intelligent transportation systems via interpretable ensemble learning. Digital Transportation and Safety 3:126−43 doi: 10.48130/dts-0024-0012

    [38] Zou G, Lai Z, Wang T, Liu Z, Li Y. 2024. MT-STNet: a novel multi-task spatiotemporal network for highway traffic flow prediction. IEEE Transactions on Intelligent Transportation Systems 25(7):8221−36 doi: 10.1109/TITS.2024.3411638

    [39] Goyal MTSP, Gulghane A. 2020. A review of speed flow density study of two different road Indian road and their comparison. International Journal of Scientific Research & Engineering Trends 6(2):499−504
    [40] Dorokhin S, Artemov A, Likhachev D, Novikov A, Starkov E. 2020. Traffic simulation: an analytical review. IOP Conference Series: Materials Science and Engineering 918:012058 doi: 10.1088/1757-899x/918/1/012058

    [41] Zhang L, Yuan Z, Yang L, Liu Z. 2020. Recent developments in traffic flow modelling using macroscopic fundamental diagram. Transport Reviews 40:689−710 doi: 10.1080/01441647.2020.1738588

    [42] Bramich DM, Menéndez M, Ambühl L. 2022. Fitting empirical fundamental diagrams of road traffic: A comprehensive review and comparison of models using an extensive data set. IEEE Transactions on Intelligent Transportation Systems 23:14104−27 doi: 10.1109/TITS.2022.3142255

    [43] Liu J, Wu N, Qiao Y, Li Z. 2021. A scientometric review of research on traffic forecasting in transportation. IET Intelligent Transport Systems 15:1−16 doi: 10.1049/itr2.12024

    [44] Kashyap AA, Raviraj S, Devarakonda A, Nayak KSR, K V S, et al . 2022. Traffic flow prediction models – a review of deep learning techniques. Cogent Engineering 9:2010510 doi: 10.1080/23311916.2021.2010510

    [45] Jarmuż D, Chmiel J. 2020. A review of approaches to the study of weather's effect on road traffic parameters. Transport Problems 15:241−51 doi: 10.21307/tp-2020-063

    [46] Fang W, Cai W, Fan B, Yan J, Zhou T. 2021. Kalman-LSTM model for short-term traffic flow forecasting. 2021 IEEE 5th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), Chongqing, China, 12−14 March 2021. USA: IEEE. pp. 1604−8 doi: 10.1109/IAEAC50856.2021.9390991
    [47] Jiang W, Luo J. 2022. Graph neural network for traffic forecasting: a survey. Expert Systems with Applications 207:117921 doi: 10.1016/j.eswa.2022.117921

    [48] Chen X, Lu J, Zhao J, Qu Z, Yang Y, et al. 2020. Traffic flow prediction at varied time scales via ensemble empirical mode decomposition and artificial neural network. Sustainability 12:3678 doi: 10.3390/su12093678

    [49] Li Y, Yu R, Shahabi C, Liu Y. 2017. Diffusion convolutional recurrent neural network: data-driven traffic forecasting. arXiv 1707.01926 doi: 10.48550/arXiv.1707.01926

    [50] Yu B, Yin H, Zhu Z. 2017. Spatio-temporal graph convolutional networks: a deep learning framework for traffic forecasting. Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence. California: International Joint Conferences on Artificial Intelligence (IJCAI). pp. 3634−40 doi: 10.24963/ijcai.2018/505
    [51] Wu Z, Pan S, Long G, Jiang J, Zhang C. 2019. Graph wavenet for deep spatial-temporal graph modeling. Proceedings of the 28th International Joint Conference on Artificial Intelligence, Macao, China 2019. Macao, China: AAAI Press. pp. 1907−13 https://dl.acm.org/doi/abs/10.5555/3367243.3367303
    [52] Song C, Lin Y, Guo S, Wan H. 2020. Spatial-temporal synchronous graph convolutional networks: A new framework for spatial-temporal network data forecasting. Proceedings of the AAAI Conference on Artificial Intelligence 34:914−21 doi: 10.1609/aaai.v34i01.5438

    [53] Tian C, Chan WK. 2021. Spatial-temporal attention wavenet: a deep learning framework for traffic prediction considering spatial-temporal dependencies. IET Intelligent Transport Systems 15:549−61 doi: 10.1049/itr2.12044

    [54] Ji J, Wang J, Huang C, Wu J, Xu B, et al. 2023. Spatio-temporal self-supervised learning for traffic flow prediction. Proceedings of the AAAI Conference on Artificial Intelligence 37:4356−64 doi: 10.1609/aaai.v37i4.25555

    [55] Jiang R, Wang Z, Yong J, Jeph P, Chen Q, et al. 2023. Spatio-temporal meta-graph learning for traffic forecasting. Proceedings of the AAAI Conference on Artificial Intelligence 37:8078−86 doi: 10.1609/aaai.v37i7.25976

    [56] Shao Z, Zhang Z, Wei W, Wang F, Xu Y, et al. 2022. Decoupled dynamic spatial-temporal graph neural network for traffic forecasting. arXiv 2206.09112 doi: 10.48550/arXiv.2206.09112

    [57] Weng W, Fan J, Wu H, Hu Y, Tian H, et al. 2023. A decomposition dynamic graph convolutional recurrent network for traffic forecasting. Pattern Recognition 142:109670 doi: 10.1016/j.patcog.2023.109670

    [58] Fan J, Weng W, Tian H, Wu H, Zhu F, et al. 2024. RGDAN: a random graph diffusion attention network for traffic prediction. Neural Networks 172:106093 doi: 10.1016/j.neunet.2023.106093

    [59] Fang Y, Qin Y, Luo H, Zhao F, Xu B, et al. 2023. When spatio-temporal meet wavelets: disentangled traffic forecasting via efficient spectral graph attention networks. 2023 IEEE 39th International Conference on Data Engineering (ICDE), Anaheim, CA, USA, 3−7 April 2023. USA: IEEE. pp. 517−29 doi: 10.1109/ICDE55515.2023.00046
    [60] Liu H, Dong Z, Jiang R, Deng J, Deng J, et al.2023. Spatio-temporal adaptive embedding makes vanilla transformer SOTA for traffic forecasting. CIKM '23: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management. Birmingham United Kingdom, October 21−25, 2023. New York, United States: Association for Computing Machinery. pp. 4125−29 doi: 10.1145/3583780.361516
    [61] Jiang J, Han C, Zhao WX, Wang J. 2023. PDFormer: propagation delay-aware dynamic long-range transformer for traffic flow prediction. Proceedings of the AAAI Conference on Artificial Intelligence 37:4365−73 doi: 10.1609/aaai.v37i4.25556

    [62] Shao Z, Zhang Z, Wang F, Xu Y. 2022. Pre-training enhanced spatial-temporal graph neural network for multivariate time series forecasting. KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. Washington DC, USA, August 14−18, 2022. New York, United States: Association for Computing Machinery. pp. 1567−77 doi: 10.1145/3534678.3539396
    [63] Gao H, Jiang R, Dong Z, Deng J, Song X. 2023. Spatio-temporal-decoupled masked pre-training for traffic forecasting. arXiv 2312.00516 doi: 10.48550/arXiv.2312.00516

    [64] Choi J, Choi H, Hwang J, Park N. 2022. Graph neural controlled differential equations for traffic forecasting. Proceedings of the AAAI Conference on Artificial Intelligence 36:6367−74 doi: 10.1609/aaai.v36i6.20587

    [65] Choi J, Park N. 2023. Graph neural rough differential equations for traffic forecasting. ACM Transactions on Intelligent Systems and Technology 14:1−27 doi: 10.1145/3604808

    [66] Ye J, Liu Z, Du B, Sun L, Li W, et al. 2022. Learning the evolutionary and multi-scale graph structure for multivariate time series forecasting. KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. Washington DC, USA, August 14−18, 2022. New York, United States: Association for Computing Machinery. pp. 2296−306 doi: 10.1145/3534678.3539274
    [67] Paszke A, Gross S, Massa F, Lerer A, Bradbury J, et al. 2019. PyTorch: an imperative style, high-performance deep learning library. Proceedings of the 33rd International Conference on Neural Information Processing Systems, Vancouver BC, Canada, December 8−14, 2019. Red Hook, NY, United States: Curran Associates Inc. pp. 8026−37 doi: 10.5555/3454287.3455008
    [68] Dougherty MS, Kirby HR, Boyle RD. 1993. The use of neural networks to recognise and predict traffic congestion. Traffic Engineering & Control 34:311−14

    [69] Vlahogianni EI, Karlaftis MG, Golias JC. 2005. Optimized and meta-optimized neural networks for short-term traffic flow prediction: a genetic approach. Transportation Research Part C: Emerging Technologies 13:211−34 doi: 10.1016/j.trc.2005.04.007

    [70] Zheng W, Lee DH, Shi Q. 2006. Short-term freeway traffic flow prediction: bayesian combined neural network approach. Journal of Transportation Engineering 132:114−21 doi: 10.1061/(ASCE)0733-947X(2006)132:2(114)

    [71] Chan KY, Dillon TS, Singh J, Chang E. 2012. Neural-network-based models for short-term traffic flow forecasting using a hybrid exponential smoothing and Levenberg–Marquardt algorithm. IEEE Transactions on Intelligent Transportation Systems 13:644−54 doi: 10.1109/TITS.2011.2174051

    [72] Davis GA, Nihan NL. 1991. Nonparametric regression and short-term freeway traffic forecasting. Journal of Transportation Engineering 117:178−88 doi: 10.1061/(ASCE)0733-947X(1991)117:2(178)

    [73] Cai P, Wang Y, Lu G, Chen P, Ding C, et al. 2016. A spatiotemporal correlative k-nearest neighbor model for short-term traffic multistep forecasting. Transportation Research Part C: Emerging Technologies 62:21−34 doi: 10.1016/j.trc.2015.11.002

    [74] Castro-Neto M, Jeong YS, Jeong MK, Han LD. 2009. Online-SVR for short-term traffic flow prediction under typical and atypical traffic conditions. Expert Systems with Applications 36:6164−73 doi: 10.1016/j.eswa.2008.07.069

    [75] Su H, Zhang L, Yu S. 2007. Short-term traffic flow prediction based on incremental support vector regression. Third International Conference on Natural Computation (ICNC 2007), Haikou, China, 24−27 August 2007. USA: IEEE. pp. 640−45 doi: 10.1109/ICNC.2007.661
    [76] Sengupta S, Basak S, Saikia P, Paul S, Tsalavoutis V, et al. 2020. A review of deep learning with special emphasis on architectures, applications and recent trends. Knowledge-Based Systems 194:105596 doi: 10.1016/j.knosys.2020.105596

    [77] Guo J, Liu Y, Yang Q, Wang Y, Fang S. 2021. GPS-based citywide traffic congestion forecasting using CNN-RNN and C3D hybrid model. Transportmetrica A: Transport Science 17:190−211 doi: 10.1080/23249935.2020.1745927

    [78] Guo S, Lin Y, Li S, Chen Z, Wan H. 2019. Deep spatial–temporal 3D convolutional neural networks for traffic data forecasting. IEEE Transactions on Intelligent Transportation Systems 20:3913−26 doi: 10.1109/TITS.2019.2906365

    [79] Bao Y, Huang J, Shen Q, Cao Y, Ding W, et al. 2023. Spatial–temporal complex graph convolution network for traffic flow prediction. Engineering Applications of Artificial Intelligence 121:106044 doi: 10.1016/j.engappai.2023.106044

    [80] Rahmani S, Baghbani A, Bouguila N, Patterson Z. 2023. Graph neural networks for intelligent transportation systems: a survey. IEEE Transactions on Intelligent Transportation Systems 24:8846−85 doi: 10.1109/TITS.2023.3257759

    [81] Guo K, Hu Y, Sun Y, Qian S, Gao J, et al. 2021. Hierarchical graph convolution network for traffic forecasting. Proceedings of the AAAI Conference on Artificial Intelligence 35:151−59 doi: 10.1609/aaai.v35i1.16088

    [82] Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, et al. 2017. Attention is all you need. NIPS'17: Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, California, USA, December 4−9, 2017. Red Hook, NY, United States: Curran Associates Inc. pp. 6000−10 doi: 10.5555/3295222.3295349
    [83] Liu Z, Zheng G, Yu Y. 2023. Cross-city few-shot traffic forecasting via traffic pattern bank. CIKM '23: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management. Birmingham United Kingdom October 21−25, 2023. New York, United States: Association for Computing Machinery. pp. 1451−60 doi: 10.1145/3583780.361482
    [84] Sun M, Ding W, Zhang T, Liu Z, Xing M, et al. 2023. STDA-Meta: a meta-learning framework for few-shot traffic prediction. 2023 IEEE 29th International Conference on Parallel and Distributed Systems (ICPADS). Ocean Flower Island, China, 17−21 December 2023. USA: IEEE. pp. 534−41 doi: 10.1109/ICPADS60453.2023.00085
    [85] Wang Y, Yu C, Hou J, Chu S, Zhang Y, Zhu Y. 2022. ARIMA model and few-shot learning for vehicle speed time series analysis and prediction. Computational Intelligence and Neuroscience 2022:2526821 doi: 10.1155/2022/2526821

    [86] Yuan Y, Shao C, Ding J, Jin D, Li Y. 2024. Spatio-temporal few-shot learning via diffusive neural network generation. Proc. The Twelfth International Conference on Learning Representations, Vienna, Austria, 2024. Austria: ICLR. pp. 1−28 https://openreview.net/forum?id=QyFm3D3Tzi
    [87] Song Y, Wang T, Cai P, Mondal SK, Sahoo JP. 2023. A comprehensive survey of few-shot learning: evolution, applications, challenges, and opportunities. ACM Computing Surveys 55:1−40 doi: 10.1145/3582688

    [88] Wang Y, Yao Q, Kwok JT, Ni LM. 2020. Generalizing from a few examples: a survey on few-shot learning. ACM Computing Surveys (csur) 53:1−34 doi: 10.1145/3386252

    [89] Iman M, Arabnia HR, Rasheed K. 2023. A review of deep transfer learning and recent advancements. Technologies 11:40 doi: 10.3390/technologies11020040

    [90] Chen K, Liang Y, Han J, Feng S, Zhu M, et al. 2024. Semantic-fused multi-granularity cross-city traffic prediction. Transportation Research Part C: Emerging Technologies 162:104604 doi: 10.1016/j.trc.2024.104604

    [91] Ouyang X, Yang Y, Zhou W, Zhang Y, Wang H, et al. 2024. CityTrans: domain-adversarial training with knowledge transfer for spatio-temporal prediction across cities. IEEE Transactions on Knowledge and Data Engineering 36:62−76 doi: 10.1109/TKDE.2023.3283520

    [92] Li J, Liao C, Hu S, Chen X, Lee DH. 2024. Physics-guided multi-source transfer learning for network-scale traffic flow prediction. IEEE Transactions on Intelligent Transportation Systems 25:17533−46 doi: 10.1109/TITS.2024.3405970

    [93] Li K, Bai W, Huang S, Tan G, Zhou T, et al. 2024. Lag-related noise shrinkage stacked LSTM network for short-term traffic flow forecasting. IET Intelligent Transport Systems 18:244−57 doi: 10.1049/itr2.12448

    [94] Adetiloye T, Awasthi A. 2019. Multimodal big data fusion for traffic congestion prediction. Multimodal Analytics for Next-Generation Big Data Technologies and Applications, eds. Seng K, Ang LM, Liew AC, Gao J. Cham: Springer. pp. 319−35 doi: 10.1007/978-3-319-97598-6_13
    [95] Zhao J, Xie X, Xu X, Sun S. 2017. Multi-view learning overview: Recent progress and new challenges. Information Fusion 38:43−54 doi: 10.1016/j.inffus.2017.02.007

    [96] Huang X, Ye Y, Yang X, Xiong L. 2023. Multi-view dynamic graph convolution neural network for traffic flow prediction. Expert Systems with Applications 222:119779 doi: 10.1016/j.eswa.2023.119779

    [97] Du S, Li T, Gong X, Horng S-J. 2020. A hybrid method for traffic flow forecasting using multimodal deep learning. International journal of computational intelligence systems 13:85−97 doi: 10.2991/ijcis.d.200120.001

    [98] Mohammad AA, Al Nawaiseh HM, Alhajyaseen WK, Dias C, Mehran B. 2023. Lane-based analysis of the saturation flow rate considering traffic composition. Transportation Planning and Technology 46:653−71 doi: 10.1080/03081060.2023.2214144

    [99] Bhaumik KK, Niloy FF, Mahmud S, Woo SS. 2024. STLGRU: spatio-temporal lightweight graph GRU for traffic flow prediction. Advances in Knowledge Discovery and Data Mining. PAKDD 2024. Lecture Notes in Computer Science, eds. Yang DN, Xie X, Tseng VS, Pei J, Huang JW, Lin JCW. Singapore: Springer. pp. 288−99 doi: 10.1007/978-981-97-2266-2_23
    [100] Liu X, Xia Y, Liang Y, Hu J, Wang Y, et al. 2024. LargeST: a benchmark dataset for large-scale traffic forecasting. Proceedings of the 37th International Conference on Neural Information Processing Systems, New Orleans, LA, USA, December 10−16, 2023. Red Hook, NY, USA: Curran Associates Inc. pp. 75354−71 doi: 10.5555/3666122.3669415
    [101] Wang J, Jiang J, Jiang W, Li C, Zhao WX. 2021. LibCity: an open library for traffic prediction. SIGSPATIAL '21: Proceedings of the 29th International Conference on Advances in Geographic Information Systems, Beijing China, November 2−5, 2021. New York, United States: Association for Computing Machinery. pp. 145−48 doi: 10.1145/3474717.3483923
    [102] Shao Z, Wang F, Xu Y, Wei W, Yu C, et al. 2024. Exploring progress in multivariate time series forecasting: comprehensive benchmarking and heterogeneity analysis. IEEE Transactions on Knowledge and Data Engineering 37:291−305 doi: 10.1109/TKDE.2024.3484454

  • Cite this article

    Zhang H, Lin Z, Zhou J, Sun J, Zhou T, et al. 2025. A comprehensive review of traffic flow prediction: from traditional models to deep learning architectures. Digital Transportation and Safety 4(4): 281−297 doi: 10.48130/dts-0025-0027

Figures(20)  /  Tables(5)



Digital Transportation and Safety 2025, 4(4): 281−297  |  Cite this article

Abstract: Traffic congestion has become an increasingly serious problem in cities. Intelligent transportation systems (ITS) have been introduced to alleviate it, and traffic flow prediction, an important component of ITS, has become a research hotspot in recent years. This study surveys machine learning techniques for traffic flow prediction. First, it systematically introduces the task of traffic flow prediction, including the research background and problem definition. Then, it reviews twelve benchmark datasets commonly used for traffic flow prediction and describes the data processing steps in detail. Next, it highlights the theoretical foundations and significant impact of Graph Convolutional Networks (GCNs) for traffic flow modeling. Finally, it provides an in-depth analysis of the opportunities and challenges in this field and offers suggestions for future research. Overall, this study can help researchers get started quickly with traffic flow prediction.

    • With the rapid advancement of technology, urbanization has accelerated, significantly increasing the urban population and placing substantial pressure on urban traffic management[1,2]. The development of a city cannot be separated from efficient and intelligent traffic management. For this purpose, intelligent transportation systems (ITS) were developed and have been widely adopted around the world[3,4]. With the rapid advancement and widespread deployment of ITS, the need for short-term traffic flow prediction has become greater and more urgent[5].

      Traffic flow prediction is a complex task: it aims to predict future traffic states from historical traffic states. Accurate traffic flow forecasting is essential for many intelligent transportation systems[6]. It helps traffic managers implement interventions to alleviate congestion[7], such as signal control and manual traffic direction. Timely and reliable traffic flow information plays a remarkable role in alleviating traffic congestion, improving traffic operation efficiency, and supporting travel decision-making[8].
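The prediction task described above is commonly formalized as a sequence-to-sequence forecasting problem. The notation below is a standard formulation from the literature, not taken verbatim from this review: given the readings of $N$ road sensors with $C$ channels over the past $T$ steps, find the mapping $f$ that best predicts the next $T'$ steps.

```latex
% X_t \in \mathbb{R}^{N \times C}: readings of N sensors, C channels, at step t.
% \mathcal{G}: the road-network graph; \mathcal{L}: a loss such as MAE or RMSE.
f^{*} \;=\; \arg\min_{f}\;
  \mathcal{L}\Bigl( f\bigl(X_{t-T+1}, \ldots, X_{t};\, \mathcal{G}\bigr),\;
                    \bigl[X_{t+1}, \ldots, X_{t+T'}\bigr] \Bigr)
```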

      In addition to providing assistance in traffic management, traffic flow prediction also contributes to infrastructure development and resource allocation in cities[9]. The government can adjust urban development plans based on traffic flow during specific periods. The development of new roads and residential areas can optimize the utilization of road resources and help address the imbalance between the supply and demand of road capacity and travel demand.

      Generally, the prevalent statistical approaches to traffic flow forecasting can be divided into parametric and non-parametric methods[10]. Traditional traffic flow prediction methods use parametric models to fit historical data and output predictions. However, forecasting traffic flow accurately and in real time is highly challenging because of the complex spatiotemporal dependencies inherent in traffic flow[11,12], and parametric models underperform precisely because they cannot capture these spatial and temporal correlations. Early parametric models, such as Historical Average (HA)[13], Autoregressive Integrated Moving Average (ARIMA)[14], and Vector Autoregression (VAR)[15], provided the foundation for traffic prediction.
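Of these early baselines, Historical Average is simple enough to sketch in a few lines: each future step is predicted as the mean of all past observations at the same position in a fixed cycle (e.g. the same time of day). The NumPy snippet below is an illustrative implementation; the function and variable names are ours, not from any cited work.

```python
import numpy as np

def historical_average(flow, period, horizon):
    """Historical Average (HA) baseline.

    flow    : 1-D array of past traffic flow at fixed sampling intervals
    period  : cycle length in samples (e.g. 288 for 5-min data, daily cycle)
    horizon : number of future steps to predict
    """
    flow = np.asarray(flow, dtype=float)
    preds = []
    for h in range(horizon):
        # Phase (position within the cycle) of the step being predicted.
        phase = (len(flow) + h) % period
        # Average every past sample observed at that same phase.
        preds.append(flow[phase::period].mean())
    return np.array(preds)

# Two identical "days" of 4 samples each -> the next day is predicted exactly.
history = np.array([10.0, 20, 30, 40, 10, 20, 30, 40])
print(historical_average(history, period=4, horizon=4))  # [10. 20. 30. 40.]
```

HA ignores trends and recent fluctuations entirely, which is exactly the limitation the more expressive parametric and machine learning models discussed below try to overcome.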

      In recent years, with the rise of machine learning, many traffic flow prediction models have been developed based on machine learning. Machine learning models are capable of capturing complex spatiotemporal dependencies from massive traffic flow data to provide more accurate predictions. Additionally, data-driven methods do not require prior knowledge of the underlying system dynamics or equations, making them flexible and adaptable to different datasets and scenarios[16].

      Classical machine learning models, such as linear regression, decision trees, and support vector machines, have been widely used for traffic flow prediction. However, traffic flow sequences are easily affected by external factors, such as unexpected accidents or manual interventions in traffic control[17], and classical machine learning models cannot capture complex spatiotemporal dependencies[18].

      Deep learning approaches are not limited by the stationarity assumption and achieve better performance in time series prediction[19,20]. Some modern deep neural networks, such as Recurrent Neural Networks (RNN)[21−23], Long Short-Term Memory Networks (LSTM)[24−26], and Convolutional Neural Networks (CNN)[27−29], have achieved outstanding results in traffic flow prediction[30]. These deep neural networks can extract complex spatiotemporal features from traffic flow data. They capture both temporal and spatial dependencies of traffic flow and output more accurate predictions. However, approaches based on CNNs and RNNs can only handle standard gridded data and neglect non-Euclidean correlations generated by complex road networks[31].

      Graph Neural Networks (GNNs) are a recent research hotspot. Graphs can represent many real-life structures, such as social networks and knowledge graphs. The traffic network is also a 'natural graph'[32,33]. In deep learning, graphs can be used to represent complex road network relationships, thus capturing the spatial dependencies of traffic flow. Graph Convolutional Networks (GCNs), when integrated with RNNs or gated CNNs, achieve better performance in spatial and temporal modeling[34].

      Despite the significant progress of deep learning in traffic flow prediction, there are still some challenges. For example, challenges remain in terms of model efficiency, transferability, abnormal event handling, and interpretability. This also brings many opportunities for future research.

      Traffic congestion occurs all over the world, and traffic flow prediction is needed in many regions. However, these regions have different levels of economic development. A deep learning network requires a large number of parameters and long training times[35]. Some regions may not be able to provide enough computational power to train the model. Therefore, it makes sense to develop lightweight but effective traffic flow prediction models. Such models can be extended to economically underdeveloped regions to provide effective traffic flow prediction services, thus improving local traffic conditions.

      In addition, due to regional economic conditions, certain regions may not be able to provide sufficient infrastructure to capture traffic flow data. Therefore, traffic flow data is lacking in these regions. The deficiency and low-quality training data may cause these models to fall into local minima[36]. Meanwhile, the traffic conditions in each region are influenced by local customs, culture, and road construction, and a traffic flow prediction model trained in a different region cannot be well adapted to the local traffic conditions. Therefore, it is necessary to develop an effective pre-training model and a domain-adaptive fine-tuning strategy.

      While existing deep learning models show good prediction performance, their parameters still lack interpretability. As an auxiliary system for traffic management authorities to make decisions, traffic flow prediction models need to be sufficiently interpretable to make their predictions more trustworthy[37].

      It is evident from the literature that there have been various surveys on traffic flow prediction. While most of these articles have investigated the role of machine learning (ML) in predicting traffic flow, the focus of this study differs from that of existing surveys.

      ● Greater attention is paid to open-source traffic flow prediction models that can be reproduced. Meanwhile, 15 publicly available datasets are reviewed, and details on how they can be used are provided, so that beginners can get started quickly.

      ● Although GNNs are used by most traffic flow prediction models, this review goes beyond GNNs to include new models from recent years. Examples include the Transformer model, pre-training models, and differential equation models.

      ● The future directions proposed are not limited to designing more complex models, but are more oriented towards real-world problems—for example, large-scale deployment, generalization capability of models, and multi-factor combination capability.

    • Here, the traffic flow prediction task is first described. The traffic flow prediction task aims to forecast future traffic conditions based on historical traffic flow data, taking into account a variety of external factors, such as weather and time of day[38]. In general, three common metrics for traffic flow prediction are traffic flow, traffic speed, and traffic density[33]. The choice of metric depends on the dataset employed.

    • Traffic flow rate is the total number of vehicles passing a given point during a specific time period[39]. It measures how many vehicles pass through a given point on the roadway and is critical to understanding the overall usage of the roadway network and identifying potential congestion points. Traffic speed is the average speed at which vehicles are detected traveling at a target location during the same time period[40]. Traffic speed indicates road conditions: high speed implies smooth traffic, while low speed implies congestion. Traffic density is the number of vehicles per unit length at a given time[41]. Higher traffic densities generally result in slower speeds and a higher likelihood of traffic jams, while lower densities indicate smoother traffic flow.

      Traffic flow, traffic speed, and traffic density are all important metrics for characterizing road network performance[42]. In practice, it is common to use one of the three metrics for forecasting purposes due to data availability. For example, some datasets may provide detailed speed and flow information but lack density data, while others may provide all three metrics. The choice of metric usually depends on the specific requirements of the prediction task and data availability.

      It is important to note that many data mining methods have been proposed due to increased data availability. Traffic flow prediction uses not only the metrics to be predicted, but also additional data. The most common type of additional data is periodic embedding[43]. Periodic embeddings are designed by extracting timestamps from traffic flow datasets. Common periodic embeddings include 'time of day', which represents the relative position of the current time within a day, and 'day of week', which represents the relative position of the current day within a week[44]. Periodic embedding allows the traffic flow prediction model to learn the periodicity of traffic flow. In addition, data such as weather and holidays[45] can also be incorporated into the traffic flow prediction model as embeddings.
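      As an illustration of how periodic embeddings can be derived from timestamps, the following sketch computes 'time of day' and 'day of week' features with pandas. The 5-min sample rate and the start date are hypothetical choices, not tied to any particular dataset:

```python
import pandas as pd

# Hypothetical example: build "time of day" and "day of week" embeddings
# for a dataset sampled every 5 minutes (288 steps per day).
timestamps = pd.date_range("2018-01-01", periods=288 * 7, freq="5min")

# Relative position of the current time within a day, in [0, 1).
time_of_day = (timestamps.hour * 60 + timestamps.minute) / (24 * 60)

# Day of the week as an integer in [0, 6] (Monday = 0).
day_of_week = timestamps.dayofweek

print(time_of_day[:3])
print(day_of_week[288])  # first slot of the second day
```

These two feature vectors can then be concatenated with the traffic readings (or fed through learnable embedding tables) so the model can exploit daily and weekly periodicity.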

      Traffic flow prediction, as a key tool for measuring road traffic, can be grouped into coarse-grained and fine-grained types, which differ mainly in their applications to traffic planning and strategic decision-making. Fine-grained traffic flow prediction uses current and past traffic flow data, together with time series analysis methods, to predict future traffic flow over short-term horizons such as seconds, minutes, or half an hour[46].

      Formally, the traffic prediction problem can be stated as follows: given a series of historical traffic data points, the objective is to predict traffic conditions at one or more future time steps[47]. This involves creating a predictive model that can accurately capture the temporal dependencies and patterns in the traffic data. The input to this model typically includes a sequence of traffic observations over a specified period, and the output is the forecasted traffic condition at a future time step. Thus, the traffic flow prediction task can be defined as follows:

      Given the current time t, the prediction and history input windows are defined as follows: the prediction window of T' time steps is [t+1, t+2, ..., t+T'], and the history input window of T time steps is [t−T+1, t−T+2, ..., t]. The ground truth of traffic flow is denoted as X, partitioned into historical data $ X=[{X}_{t-T+1},{X}_{t-T+2},\cdots ,{X}_{t}] $ and labels $ Y=[{Y}_{t+1},{Y}_{t+2},\cdots ,{Y}_{t+{T}^{{'}}}] $. The predicted output is denoted as $ \hat{Y}=[{\hat{Y}}_{t+1},{\hat{Y}}_{t+2},\cdots ,{\hat{Y}}_{t+{T}^{{'}}}] $. Assuming that the machine learning model is a function f, the traffic flow prediction task can be defined as Eq. (1):

      $ \hat{Y}=f\left(\left[{X}_{t-T+1},{X}_{t-T+2},\dots ,{X}_{t}\right]\right)$ (1)

      The goal of the traffic flow prediction task is to design a model that takes historical data X as input and produces a set of predictions $ \hat{Y} $ close to the labels Y. Machine learning uses a loss function L to measure the difference between Y and $ \hat{Y} $. Formally, the goal of machine learning for traffic flow prediction is to find a set of trainable parameters θ* that minimizes the loss function L, as defined in Eq. (2):

      ${\theta }^{\mathrm{*}}=\mathrm{arg}\underset{\theta }{min}\; L\left(Y,\hat{Y};\theta \right) $ (2)
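      The optimization objective in Eq. (2) can be illustrated with a minimal sketch. Here a linear predictor, a deliberately simple stand-in for the function f (not one of the models reviewed here), is fitted by subgradient descent on the MAE loss:

```python
import numpy as np

# Illustrative sketch of the objective in Eq. (2): find parameters theta that
# minimize the MAE between predictions X @ theta and labels Y.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))            # 200 historical samples, 3 features
true_theta = np.array([0.5, -1.0, 2.0])  # synthetic ground-truth parameters
Y = X @ true_theta

theta = np.zeros(3)                      # trainable parameters
lr = 0.05
for _ in range(500):
    residual = X @ theta - Y
    grad = X.T @ np.sign(residual) / len(Y)  # subgradient of MAE w.r.t. theta
    theta -= lr * grad

final_mae = np.mean(np.abs(X @ theta - Y))
print(final_mae)
```

In practice f is a deep network and the minimization is carried out by stochastic gradient methods over mini-batches, but the objective has exactly this shape.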
    • After model training is complete, three metrics are typically used to evaluate predictive performance: Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and Mean Absolute Percentage Error (MAPE)[48]. The MAE is usually also used as the model's loss function. The three metrics are given in Eqs (3)−(5):

      $ {\mathrm{MAE}}=\dfrac{1}{n}\sum _{i=1}^{n} \left|{y}_{i}-{\hat{y}}_{i}\right| $ (3)
      $ {\mathrm{RMSE}}=\sqrt{\dfrac{1}{n}\sum _{i=1}^{n} {\left({y}_{i}-{\hat{y}}_{i}\right)}^{2}} $ (4)
      $ {\mathrm{MAPE}}=\dfrac{1}{n}\sum _{i=1}^{n} \left|\dfrac{{\hat{y}}_{i}-{y}_{i}}{{y}_{i}}\right|\times 100{\text{%}}$ (5)
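      A direct implementation of Eqs (3)−(5) might look as follows. The zero-masking in MAPE is a common convention in traffic benchmarks to avoid division by zero, not part of the equation itself:

```python
import numpy as np

# Minimal sketch of the three evaluation metrics in Eqs (3)-(5).
def mae(y, y_hat):
    return np.mean(np.abs(y - y_hat))

def rmse(y, y_hat):
    return np.sqrt(np.mean((y - y_hat) ** 2))

def mape(y, y_hat):
    # Zero observations are excluded, a common convention in traffic benchmarks.
    mask = y != 0
    return np.mean(np.abs((y_hat[mask] - y[mask]) / y[mask])) * 100

y = np.array([100.0, 200.0, 400.0])
y_hat = np.array([110.0, 190.0, 380.0])
print(mae(y, y_hat), rmse(y, y_hat), mape(y, y_hat))
```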
    • Table 1 shows a list of machine learning models for traffic flow prediction collected in this review. Specific details of these models, as published in the papers, are discussed in section Machine Learning for Traffic Flow Prediction, including research gaps and innovations.

      Table 1.  Model list.

      | Model | Tag | Source | Year |
      | --- | --- | --- | --- |
      | DCRNN[49] | RNN/GNN | ICLR | 2018 |
      | STGCN[50] | CNN/GCN | IJCAI | 2018 |
      | GraphWaveNet[51] | TCN/GCN | IJCAI | 2019 |
      | STSGCN[52] | GCN | AAAI | 2020 |
      | STAWnet[53] | TCN | IET Intelligent Transport Systems | 2021 |
      | ST-SSL[54] | GCN | AAAI | 2023 |
      | MegaCRN[55] | RNN/GCN | AAAI | 2023 |
      | D2STGNN[56] | CNN/GCN | VLDB | 2022 |
      | DDGCRN[57] | GCN | Pattern Recognition | 2023 |
      | RGDAN[58] | GAT | Neural Networks | 2024 |
      | STWave[59] | GAT | ICDE | 2023 |
      | STAEformer[60] | Transformer | CIKM | 2023 |
      | PDFormer[61] | Transformer | AAAI | 2023 |
      | STEP[62] | Pre-training | SIGKDD | 2022 |
      | STDMAE[63] | Pre-training | IJCAI | 2024 |
      | STG-NCDE[64] | Differential equations | AAAI | 2022 |
      | STG-NRDE[65] | Differential equations | ACM Transactions on Intelligent Systems and Technology | 2023 |
    • In this section, the 15 datasets used in the collected papers are summarized. For each dataset, the following information is provided: metric type, time range, number of sensors, number of road edges, sample rate, and zero value rate. In addition to conventional loop detector data, recent studies[54,66] have increasingly utilized grid-based datasets, commonly used in trajectory prediction tasks. These datasets are commonly employed as benchmarks in experiments. An overview of the datasets is given in Table 2.

      Table 2.  Datasets overview.

      | Dataset | Nodes | Edges | Time range | Sample rate | Zero value rate | Type |
      | --- | --- | --- | --- | --- | --- | --- |
      | METR-LA | 207 | 1,515 | 2012/03/01−2012/06/30 | 5 min | 7.581% | Speed |
      | PEMS-BAY | 325 | 2,369 | 2017/01/01−2017/05/31 | 5 min | 4.906% | Speed |
      | PeMSD7(M) | 228 | 1,664 | 2012/05/01−2012/06/30 | 5 min | 0.10% | Speed |
      | PeMSD7(L) | 1,026 | 14,534 | 2012/05/01−2012/06/30 | 5 min | 0.50% | Speed |
      | PEMS03 | 358 | 546 | 2018/09/01−2018/11/30 | 5 min | 5.102% | Flow |
      | PEMS04 | 307 | 338 | 2018/01/01−2018/02/28 | 5 min | 5.730% | Flow |
      | PEMS07 | 883 | 865 | 2017/01/05−2017/08/06 | 5 min | 5.028% | Flow |
      | PEMS08 | 170 | 276 | 2016/07/01−2016/08/31 | 5 min | 5.070% | Flow |
      | CA | 8,600 | 201,363 | 2017/01/01−2021/12/31 | 15 min | 5.869% | Flow |
      | GLA | 3,834 | 98,703 | 2017/01/01−2021/12/31 | 15 min | 5.712% | Flow |
      | GBA | 2,352 | 61,246 | 2017/01/01−2021/12/31 | 15 min | 5.719% | Flow |
      | SD | 716 | 17,319 | 2017/01/01−2021/12/31 | 15 min | 5.845% | Flow |
      | NYC-taxi | 16×12 | – | 2016/01/01−2016/02/29 | 30 min | 48.63% | Inflow/outflow |
      | NYC-bike | 14×8 | – | 2016/08/01−2016/09/29 | 30 min | 52.55% | Inflow/outflow |
      | SZ-taxi | 156 | 532 | 2015/01/01−2015/01/31 | 15 min | 26.57% | Speed |
    • Most machine learning models are implemented in the TensorFlow[2] or PyTorch[67] frameworks. The datasets are usually distributed as NPZ (NumPy format) or CSV (Comma-Separated Values) files and can be read using Pandas or NumPy. In general, a common dataset shape is (TimeStep, Node), where the first dimension corresponds to time steps and the second dimension corresponds to nodes. Some datasets contain multiple channels, such as flow, speed, and density; such a dataset typically has shape (TimeStep, Node, Channel).
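      For instance, a PEMS04-style array could be inspected as follows. The file name, the `"data"` key, and the channel ordering shown in the comments are assumptions that vary between dataset releases:

```python
import numpy as np

# Hypothetical example of the common array layout (TimeStep, Node, Channel).
# A real file would be loaded with something like np.load("pems04.npz")["data"];
# here a zero-filled placeholder of the same shape is used instead.
data = np.zeros((16992, 307, 3))   # e.g. PEMS04: 59 days x 288 steps, 307 nodes

time_steps, num_nodes, num_channels = data.shape
flow = data[..., 0]                # the first channel is often the flow metric
print(time_steps, num_nodes, num_channels, flow.shape)
```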

      To generate the inputs and labels for the traffic flow prediction model, a pointer t is first initialized. Depending on the history window T and prediction window T', the pointer t traverses the first dimension (time steps) over the range [T, End − T'], where End is the total number of time steps. At each position, the traffic flow data in [t−T+1, t] is used as input, and the traffic flow data in [t+1, t+T'] is used as the corresponding label. After generating all the input-label pairs, the dataset is generally split into training, validation, and test sets in a 6:2:2 ratio. The dataloader fetches a batch of data pairs at a time according to the configuration, and the training set is shuffled during fetching. Finally, the data is fed into the model for training. The shape of a batch is generally [B, N, T, C], where B is the batch size, N is the number of nodes, T is the length of the time series, and C is the number of channels. Different models may use different processing formats and split ratios, depending on the specific implementation. The specific data preprocessing process is shown in Fig. 1.

      Figure 1. 

      Data preprocessing steps.
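      The sliding-window step of this pipeline can be sketched as follows, with hypothetical window sizes T = T' = 12 applied to a toy univariate series:

```python
import numpy as np

# Sliding-window generation of input-label pairs, followed by a 6:2:2 split.
data = np.arange(100, dtype=float).reshape(100, 1)  # toy (TimeStep, Node) series
T, T_pred = 12, 12                                  # history and prediction windows

inputs, labels = [], []
for t in range(T, len(data) - T_pred + 1):
    inputs.append(data[t - T:t])        # history window [t-T+1, t]
    labels.append(data[t:t + T_pred])   # prediction window [t+1, t+T']
inputs, labels = np.stack(inputs), np.stack(labels)

# Chronological 6:2:2 split into training/validation/test sets.
n = len(inputs)
n_train, n_val = int(n * 0.6), int(n * 0.2)
train_x, val_x, test_x = np.split(inputs, [n_train, n_train + n_val])
print(inputs.shape, labels.shape, train_x.shape)
```

A real dataloader would additionally shuffle the training pairs and stack them into [B, N, T, C] batches, but the pairing logic is exactly the loop above.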

    • Dougherty et al.[68] was one of the first efforts to use neural networks for traffic flow prediction. Vlahogianni et al.[69] used genetic algorithms (GA) to tune the neural networks. Zheng et al.[70] combined conditional probability and Bayesian estimation and used multiple neural network predictors. Chan et al.[71] employed the hybrid exponential smoothing method and the Levenberg-Marquardt (LM) algorithm to process traffic flow data, aiming to improve the generalization capabilities of the model; this opened up opportunities for transfer learning across models. Davis & Nihan[72] proposed a k-nearest neighbor (K-NN) approach to predict traffic flow. Cai et al.[73] updated the original K-NN model for traffic flow prediction, replacing the time series in K-NN with a spatiotemporal state matrix.

      Moreover, models based on Support Vector Regression (SVR) have also been used for traffic flow prediction. Castro-Neto et al.[74] proposed an online support vector machine for regression (OL-SVR). Su et al.[75] proposed an incremental support vector regression (ISVR) model, which uses an incremental learning approach to update the prediction function in real time.

      Over the past two decades, these classic machine learning models have exhibited stronger performance than older statistical models. However, they usually rely on simple structures, so they cannot be trained to capture complex traffic patterns efficiently.

    • About a decade ago, with the fast development of computational power, deep neural networks became more popular and have been applied in many applications[76]. Deep learning can learn complicated features from traffic flow data, and deep learning networks come in many different structures. For instance, RNNs can effectively capture the temporal dimension of traffic information, while CNNs can capture the spatial dimension[77]. Compared with classical machine learning models, deep neural network models have several advantages, but they also have some disadvantages. These deep learning models are generalizable; however, in traffic flow prediction, the strong spatiotemporal dependencies mean that specific models need to be designed to capture these dependencies for better prediction. Before the emergence of GNNs, researchers usually transformed urban road networks into Euclidean adjacency matrices and modeled them by combining convolution operations and residual units, which improved spatiotemporal feature extraction[78].

    • In practice, the sensor nodes on a road are not distributed in a grid pattern but are irregularly distributed across the road network[79]. Recently, many researchers have begun to focus on graph neural networks[80]. The graph structure can well represent the various relationships in the road network[81], so the model's forward propagation better reflects the actual structure of road networks. Graph neural networks overcome the shortcomings of previous studies that ignored the connectivity and globality of the network.

      The basic process of graph neural networks is as follows. Consider a graph G = (V, E) with N nodes, where V represents the nodes and E the edges. An adjacency matrix $ A\in {\mathbb{R}}^{N\times N} $ represents the edges. The traffic flow input for one time step is denoted as $ X\in {\mathbb{R}}^{N\times 1} $. To incorporate each node's own features when multiplying $ A $ and $ X $, an identity matrix (self-loop) is added to the adjacency matrix A: let $ \tilde{\mathrm{A}}=\mathrm{A}+\mathrm{I} $. Next, the degree of each node needs to be considered, because the contribution of each connected node should be inversely proportional to its degree. Let $ L={D}^{-\frac{1}{2}}\tilde{\mathrm{A}}{D}^{-\frac{1}{2}} $, where $ D $ is the degree matrix, a diagonal matrix whose entries record the degree of each node in $ \tilde{\mathrm{A}} $, and $ {D}^{-\frac{1}{2}} $ denotes raising $ D $ to the power of $ -\frac{1}{2} $. $ {D}^{-\frac{1}{2}}\tilde{\mathrm{A}}{D}^{-\frac{1}{2}} $ normalizes the adjacency matrix by the square root of the node degrees along each row and column. The final computation yields the symmetrically normalized adjacency matrix L, commonly (if loosely) referred to as the normalized Laplacian in the GCN literature.

      In forward propagation, the matrix L is multiplied by the concatenation of the input X and the hidden state H, and the weight matrix W is then applied to the result of the graph convolution. The graph convolution can thus be written compactly as Y = L[X$\| $H]W, where $ \parallel $ denotes concatenation. The complete graph convolution operation is given by the following formula:

      $Y={[D}^{-\frac{1}{2}}\tilde{\mathrm{A}}{D}^{-\frac{1}{2}}]\left[\mathrm{X}\parallel \mathrm{H}\right]W$ (6)

      Figure 2 shows the process of graph convolution with a 4×4 adjacency matrix and 4×1 feature vector. Here, the symbol @ represents matrix multiplication.

      Figure 2. 

      Graph convolution.
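      Eq. (6) can be verified numerically on a toy 4-node graph. All values below are illustrative; the hidden state is set to zero and the weights to ones purely for readability:

```python
import numpy as np

# Numerical check of Eq. (6) on a toy 4-node graph.
A = np.array([[0., 1., 0., 0.],
              [1., 0., 1., 1.],
              [0., 1., 0., 0.],
              [0., 1., 0., 0.]])
X = np.array([[1.], [2.], [3.], [4.]])   # one input feature per node
H = np.zeros((4, 1))                     # hidden state (zero for simplicity)
W = np.ones((2, 1))                      # weights over the concatenated channels

A_tilde = A + np.eye(4)                  # add self-loops
deg = A_tilde.sum(axis=1)                # node degrees
D_inv_sqrt = np.diag(1.0 / np.sqrt(deg)) # D^{-1/2}
L = D_inv_sqrt @ A_tilde @ D_inv_sqrt    # symmetrically normalized adjacency

Y = L @ np.concatenate([X, H], axis=1) @ W   # Eq. (6)
print(Y.round(4))
```

Node 0 is connected only to node 1, so its output mixes its own feature (weighted 1/2) with node 1's feature (weighted by the inverse square roots of both degrees), exactly as the normalization prescribes.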

      DCRNN[49], the Diffusion Convolutional Recurrent Neural Network, was one of the first efforts to use graph convolutional networks for traffic prediction. DCRNN uses a diffusion process to model the dynamics of traffic flow and proposes diffusion convolution to capture spatial dependencies. The model leverages recurrent neural networks (RNNs) to capture temporal dependencies. In particular, the matrix multiplication in the Gated Recurrent Unit (GRU) is replaced by graph convolution to capture spatial correlations (Fig. 3).

      Figure 3. 

      DCRNN: diffusion convolution for spatial dependencies, GRU for temporal dependencies.
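      The gate-level substitution can be sketched as follows. This is a simplified graph-convolutional GRU cell illustrating the idea, not DCRNN's actual bidirectional diffusion convolution; the identity graph and random weights are placeholders:

```python
import numpy as np

def gconv(L, X, H, W):
    # Graph convolution over the concatenated input and hidden state,
    # used in place of the dense matrix multiplications of a standard GRU.
    return L @ np.concatenate([X, H], axis=1) @ W

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

N, C, Hdim = 4, 1, 2                    # nodes, input channels, hidden units
rng = np.random.default_rng(0)
L = np.eye(N)                           # placeholder "graph" for simplicity
Wr, Wu, Wc = (rng.normal(size=(C + Hdim, Hdim)) for _ in range(3))

X = rng.normal(size=(N, C))             # input at one time step
H = np.zeros((N, Hdim))                 # previous hidden state
r = sigmoid(gconv(L, X, H, Wr))         # reset gate
u = sigmoid(gconv(L, X, H, Wu))         # update gate
c = np.tanh(gconv(L, X, r * H, Wc))     # candidate state
H_new = u * H + (1 - u) * c             # updated hidden state
print(H_new.shape)
```

Running this cell over the time axis, with L replaced by the learned diffusion operators, is the essence of how DCRNN couples spatial and temporal modeling.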

      STGCN[50], the Spatiotemporal Graph Convolutional Network, consists of multiple spatiotemporal convolutional blocks, which alternate between gated sequence convolutional layers and spatial graph convolutional layers. The model employs the graph convolution operator '*G', which is based on spectral graph convolution, filtering the graph signal by the product of the signal and the kernel. The STGCN also employs a gated CNN to extract temporal features, implemented via 1-D causal convolution and gated linear units (GLUs) (Fig. 4).

      Figure 4. 

      STGCN: combines spectral graph convolution for spatial features and gated causal CNN for temporal features.

      Graph WaveNet[51] learns from node embeddings and proposes a novel adaptive dependency matrix. Such a design can accurately capture hidden spatial dependencies in the data. This design addresses the issue that explicit graph structures do not necessarily reflect the true dependencies and may fail to capture actual relationships (Fig. 5).

      Figure 5. 

      Graph WaveNet: adaptive adjacency captures spatial relations, while dilated causal convolution models temporal dynamics.
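      The adaptive dependency matrix can be sketched as a row-softmax over the ReLU-activated product of two node-embedding tables; the random values below are placeholders standing in for parameters that would be learned end to end:

```python
import numpy as np

# Sketch of an adaptive adjacency matrix derived from node embeddings.
rng = np.random.default_rng(0)
N, d = 5, 8                             # nodes, embedding dimension
E1 = rng.normal(size=(N, d))            # source node embeddings (learnable)
E2 = rng.normal(size=(N, d))            # target node embeddings (learnable)

scores = np.maximum(E1 @ E2.T, 0)       # ReLU keeps non-negative interactions
A_adapt = np.exp(scores) / np.exp(scores).sum(axis=1, keepdims=True)  # row softmax
print(A_adapt.sum(axis=1))              # each row is a probability distribution
```

Because the embeddings are trained jointly with the forecasting loss, the resulting matrix can encode dependencies that the physical road graph misses.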

      STSGCN[52] effectively captures complex local correlations in spatiotemporal network data through a well-designed spatiotemporal synchronization graph convolution mechanism, and includes a multi-module layer to account for data heterogeneity, improving prediction accuracy. The model captures local spatiotemporal correlations by constructing a local spatiotemporal graph using the spatiotemporal simultaneous graph convolution module (STSGCM), and models the heterogeneity across different time periods using a spatiotemporal simultaneous graph convolution layer (STSGCL) composed of multiple STSGCMs (Fig. 6).

      Figure 6. 

      STSGCM: captures local spatiotemporal correlations via a spatial-temporal synchronous graph convolution module.

      STAWnet[53] effectively captures the spatiotemporal dependencies in traffic conditions through a combination of temporal convolutions and attention mechanisms. STAWnet does not require a priori knowledge of the graph structure, instead capturing hidden spatial relationships through self-learned node embeddings. This design improves the flexibility of the model and allows it to be easily extended to other spatiotemporal forecasting tasks. STAWnet also incorporates a dynamic attention mechanism, which adjusts the weights of different nodes according to varying traffic conditions and spatial information (Fig. 7).

      Figure 7. 

      STAWnet: self-learned node embeddings capture hidden spatial dependencies, while temporal convolution and dynamic attention capture temporal dynamics.

      ST-SSL[54] integrates temporal and spatial convolutions to effectively encode spatiotemporal traffic patterns and incorporates two self-supervised learning tasks to enhance the main traffic prediction task by capturing spatial and temporal heterogeneity. The innovation of the ST-SSL framework lies in its adaptive augmentation of traffic flow graph data at both the attribute and structure levels. It also introduces a soft clustering paradigm to capture diverse spatial patterns among regions and a temporal self-supervised learning paradigm to maintain dedicated representations of temporal traffic dynamics. These mechanisms enable ST-SSL to effectively overcome the limitations of existing methods in handling spatial and temporal heterogeneity, particularly in scenarios with skewed regional traffic distributions and time-varying traffic patterns (Fig. 8).

      Figure 8. 

      ST-SSL: integrates temporal and spatial convolutions for spatiotemporal patterns, enhanced by self-supervised tasks for spatial-temporal heterogeneity.

      MegaCRN[55] integrates a meta-graph learner into a Graph Convolutional Recurrent Network (GCRN). This meta-graph learner is designed to dynamically generate node embeddings from an underlying meta-node library. Each node embedding is a multifaceted representation that encapsulates the temporal and spatial nuances of traffic dynamics. The meta-node library is a repository of memory items, each represented as a vector capturing the features of a typical traffic pattern. The Mega-Graph Learner queries this library to identify the memory item or prototype most similar to the current traffic state represented by the GCRN hidden state (Fig. 9).

      Figure 9. 

      MegaCRN: meta-graph learner dynamically generates node embeddings capturing spatiotemporal nuances, integrated with a GCRN for prediction.

      D2STGNN[56] addresses the limitations of conventional traffic prediction models by decoupling diffuse and intrinsic signals in traffic data. D2STGNN employs a decoupled spatiotemporal framework (DSTF) that integrates a dynamic graph learning module along with separate models for diffuse and intrinsic signals, thereby enhancing the modeling of spatiotemporal correlations (Fig. 10).

      Figure 10. 

      D2STGNN: decouples diffuse and intrinsic signals via a decoupled spatiotemporal framework with dynamic graph learning.

      DDGCRN[57] dynamically generates graphs that capture the spatiotemporal dynamics of traffic flows and distinguishes between normal and abnormal traffic signals for a more nuanced understanding of traffic patterns. It adopts a data-driven approach to generate dynamic graphs from time-varying traffic signals, which are subsequently processed by the Dynamic Graph Convolutional Recursive Module (DGCRM) to extract salient spatiotemporal features. Furthermore, the model employs a segmented learning strategy that enhances training efficiency and reduces computational resource consumption during the initial training phase (Fig. 11).

      Figure 11. 

      DDGCRN: dynamically generates graphs from traffic signals and uses DGCRM to capture spatiotemporal features efficiently.

      RGDAN[58] integrates the Graph Diffusion Attention module and the Temporal Attention module. It employs a stochastically initialized Graph Attention Network (GAT) that does not rely on predefined node interactions, instead learning the attention weights directly from the data. This stochastic GAT, combined with an adaptive matrix, captures both local and global spatial dependencies, thereby providing a more accurate representation of traffic flow patterns. The Temporal Attention module is adept at identifying and learning from temporal patterns, ensuring that the predictive model is sensitive to temporal variations inherent in traffic flow (Fig. 12).

      Figure 12. 

      RGDAN: random GAT with adaptive matrix captures spatial dependencies, while temporal attention captures time-related patterns.

      STWave[59] addresses the challenge of non-stationary traffic time series by adopting a disentangle-and-fuse approach. STWave decouples complex traffic data into stable trends and fluctuating events using the discrete wavelet transform (DWT), thereby mitigating the distribution shift problem. It then employs a dual-channel spatiotemporal network to separately model these components and integrates them for improved future traffic prediction. A key contribution is the introduction of an efficient Spectral Graph Attention Network (ESGAT) with a novel query sampling strategy and graph wavelet-based positional encoding. This design enhances the modeling of dynamic spatial correlations while reducing computational complexity (Fig. 13).

      Figure 13. 

      STWave: decouples traffic series into trends and events with DWT, and uses dual-channel spatiotemporal network and ESGAT for modeling.

    • The Transformer model has revolutionized the field of Natural Language Processing (NLP) since it was proposed by Vaswani et al.[82], particularly excelling in tasks such as machine translation and text summarization. Its core self-attention mechanism is able to capture long-distance dependencies in sequential data, thereby overcoming the limitations of traditional RNNs in processing long sequences. With the continuous advancement of deep learning, the Transformer architecture has gradually been introduced into other domains, including traffic flow prediction.
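      The self-attention mechanism at the core of these models can be sketched in a bare-bones form; real Transformers add learned query/key/value projections, multiple heads, and positional encodings:

```python
import numpy as np

def self_attention(X):
    # Scaled dot-product self-attention over a sequence of T traffic readings.
    d = X.shape[-1]
    scores = X @ X.T / np.sqrt(d)                 # pairwise similarity of steps
    weights = np.exp(scores) / np.exp(scores).sum(axis=1, keepdims=True)
    return weights @ X                            # each step mixes all others

X = np.random.default_rng(0).normal(size=(12, 4))  # 12 time steps, 4 features
out = self_attention(X)
print(out.shape)
```

Every output step is a weighted mixture of all input steps, which is what lets attention relate distant time steps directly instead of propagating information step by step as an RNN does.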

      This subsection focuses on two types of Transformer-based models: the STAEFormer and the PDFormer. Both models have achieved remarkable results in the field of traffic flow prediction and have improved and extended the Transformer architecture from different aspects to better adapt to the characteristics of traffic data.

      STAEFormer[60] leverages a simple yet effective spatiotemporal adaptive embedding technique to enhance the capabilities of vanilla Transformer models, achieving state-of-the-art performance. The key innovation lies in the model's ability to capture the complex spatiotemporal dynamics and chronological information inherent in traffic time series data, which has traditionally been a challenge for forecasting models. This work focuses on improving the representation of the input data rather than complicating the model architecture (Fig. 14).

      Figure 14. 

      STAEFormer: spatiotemporal adaptive embedding enhances Transformer to capture complex spatiotemporal dynamics in traffic data.

      PDFormer[61] introduces a novel propagation delay-aware dynamic long-range Transformer specifically designed to address the complex spatial-temporal dependencies in traffic flow prediction. The model innovatively integrates a spatial self-attention module to capture dynamic spatial relationships, employs graph masking matrices to emphasize both short- and long-range spatial dependencies, and introduces a delay-aware feature transformation module to account for the time delay in traffic condition propagation. This comprehensive approach effectively models the temporal dynamics and spatial heterogeneities in traffic data, leading to improved accuracy and interpretability in traffic flow predictions (Fig. 15).

      Figure 15. 

      PDFormer: spatial self-attention, graph masking, and delay-aware transformation model complex spatiotemporal dependencies.

    • STEP[62] (STGNN Enhanced by a scalable time series Pre-training model) was the first traffic flow prediction model to employ pre-training techniques. The motivation of STEP is rooted in STGNN: although STGNN shows good performance in MTS prediction, it only considers short-term historical MTS data due to model complexity. To leverage long-term historical MTS data for analyzing temporal and spatial patterns, STEP proposes a novel framework. It introduces a pre-training model to efficiently learn segment-level representations from long-term historical MTS data, using these representations as context for STGNN (Fig. 16).

      Figure 16. 

      STEP: Pre-training on long-term traffic sequences provides segment-level representations to enhance STGNN predictions.

      STD-MAE[63], short for Spatial-Temporal-Decoupled Masked Pre-training, shares the same motivation as STEP. The method aims to overcome the limitation of input series length. It employs two decoupled masked autoencoders to reconstruct spatiotemporal series along the spatial and temporal dimensions. Compared with STEP, STD-MAE introduces an additional pre-training step along the spatial dimension. By applying masking separately along the two dimensions, the model can effectively capture long-range heterogeneity in MTS data (Fig. 17).

      Figure 17. 

      STD-MAE: Decoupled spatial-temporal masked autoencoders reconstruct sequences to capture long-range heterogeneity in traffic data.
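      The decoupled masking itself is straightforward to sketch: whole time steps are hidden for the temporal autoencoder, and whole sensors for the spatial one. Tensor sizes below are illustrative, not the paper's exact configuration.

```python
import torch

def random_mask(shape, ratio, dim):
    """Mask whole slices along `dim` of an (N, T) tensor with probability `ratio`."""
    keep = torch.rand(shape[dim]) >= ratio
    mask = torch.ones(shape, dtype=torch.bool)   # True = visible
    idx = [slice(None)] * len(shape)
    idx[dim] = ~keep
    mask[tuple(idx)] = False                     # False = masked
    return mask

x = torch.randn(307, 864)                    # (sensors, time steps), illustrative sizes
t_mask = random_mask(x.shape, 0.25, dim=1)   # temporal: hide time steps for all sensors
s_mask = random_mask(x.shape, 0.25, dim=0)   # spatial: hide whole sensors at all times
x_t = x * t_mask                             # input to the temporal autoencoder
x_s = x * s_mask                             # input to the spatial autoencoder
```

      Each autoencoder then reconstructs the hidden slices from the visible ones, which is what forces long-range temporal and cross-sensor structure into the representations.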

      Both pre-trained models achieve strong performance. Pre-training removes the limitation imposed by the input sequence length and, more importantly, frees the models from restrictions on data sources: a wider range of data can be used for pre-training rather than only traffic flow data from a single area.

    • Differential equations offer a natural framework for representing traffic flow as a continuous function of both time and space. The combination of neural networks and differential equations has given rise to a new class of models that integrate the representational power of deep learning with the theoretical rigor of differential equation modeling. Often referred to as Neural Controlled Differential Equations (NCDEs) and Neural Rough Differential Equations (NRDEs), these models have achieved breakthroughs in sequential data modeling by treating sequential data as observations of continuous-time dynamics (Fig. 18).

      Figure 18. 

      STG-NCDE and STG-NRDE: Temporal and spatial NCDEs/NRDEs jointly model continuous-time dynamics and intricate patterns in traffic flows.

      STG-NCDE[64] integrates two distinct Neural Controlled Differential Equations (NCDEs) within a unified framework. One NCDE focuses on the temporal dimension, capturing the evolution of traffic patterns over time, while the other NCDE addresses the spatial dimension, modeling the complex interactions and dependencies among different locations in a road network.

      STG-NRDE[65] utilizes Neural Rough Differential Equations (NRDEs) to process sequential data and extends the concept to the temporal and spatial dimensions. The authors designed two NRDEs, one for each dimension, and integrated them into a cohesive framework to capture the complex dynamics of traffic flows. The key innovation of this work is its ability to handle irregular time-series data and to model the intricate interdependencies in traffic data more effectively than traditional approaches.
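      Both models build on the controlled differential equation dz = f(z) dX, where the observed series X acts as the control path. The sketch below discretizes this with plain Euler steps; real NCDE/NRDE implementations interpolate X continuously and use adaptive solvers (e.g. via the torchcde library), so this is only the core update rule.

```python
import torch
import torch.nn as nn

class CDEFunc(nn.Module):
    """f_theta: maps hidden state z to a matrix applied to increments of the control X."""
    def __init__(self, hidden, channels):
        super().__init__()
        self.hidden, self.channels = hidden, channels
        self.net = nn.Sequential(nn.Linear(hidden, 64), nn.Tanh(),
                                 nn.Linear(64, hidden * channels))
    def forward(self, z):                                        # z: (B, H)
        return self.net(z).view(-1, self.hidden, self.channels)  # (B, H, C)

def euler_ncde(X, z0, func):
    """Integrate dz = f(z) dX with Euler steps over observations X: (T, B, C)."""
    z = z0
    for k in range(X.size(0) - 1):
        dX = (X[k + 1] - X[k]).unsqueeze(-1)   # (B, C, 1) increment of the control
        z = z + (func(z) @ dX).squeeze(-1)     # (B, H) hidden-state update
    return z

B, T, C, H = 8, 12, 3, 16
X = torch.randn(T, B, C)                       # traffic observations as a control path
z_final = euler_ncde(X, torch.zeros(B, H), CDEFunc(H, C))
print(z_final.shape)  # torch.Size([8, 16])
```

      Because the state evolves continuously between observations, the same machinery handles irregularly sampled series, which is the property STG-NRDE exploits.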

    • To establish a fair benchmarking protocol and provide practical guidance for model selection in real-world applications, a systematic evaluation was conducted on the PEMS04 dataset. Specifically, a diverse set of representative spatiotemporal forecasting models were re-implemented, including the following: DCRNN[49], STGCN[50], GraphWaveNet[51], STSGCN[52], STAWnet[53], ST-SSL[54], MegaCRN[55], D2STGNN[56], DDGCRN[57], RGDAN[58], STWave[59], STAEFormer[60], PDFormer[61], STEP[62], STD-MAE[63], STG-NCDE[64], and STG-NRDE[65]. The prediction task is formulated as forecasting the next 12 time steps using a 12-step lookback window, with each step corresponding to 5 min, so that 12 steps cover 1 h in the PEMS04 dataset.
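      The 12-in/12-out formulation corresponds to a standard sliding-window slicing of the raw sensor readings, sketched here with a one-week toy series over 307 sensors (PEMS04's sensor count; the full dataset spans more steps).

```python
import numpy as np

def make_windows(series, lookback=12, horizon=12):
    """Slice a (T, N) reading matrix into (X, Y) pairs: 12 past steps -> 12 future steps."""
    X, Y = [], []
    for t in range(len(series) - lookback - horizon + 1):
        X.append(series[t : t + lookback])
        Y.append(series[t + lookback : t + lookback + horizon])
    return np.stack(X), np.stack(Y)

data = np.random.rand(2016, 307)    # one week of 5-min steps across 307 sensors
X, Y = make_windows(data)
print(X.shape, Y.shape)             # (1993, 12, 307) (1993, 12, 307)
```

      All seventeen re-implemented models consume windows of exactly this shape, which is what makes their accuracy figures directly comparable.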

      To ensure methodological rigor and comparability, all models were trained and evaluated using the same hyperparameter settings as reported in the original papers. In addition, the computational environment was unified across all experiments (see Table 3) to minimize potential biases arising from hardware or software variations.

      Table 3.  Device information.

      Device Info
      CPU Intel(R) Xeon(R) Gold 6133 CPU @ 2.50GHz
      GPU NVIDIA GeForce RTX 4090
      RAM 32 GB
      VRAM 24 GB
      Python 3.10.16
      CUDA 12.3
      PyTorch 2.6.0

      During benchmarking, various aspects of model performance were systematically tracked, including GPU memory footprint, CPU memory usage, parameter size, predictive accuracy, and runtime efficiency. Figure 19 gives an overview of each model's predictive performance, peak GPU memory usage, and parameter size, offering a clear reference for model selection in practical applications. Table 4 presents detailed runtime statistics for all evaluated models. Note that HA, ARIMA, and VAR are not deep learning models and involve no training process; therefore, some metrics, including GPU memory usage, are omitted for them.
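      For reference, the parameter counts in Table 4 are simple totals of trainable parameters. A sketch with a stand-in model (the reviewed models are counted identically; peak GPU memory would additionally be read from torch.cuda.max_memory_allocated() on a CUDA device):

```python
import torch.nn as nn

def count_params(model):
    """Total trainable parameters, as reported in the 'Params' column."""
    return sum(p.numel() for p in model.parameters() if p.requires_grad)

# Tiny stand-in model (illustrative only, not one of the benchmarked architectures).
model = nn.Sequential(nn.Linear(307, 64), nn.ReLU(), nn.Linear(64, 307))
print(count_params(model))  # 39667 = (307*64 + 64) + (64*307 + 307)
```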

      Figure 19. 

      Model performance vs memory and parameters.

      Table 4.  Runtime, resource usage, and predictive performance of models on PEMS04 (MAPE in %; VRAM and RAM in MB).

      Model Params MAE RMSE MAPE (%) VRAM (MB) RAM (MB) Iter/s
      HA 38.03 59.24 27.88
      ARIMA 33.73 48.80 24.18
      VAR 24.54 38.61 17.24
      DCRNN 546,881 21.00 33.16 14.66 2,385.22 1,276.96 3.09
      STGCN 297,228 21.03 33.38 15.07 1,886.88 1,177.90 14.30
      GraphWaveNet 278,632 18.80 30.87 12.38 2,461.48 1,241.01 5.46
      STSGCN 2,024,445 21.88 35.49 15.29 4,114.91 1,028.78 7.09
      STAWnet 281,756 18.96 31.35 12.72 3,200.94 1,254.59 5.21
      ST-SSL 288,275 16.89 26.98 10.48 3,208.00 1,566.00 16.19
      MegaCRN 392,761 19.15 31.36 12.58 3,217.96 1,140.38 7.72
      D2STGNN 398,788 18.38 30.10 12.06 10,310.0 5,067.00 6.34
      DDGCRN 671,704 18.39 30.73 12.09 3,957.25 1,441.35 8.38
      RGDAN 107,285 19.09 32.10 12.99 2,759.20 1,164.68 19.12
      STWave 882,558 18.56 30.36 12.81 9,884.15 4,712.99 2.17
      STAEFormer 1,354,932 20.76 32.38 20.06 3,200.36 1,217.37 19.88
      PDFormer 531,165 18.46 29.97 12.67 4,591.52 5,351.85 1.64
      STEP 25,469,726 18.31 29.90 12.49 7,091.23 1,936.99 2.02
      STD-MAE 673,384 17.89 29.33 12.11 532.70 3,628.76 99.72
      STG-NRDE 716,628 20.15 32.22 13.51 577.83 3,739.68 5.26
      STG-NCDE 2,600,532 20.40 32.30 13.95 3,657.37 3,793.33 0.85
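      The MAE, RMSE, and MAPE columns follow the standard definitions, sketched below. Masking zero flows in MAPE is a common convention in this literature so the ratio stays defined, though individual papers may handle it differently.

```python
import numpy as np

def mae(y, yhat):
    return np.mean(np.abs(y - yhat))

def rmse(y, yhat):
    return np.sqrt(np.mean((y - yhat) ** 2))

def mape(y, yhat, eps=1e-8):
    m = np.abs(y) > eps                 # ignore (near-)zero flows
    return np.mean(np.abs((y[m] - yhat[m]) / y[m])) * 100

y    = np.array([100.0, 200.0, 50.0])   # observed flows
yhat = np.array([110.0, 190.0, 45.0])   # predictions
print(mae(y, yhat), rmse(y, yhat), mape(y, yhat))  # ≈ 8.33 8.66 8.33
```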

      Figure 20 presents the relationship between training time and MAE, illustrating each model's convergence speed toward a target performance level. These curves provide insight into training efficiency, indicating not only the total computational cost required to reach a given accuracy but also each model's convergence rate relative to the others. Such analysis facilitates evaluation of both effectiveness and practical applicability in real-world scenarios, where training time is often a critical factor.

      Figure 20. 

      Training convergence curves of MAE over time.

    • Deep neural networks have driven the development of traffic flow prediction. However, there are still some challenges in this field. In this section, future directions and challenges are discussed.

    • Effective traffic flow prediction remains challenging under insufficient data conditions, especially in the early stages of building new roads or traffic networks. Some cities may lack sufficient sensing infrastructure, resulting in limited available data[83]. Even in cities with a sufficient number of sensors, the amount of usable data in practical deployments can be severely limited due to the presence of noisy or missing measurements. This issue is particularly pronounced when considering multiple nodes simultaneously over a continuous time interval, highlighting data scarcity as a critical challenge. For instance, a case study is conducted using the PEMS04 dataset and the GraphWaveNet model. The model is evaluated with varying proportions of the training set—10%, 30%, 50%, 70%, and 100%—while keeping the validation and test sets unchanged. All models are trained with identical hyperparameter settings. Table 5 presents the results of this experiment.

      Table 5.  Model performance under different training data proportions.

      Training data proportion MAE RMSE MAPE
      10% 19.66 31.77 13.08%
      30% 20.12 32.54 13.15%
      50% 19.87 31.97 13.64%
      70% 19.50 31.54 12.66%
      100% 18.82 30.78 12.41%

      Notably, a model trained with only 10% of the training set outperforms models trained with 30%, 50%, or 70% of the data. This counterintuitive result may be attributed to sampling bias or the noise distribution in the training subsets. Nonetheless, model performance generally improves as the training data volume increases, indicating that data quantity remains a critical factor affecting model effectiveness. Consequently, few-shot learning and transfer learning techniques have become key research focuses[84−86].
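      The subsetting protocol can be sketched as follows. One caveat: the text does not specify whether contiguous or random subsets were drawn; this version samples training windows at random, which is one plausible reading.

```python
import numpy as np

def subset_training(X_train, Y_train, fraction, seed=0):
    """Keep a random `fraction` of training windows; val/test sets stay untouched."""
    rng = np.random.default_rng(seed)
    n = round(len(X_train) * fraction)
    idx = rng.choice(len(X_train), size=n, replace=False)
    return X_train[idx], Y_train[idx]

X_train = np.random.rand(1000, 12, 307)   # toy window counts, PEMS04-like shapes
Y_train = np.random.rand(1000, 12, 307)
for frac in (0.1, 0.3, 0.5, 0.7, 1.0):
    Xs, Ys = subset_training(X_train, Y_train, frac)
    print(frac, len(Xs))
```

      Fixing the seed makes subset membership reproducible; varying it would expose the sampling-bias effect conjectured above.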

      Few-shot learning and transfer learning are two effective approaches for addressing data scarcity[87]. Few-shot learning aims to achieve effective model training and generalization with very limited samples[88]. It improves the generalization ability of models with few samples through methods such as metric learning and data augmentation. Transfer learning enhances the performance of target domain models by transferring pre-trained models or knowledge from one domain (source domain) to another domain (target domain)[89]. It can effectively predict traffic conditions in newly built roads or regions with sparse traffic data using techniques such as model fine-tuning, feature transfer, and cross-domain adaptation.
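      A minimal fine-tuning sketch of the transfer idea follows, with a hypothetical source-city encoder and target-city head; the layer sizes and the freeze-then-fine-tune split are illustrative, not a prescription from any cited work.

```python
import torch.nn as nn

# Hypothetical source-city model: a shared encoder plus a prediction head.
model = nn.Sequential(
    nn.Linear(12, 64), nn.ReLU(),   # encoder, pre-trained on a data-rich source city
    nn.Linear(64, 12),              # head, re-trained on the data-scarce target city
)

# Transfer: freeze the encoder, fine-tune only the head on target-city data.
for p in model[0].parameters():
    p.requires_grad = False

n_trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
print(n_trainable)  # 780 = 64*12 + 12, the head only
```

      With only the small head left trainable, a few target-city samples suffice for adaptation, which is precisely the regime few-shot transfer targets.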

      In recent years, few-shot learning and transfer learning have shown preliminary progress in traffic flow forecasting tasks. Chen et al.[90] proposed a cross-city transfer learning framework that integrates multi-semantic fusion with hierarchical graph clustering, enabling the model to capture dynamic traffic states, preserve static spatial dependencies, and leverage multi-granular information via simultaneous multi-scale prediction. Ouyang et al.[91] proposed a graph-based domain adversarial framework for cross-city spatiotemporal prediction, leveraging self-adaptive spatiotemporal knowledge, a knowledge attention mechanism, and dual domain discriminators to extract domain-invariant features. Li et al.[92] proposed a physics-guided multi-source transfer learning method for multi-region traffic flow, which leverages prior knowledge from observations, empirical studies, and the physical properties of traffic networks, and transfers network traffic features using adversarial training combined with a Macroscopic Fundamental Diagram (MFD)-based weighting scheme.

      Future research in traffic prediction could further explore cross-city transfer learning and few-shot learning to address challenges posed by data scarcity in newly deployed sensors or regions. One promising direction is the development of dynamic multi-source transfer frameworks that leverage both static and evolving traffic network properties and incorporate long-term, high-resolution data together with real-time traffic regulation records. In parallel, advanced adaptation techniques are needed to mitigate issues such as overfitting and catastrophic forgetting, ensuring robust knowledge transfer across heterogeneous spatiotemporal data sources, such as trajectories, weather, and points of interest. Combining these strategies may enable more generalizable and adaptive models capable of few-shot learning in new cities or traffic scenarios.

    • Subtle changes in traffic flow can have a large impact on future traffic conditions. Owing to the inherent randomness of traffic flow and external noise, such as accidents, weather conditions, manual traffic control, or detector malfunctions, accurately identifying meaningful changes in traffic flow while filtering noise is challenging[93]. Multimodal data fusion is an effective approach that integrates multiple data sources to improve the accuracy of traffic flow prediction. By integrating sources such as traffic surveillance videos, GPS trajectories, meteorological information, and social media data[94], researchers can build more comprehensive and robust traffic flow prediction models. These diverse data sources provide rich information that helps the model better understand the complex dynamics and latent patterns of traffic flow.

      Multimodal data fusion techniques aim to extract features from different data sources and effectively combine these features to enhance the model's predictive capability. Multi-view learning and deep fusion strategies are two commonly used methods[95]. Multi-view learning captures the diversity of traffic flow by processing data from different views[96], while deep fusion strategies use deep learning models, such as convolutional neural networks and graph neural networks, to deeply integrate multimodal data[97], thereby making full use of the complementary advantages of different data sources and improving the accuracy and robustness of predictions.
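      A deep-fusion sketch in the style described above, with hypothetical flow and weather encoders whose embeddings are concatenated before prediction; all dimensions and names are illustrative.

```python
import torch
import torch.nn as nn

class LateFusion(nn.Module):
    """Concatenate per-modality embeddings, then predict: a common deep-fusion pattern."""
    def __init__(self, d_flow=12, d_weather=4, d=32):
        super().__init__()
        self.flow_enc = nn.Linear(d_flow, d)       # encodes a 12-step flow window
        self.weather_enc = nn.Linear(d_weather, d) # encodes weather features
        self.head = nn.Linear(2 * d, 12)           # fused features -> 12-step forecast
    def forward(self, flow, weather):
        h = torch.cat([self.flow_enc(flow), self.weather_enc(weather)], dim=-1)
        return self.head(torch.relu(h))

model = LateFusion()
pred = model(torch.randn(8, 12), torch.randn(8, 4))  # batch of 8 samples
print(pred.shape)  # torch.Size([8, 12])
```

      Replacing the linear encoders with convolutional or graph neural networks yields the deeper fusion strategies discussed above, while the concatenate-and-predict skeleton stays the same.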

      Future research can further explore the application of multimodal data fusion in traffic flow prediction. This can improve model adaptability in diverse data environments and enhance the understanding of complex traffic scenarios by incorporating additional data sources. Meanwhile, as sensor technology and data collection methods continue to advance, multimodal data fusion is expected to be widely applied in real-world traffic scenarios, promoting the development of intelligent transportation systems and enabling more accurate and efficient traffic management.

    • As the scale of the transportation network expands, a large amount of traffic flow data is being collected. These data provide a wealth of information for traffic flow prediction, but also impose higher demands on model computational capabilities[98]. Currently, models require substantial computational resources to process large-scale road network data[99], making the development of lightweight and effective models an important future research direction.

      Future research can further explore the use of efficient graph neural network algorithms and parallel computing techniques in traffic flow prediction. Such approaches can enhance model processing capabilities for large-scale data while reducing computational costs through optimized resource utilization. With ongoing advancements in computing hardware and algorithms, lightweight and efficient models are expected to see broader application in real-world traffic scenarios, thereby supporting the development of intelligent transportation systems and enabling more accurate and efficient traffic management.

    • In this article, machine learning–based traffic flow prediction is reviewed. First, the background of traffic flow prediction tasks is introduced, and the problem definition of traffic flow prediction is provided. Three common traffic flow metrics and three standard measures of model predictive performance are summarized. Next, the 15 traffic flow datasets used in previous studies are reviewed, with detailed descriptions of how these datasets have been pre-processed.

      The discussion begins with classical machine learning models and then explores more recent deep learning models. In particular, graph neural networks and spatiotemporal convolutional networks are discussed, including state-of-the-art pre-trained models and differential equation–based models. Finally, based on this review, three future research directions for traffic flow prediction are proposed. These directions include few-shot and transfer learning, multimodal data integration, and the development of lightweight and efficient models.

      • This research was supported by the National Natural Science Foundation of China (Grant No. 62462021), the Philosophy and Social Sciences Planning Project of Zhejiang Province (Grant No. 25JCXK006YB), the Hainan Provincial Natural Science Foundation of China (Grant No. 625RC716), the Guangdong Basic and Applied Basic Research Foundation (Grant No. 2025A1515010197), the Hainan Province Higher Education Teaching Reform Project (Grant No. HNJG2024ZD-16), Hainan Postgraduate Innovation Research Project (Grant No. Qhys2023-127), the National Key Research and Development Program of China (Grant No. 2021YFB2700600), and Guangxi Natural Science Foundation (Grant No. 2025JJA170089). The authors express their gratitude to the reviewers and editors for their valuable feedback and contributions to refining this manuscript.

      • The authors confirm contribution to the paper as follows: draft manuscript preparation: Zhang H; investigation: Lin Z, Zhou J, Sun J; supervision: Zhou T; project administration: Cao C. All authors reviewed the results and approved the final version of the manuscript.

      • The authors declare that they have no conflict of interest.

      • Copyright: © 2025 by the author(s). Published by Maximum Academic Press, Fayetteville, GA. This article is an open access article distributed under Creative Commons Attribution License (CC BY 4.0), visit https://creativecommons.org/licenses/by/4.0/.
    Cite this article
    Zhang H, Lin Z, Zhou J, Sun J, Zhou T, et al. 2025. A comprehensive review of traffic flow prediction: from traditional models to deep learning architectures. Digital Transportation and Safety 4(4): 281−297 doi: 10.48130/dts-0025-0027
