Rutting depth prediction model for asphalt pavements based on a dual branch spatiotemporal attention network

Jun Hao; Yuhan Weng; Le Li; Zhenzhen Xing; Lili Pei; Jun Hao; Yuhan Weng; Le Li; Zhenzhen Xing; Lili Pei

doi:10.48130/dts-0026-0003

Existing asphalt pavement rutting prediction models suffer from large long-term prediction errors due to their reliance on laboratory parameters and simplified assumptions. To address this issue, a dual-branch spatio-temporal attention network model (DSAN) is proposed. The model is constructed by fusing temporal convolutional networks (TCN), long short-term memory networks (LSTM), and multi-head attention mechanisms to form parallel feature extraction branches for hierarchical spatio-temporal modeling. Validation is conducted based on full-scale pavement loop test results of eight typical asphalt pavement structures (AC layer thickness: 12–52 cm) under 80 million equivalent standard axle loads (ESALs) accumulated during 2017–2023. Results show that the DSAN model significantly outperforms comparative models in prediction accuracy, breaking through the generalization bottleneck of traditional models for different layer thickness structures. This study provides an efficient solution for long-term performance prediction of asphalt pavements.

HTML

Introduction

Against the backdrop of the continuous expansion of global transportation networks and the surge in freight demand, asphalt pavement, as the primary structural form of high-grade highways, is facing increasingly severe challenges regarding durability. Among these challenges, rutting, one of the most typical early-stage distresses in asphalt pavement, exhibits significant characteristics of nonlinearity, accumulation, and spatiotemporal variability in its evolutionary process due to the coupled effects of long-term repeated wheel loading, cyclic alternations of environmental temperature and humidity, and the inherent deterioration of materials^[1−4]. The occurrence of rutting not only leads to a decline in pavement smoothness and driving comfort but also causes local stress concentration in the wheel path, exacerbates fatigue damage to the pavement structure, and even induces safety hazards such as vehicle skidding and prolonged braking distances^[5].

In recent years, academia and engineering circles have conducted extensive research on rutting prediction models, forming two core technical approaches: mechanical models and data-driven models. Mechanical models are based on material constitutive relationships and structural mechanical responses, describing the deformation behavior of asphalt mixtures under complex loads through the construction of a viscoelastic-plastic theoretical framework^[6,7]. For example, the Burgers model or the generalized Maxwell model is used to characterize the viscoelastic properties of asphalt mixtures, and the interlayer interaction is considered to establish the physical mechanism of rutting accumulation^[8]. However, such models are limited by the assumption of a homogeneous medium and the constraints of laboratory test parameters, making it difficult to accurately simulate the long-term mechanical responses of the multiphase heterogeneous system of actual pavements.

With the development of sensing technology and big data analysis, data-driven models, relying on their adaptive learning ability for complex nonlinear relationships, have gradually become a research hotspot in rutting prediction. Early traditional machine learning methods, such as random forests and support vector machines (SVM), have achieved certain results in rutting prediction for specific road sections by integrating traffic loads, environmental factors, and pavement parameters^[9−13]. For instance, a rutting model based on random forests can identify key factors affecting rutting (such as the number of axle load applications and the duration of high temperatures) through importance ranking, with prediction accuracy improved by 10%−15% compared to regression analysis^[14]. Nevertheless, these methods rely on manual feature engineering, making it difficult to capture the long-term temporal dependencies in rutting evolution (such as the lag effect of seasonal temperature fluctuations on deformation accumulation). Additionally, they are sensitive to changes in data distribution and have limited generalization capabilities^[15].

In recent years, the advent of deep learning technology has opened up a novel avenue for rutting prediction. Deep learning models (e.g., LSTM and Transformer) have substantially enhanced the modeling accuracy of complex time-series data by virtue of their end-to-end feature learning and nonlinear mapping capabilities^[16,17]. For instance, the LSTM-BPNN hybrid model integrates short-term fluctuations and long-term trends through an attention mechanism, achieving a 15% improvement in prediction accuracy relative to conventional models^[18]. Nevertheless, existing deep learning models still suffer from limitations such as inadequate multi-source data integration capacity, generalization performance that is contingent upon the coverage of training data, and a lack of in-depth integration with the physical mechanisms underlying rutting.

To address these issues, this study proposes a DSAN model, which aims to realize high-precision prediction and mechanistic interpretation of rutting depth via multi-scale feature fusion and focusing on key information. The model innovatively adopts a parallel dual-branch architecture: the TCN-LSTM branch leverages the local feature extraction capability of TCN and the long-term time-series modeling advantage of LSTM to capture both short-term fluctuations and long-term trends in rutting evolution; the LSTM-MHA branch incorporates a Multi-Head Attention (MHA) mechanism to weighted focus on critical time-series nodes (e.g., load peaks, abrupt temperature changes), thereby enhancing the model's sensitivity to influencing factors.

To validate the effectiveness and generalization ability of the proposed model, this research utilizes long-term observational data from the RIOHTrack full-scale circular test track facility of the Research Institute of Highway, Ministry of Transport. The dataset encompasses a cumulative 80 million ESALs and covers typical pavement structures with varying asphalt layer thicknesses (12–52 cm) involving four major base types (semi-rigid base, flexible base, composite base, etc.). By comparing the prediction performance of DSAN with that of mainstream deep learning models, including LSTM, TCN-LSTM, and LSTM-Attn, this study elucidates the mechanism by which variations in asphalt layer thickness affect rutting development rates, providing a theoretical foundation and technical support for rutting prevention and control in pavements of different structural types.

[1]	Wang Z, Guo N, Wang S, Xu Y. 2021. Prediction of highway asphalt pavement performance based on Markov chain and artificial neural network approach. The Journal of Supercomputing 77(2):1354−1376 doi: 10.1007/s11227-020-03329-4 CrossRef Google Scholar
[2]	Han Z, Sha A, Hu L, Jiang W. 2023. Calibration of inverted asphalt pavement rut prediction model, based on full-scale accelerated pavement testing. Materials 16(2):814 doi: 10.3390/ma16020814 CrossRef Google Scholar
[3]	Bao S, Han K, Zhang L, Luo X, Chen S. 2021. Pavement maintenance decision making based on optimization models. Applied Sciences 11(20):9706 doi: 10.3390/app11209706 CrossRef Google Scholar
[4]	Jourdain NOAS, Steinsland I, Birkhez-Shami M, Vedvik E, Olsen W, et al. 2024. A spatial-statistical model to analyse historical rutting data. International Journal of Pavement Engineering 25(1):2385013 doi: 10.1080/10298436.2024.2385013 CrossRef Google Scholar
[5]	Elhadidy AA, El-Badawy SM, Elbeltagi EE. 2021. A simplified pavement condition index regression model for pavement evaluation. International Journal of Pavement Engineering 22(5):643−652 doi: 10.1080/10298436.2019.1633579 CrossRef Google Scholar
[6]	Said SF, Hakim H. 2016. Asphalt concrete rutting predicted using the PEDRO model. International Journal of Pavement Engineering 17(3):245−252 doi: 10.1080/10298436.2014.993184 CrossRef Google Scholar
[7]	Perl M, Uzan J, Sides A. 1983. Visco-elasto-plastic constitutive law for a bituminous mixture under repeated loading. Transportation Research Record 1983(911):118−127 Google Scholar
[8]	Alae M, Zhao Y, Zarei S, Fu G, Cao D. 2020. Effects of layer interface conditions on top-down fatigue cracking of asphalt pavements. International Journal of Pavement Engineering 21(3):280−288 doi: 10.1080/10298436.2018.1461870 CrossRef Google Scholar
[9]	Shtayat A, Moridpour S, Best B, Abuhassan M. 2022. Using supervised machine learning algorithms in pavement degradation monitoring. International Journal of Transportation Science and Technology 12:628−639 doi: 10.1016/j.ijtst.2022.10.001 CrossRef Google Scholar
[10]	Sandamal K, Shashiprabha S, Muttil N, Rathnayake U. 2023. Pavement roughness prediction using explainable and supervised machine learning technique for long-term performance. Sustainability 15(12):9617 doi: 10.3390/su15129617 CrossRef Google Scholar
[11]	Marcelino P, de Lurdes Antunes M, Fortunato E, Gomes MC. 2021. Machine learning approach for pavement performance prediction. International Journal of Pavement Engineering 22(3):341−354 doi: 10.1080/10298436.2019.1609673 CrossRef Google Scholar
[12]	Sharma A, Sachdeva SN, Aggarwal P. 2023. Predicting IRI using machine learning techniques. International Journal of Pavement Research and Technology 16(1):128−137 doi: 10.1007/s42947-021-00119-w CrossRef Google Scholar
[13]	Gong H, Sun Y, Shu X, Huang B. 2018. Use of random forests regression for predicting IRI of asphalt pavements. Construction and Building Materials 189:890−897 doi: 10.1016/j.conbuildmat.2018.09.017 CrossRef Google Scholar
[14]	Li W, Ju H, Xiao L, Tighe S, Pei L. 2019. International roughness index prediction based on multigranularity fuzzy time series and particle swarm optimization. Expert Systems with Applications: X 2:100006 doi: 10.1016/j.eswax.2019.100006 CrossRef Google Scholar
[15]	Justo-Silva R, Ferreira A, Flintsch G. 2021. Review on machine learning techniques for developing pavement performance prediction models. Sustainability 13(9):5248 doi: 10.3390/su13095248 CrossRef Google Scholar
[16]	Zhou Q, Okte E, Al-Qadi IL. 2021. Predicting pavement roughness using deep learning algorithms. Transportation Research Record 2675(11):1062−1072 doi: 10.1177/03611981211023765 CrossRef Google Scholar
[17]	Ziari H, Sobhani J, Ayoubinejad J, Hartmann T. 2016. Prediction of IRI in short and long terms for flexible pavements: ANN and GMDH methods. International Journal of Pavement Engineering 17(9):776−788 doi: 10.1080/10298436.2015.1019498 CrossRef Google Scholar
[18]	Dong Y, Shao Y, Li X, Li S, Quan L, et al. 2019. Forecasting pavement performance with a feature fusion LSTM-BPNN model. Proceedings of the 28th ACM International Conference on Information and Knowledge Management. Beijing, China, 2019. New York, NY, USA: ACM. pp. 1953−1962 doi: 10.1145/3357384.3357867
[19]	Selsal Z, Karakas AS, Sayin B. 2022. Effect of pavement thickness on stress distribution in asphalt pavements under traffic loads. Case Studies in Construction Materials 16:e01107 doi: 10.1016/j.cscm.2022.e01107 CrossRef Google Scholar
[20]	Alavi MZ, Ahmadi A, Movahed FV. 2025. How aggregate gradation and layer thickness influence asphalt microsurfacing texture and skid resistance. Construction and Building Materials 481:141482 doi: 10.1016/j.conbuildmat.2025.141482 CrossRef Google Scholar
[21]	Yang R, Liu L, Sun L, Jin T, Cheng H, et al. 2025. Effective temperature model for rutting prediction considering temperature distribution inside the asphalt pavements. Road Materials and Pavement Design 26(12):3118−3137 doi: 10.1080/14680629.2025.2477312 CrossRef Google Scholar

Structure No.	AC layer thickness (cm)	Base course type
STR1	12	Thin asphalt semi-rigid structure
STR6	16	Ordinary semi-rigid structure
STR7	18	Ordinary semi-rigid structure
STR13	24	24~28 cm thick asphalt structure
STR11	28	24~28 cm thick asphalt structure
STR16	36	36 cm thick asphalt structure
STR19	48	Full-depth asphalt structure
STR18	52	Full-depth asphalt structure

Parameter name	Value
Look_back	12
Hidden_dim	128
Layers of TCN	2
Layers of LSTM	1
Heads of MSA	8
Learning_rate	0.001

Model	RMSE	MAE	R²
LSTM	4.581	2.696	0.93
TCN_LSTM	4.527	2.728	0.932
LSTM_Attn	4.562	2.665	0.931
DSAN	4.272	2.452	0.943

Thickness grouping	Model	Evaluation metric
Thickness grouping	Model	RMSE	MAE	R²
Thin layer structure group	DSAN	5.3	3.32	0.913
	TCN-only	5.721	3.632	0.898
	LSTM-only	5.573	3.541	0.903
	LSTM + MHA-only	5.424	3.44	0.908
	DualTCN-only	5.514	3.53	0.904
	DSAN (without TCN)	5.628	3.579	0.902
	DSAN (without MSA)	5.649	3.588	0.9
Middle structure group	DSAN	5.514	3.706	0.959
	TCN-only	6.626	4.612	0.942
	LSTM-only	5.991	4.17	0.951
	LSTM + MHA-only	5.629	3.917	0.956
	DualTCN-only	6.165	4.29	0.947
	DSAN (without TCN)	5.728	3.986	0.955
	DSAN (without MSA)	5.93	4.126	0.952
Thick layer structure group	DSAN	5.808	3.819	0.958
	TCN-only	6.482	4.131	0.947
	LSTM-only	6.29	4.01	0.951
	LSTM + MHA-only	5.905	3.763	0.956
	DualTCN-only	6.391	4.073	0.95
	DSAN (without TCN)	5.905	3.763	0.955
	DSAN (without MSA)	6.288	4.013	0.951

{{lists.name}}

Rutting depth prediction model for asphalt pavements based on a dual branch spatiotemporal attention network

Abstract

Rights and permissions

References

About this article

Cite this article

Article Metrics

Access History

Other Articles By Authors