Road traffic can be predicted by machine learning equally effectively as by complex microscopic model

Sroczyński, Andrzej; Czyżewski, Andrzej

doi:10.1038/s41598-023-41902-y

Download PDF

Article
Open access
Published: 04 September 2023

Road traffic can be predicted by machine learning equally effectively as by complex microscopic model

Scientific Reports volume 13, Article number: 14523 (2023) Cite this article

4246 Accesses
3 Citations
Metrics details

Subjects

Abstract

Since high-quality real data acquired from selected road sections are not always available, a traffic control solution can use data from software traffic simulators working offline. The results show that in contrast to microscopic traffic simulation, the algorithms employing neural networks can work in real-time, so they can be used, among others, to determine the speed displayed on variable message road signs. This paper describes an experiment to develop and test machine learning models, i.e., long short-term memory, gated recurrent unit recurrent networks, and stacked autoencoder networks. It compares their effectiveness with traffic prediction results generated using a widely recognized traffic simulator that analyzes traffic at the level of individual vehicles.

Development and evaluation of bidirectional LSTM freeway traffic forecasting models using simulation data

Article Open access 13 December 2021

Learning naturalistic driving environment with statistical realism

Article Open access 11 April 2023

Using machine learning methods to predict electric vehicles penetration in the automotive market

Article Open access 23 May 2023

Introduction

Over the past decade, intelligent transportation systems (ITS) have become an essential research object and continue developing rapidly. ITS combines information, communications, and transportation technology to optimize traffic management, improve road safety, increase the efficiency of transportation networks, and improve public transportation management. An important factor affecting the quality of ITS operations, especially in the context of traffic control, is traffic volume information, both long-term and short-term. Individual and business road users particularly desire the latter. Because of the stochastic and nonlinear nature of the traffic phenomenon, accurately predicting it in real time has been and remains a difficult task¹. The methods used so far mainly rely on linear and shallow machine learning models to predict traffic flow. These methods cannot correctly reflect the non-linearity and probabilistic characteristics of the phenomena, although they can respond to seasonal changes^2,3. In recent years, there have been successful alternative attempts at traffic prediction based on deep machine learning methods^1,4,5,6, and methods based on combining different models, such as ARIMA and LSTM⁷, Kalman filters, and ARIMA⁸. More complex strategies are also being used to improve prediction accuracy, such as the combined use of the light gradient boosting machine (LightGBM) and the gated recurrent unit (GRU) with the SARIMA-GRU prediction model that takes into account the periodicity of traffic phenomena⁹. Despite advances in AI and machine learning, the importance of traditional statistical methods should not be diminished³. Among those often found in publications are GRU time series prediction methods, LSTM NNs¹⁰, and stacked autoencoders (SAE)¹. In cases of large, complex motion models, the standard practice is to combine spline-recurrent neural networks with a motion graph^11,12. This method allows the influence of traffic on neighboring roads to be considered. It is particularly effective when modeling large, branching road networks, such as in large metropolitan areas^{11,13,14,15,16}. The surprisingly rapid development of machine learning methods has impressively increased the number of possible types and configurations of neural networks, but so far, this progress has not extended to the problem of optimization understood both in terms of choosing the most suitable model and its architecture (e.g., the number of hidden layers is still chosen arbitrarily) and the selection of its internal parameters¹⁰. This problem is general and does not apply only to traffic prediction. Due to the traffic phenomenon locally serial nature, the GRU, LSTM, and stacked autoencoders (SAE) neural network models mentioned above are often adopted for their analysis¹⁰. It goes without saying that to run any machine learning model, data collected in a real-world environment is necessary. An excellent and frequently used source of traffic data is the Caltrans Performance Measurement System (PeMS)¹⁷. Software traffic simulators can be an auxiliary data source when real data is unavailable or insufficient, such as not covering the required time range. In this article, we present the results of an experiment testing LSTMs, GRUs, and SEAs models trained on real data with data acquired using the Vissim¹⁸ simulator based on the traffic model ruling the movement of vehicles proposed by R. Wiedemann at Karlsruhe University. The definition of the problem to be solved in the presented research is as follows:

1.
A large real-world traffic data trains and validates the machine learning model,
2.
We would also like to assess prediction accuracy using data from other sources, such as computer simulations,
3.
Developed neural network models are applicable to determine the speed to be displayed on variable message road signs, for example, of the kind we have constructed previously¹⁹.

The remainder of this attribution is organized as follows: in “Related work” section contains a brief literature review, in “Traffic simulation methods” section describes popular traffic simulation methods, in “Dataset and tests” section describes the actual data used in the experiment and a description of the investigation, and in “Results” section presents the results obtained. Finally, the results and conclusion are discussed in “Discussion” and “Conclusions”, respectively.

Related work

The extensive literature on the application and use of various machine-learning techniques for traffic prediction continues to increase. However, given the number of available publications, this review can only provide a cursory glimpse into the current state of the vast literature on the subject.

Over the past decade, many different method proposals have been made for traffic prediction. The proposed methods were initially based on the autoregressive integrated moving average model (ARIMA). However, it turned out that the ARIMA model, which is based on linear relationships, cannot analyze the nonlinear and time-varying relationships that are an inherent feature of traffic⁵. Furthermore, the ongoing development of methods related to deep learning in recent years has led to the emergence of new proposals, in particular SAE, DNN, DBN, LSTM, or CNN-LSTM, as well as methods that are combinations of them.

One of the many examples of this approach is a publication⁵, whose authors propose a combination of CNN and LSTM (Conv-LSTM) methods for extracting spatial–temporal properties and a two-kernel LSTM (Bi-LSTM) model for extracting periodic traffic properties. According to the authors, the proposed method gives better results than the other methods mentioned in the comparison. The ARIMA and SAE methods performed particularly unfavorably. It is also worth noting that the quality of the prediction of the presented solution was greatly affected by the addition of the Bi-LSTM module.

A slightly different approach to the problem of spatial traffic analysis shows Li et al.²⁰. The authors suggest modeling traffic as a diffusion process on a directed graph. They propose the diffusion convolutional recurrent neural network (DCRNN) for traffic prediction. This framework represents a holistic approach and considers spatial and temporal traffic conditions. According to the authors, the experiments showed the clear superiority of DCRNN over other current methods.

The LSTM method already has an established positive reputation. Mou et al.²¹ present the results of a study on the influence of time variation on the results of LSTM prediction. The proposed improvement considers the mentioned variability and introduces the T-LSTM method. The authors also use this method to solve the problem of missing data. The results obtained employing experiments confirm the high efficiency of this method proposed in the literature.

A new paper, published in April 2023, proposes a hybrid STFSA-CNN-GRU model for short-term traffic speed prediction²². This model eliminates the limitations of single architecture and provides good accuracy and stability for multiple predictions.

The difficulties of traffic prediction were highlighted in a review article from June 2022, which is also relatively recent²³. The article authors cite the complexity of spatial–temporal relationships, the dynamics of temporal dependencies, and the influence of external factors such as weather conditions and traffic events as challenges. According to its authors, the publication is the first comprehensive survey of techniques currently used for traffic prediction based on deep machine learning. The review shows that in terms of spatial modeling, the dominant methods are CNNs and those based on Chebyshev polynomials. On the other hand, temporal parameter modeling most often uses RNNs, including variants of these networks, namely LSTM^24,25 or GRU. Such a structure is repeated in all areas of traffic prediction, i.e., flow prediction, vehicle speed, demand, travel time, and occupancy. This work gives a good indication of the complexity of the modeling techniques currently used to study traffic, a phenomenon, as it turns out, that is also very complex.

Ghosh et al.²⁶ propose a one-dimensional structural time series model (STM) used to develop an economical and computationally simple multidimensional time model algorithm. In the STM methodology, various components of a time series dataset, such as trend, seasonality, cyclical and calendar components, can be modeled separately. The results indicate that the proposed forecasting algorithm is a practical approach for real-time traffic flow prediction at multiple intersections in an urban transportation network.

Deep learning techniques are also used in solutions for tracing and reconstructing vehicle traffic trajectories. Fei et al.²⁷ use CNN techniques to analyze images from multiple cameras to identify and locate objects, including pedestrians.

A solution based on deep neural networks (DNNs) is discussed in a publication²⁸. The authors propose a six-layer DNN for long-term traffic prediction based on historical data and a set of coefficients that define the context of traffic conditions, such as day of the week, weather, and season. The primary assumption of the presented method is a strong correlation of vehicle stream flow with the starting and ending time points of a short observation period, along with contextual factors. Making such an assumption makes it possible to extract the relationship between traffic flow values in a given time interval from historical data and the combination of contextual factors.

An essential aspect of traffic analysis is the prediction of accident risk. Trirat et al.²⁹ propose multi-view graph convolutional networks for traffic accident risk prediction (MG-TAR). The presented approach attempts to solve two non-trivial problems: understanding and deriving dangerous driving statistics based on rule-based classification and modeling simultaneous static and dynamic graphs representing traffic conditions. An exciting application of deep convolutional neural networks (CNNs) is the method presented in the publication³⁰. Its authors propose a method for analyzing a large-scale transportation network based on generating, using CNNs, a graphical image containing traffic information in the analyzed transportation network.

An interesting approach to traffic prediction based on incomplete data is using neural networks with an architecture that uses graph Markov processes–graph Markov network (GMN)³¹. An extension of this method by adding spectral graph splicing operations is the second method proposed by the authors, called the 'spectral graph Markov network' (SGMN). According to the publication authors, both methods, GMN and SGMN, achieve good prediction results in accuracy and performance. Among the most recent publications, a very interesting one seems to be³², whose authors present a new approach to this problem, i.e., traffic prediction for incomplete data. Also worth mentioning is publication³³, which provides a comprehensive review and comparison of traffic prediction methods used. Very interesting results were obtained by Abduljabbar³⁴ using two-way LSTM (BiLSTM) models. The developed BiLSTM short-term traffic forecasting models were evaluated using data from a calibrated microsimulation model.

An interesting idea is the use of deep neural network methods for 'backward prediction', i.e., to reconstruct the conditions existing before the collision employing damage and deformation analysis of the vehicles involved in the crash. Promising research results were presented by Chen et al.³⁵. Experiments carried out using the three-neural network (3NN) algorithm on synthetic data confirm the considerable potential of the concept adopted.

A slightly different approach was proposed by Xie et al.³⁶ They used variational bayesian learning theory and a feed-forward neural network architecture for ‘inverse prediction’. As in³⁵, experiments were carried out based on synthetic data, but the results are promising. The development of methods and algorithms that allow the reconstruction of pre-collision conditions based on the results of a collision is significant in forensic accident analysis for vehicle designers and—most notably in the context of the main topic of this article—can assist ITS in the accident prevention process by anticipating events based on the pre-collision conditions memorized by AI algorithms.

Traffic simulation methods

Several traffic simulation methods differ in the level of detail, the complexity of modeling, range, and accuracy of the simulation. Among the most commonly used are:

Microscopic traffic simulation—this method allows simulating traffic at the level of individual vehicles, taking into account their movement, maneuvers, acceleration, braking, lane changes, etc. This type of simulation is applied to a limited area of the designed transportation network and is often used to assess the effects of changes in road infrastructure.
Macroscopic traffic simulation—this method is applied to large areas of the transportation network, allowing traffic to be simulated at the level of vehicle flow and congestion, considering only the overall traffic and capacity of the system. This type of simulation is often used to analyze the ability of transportation systems and assess the effects of changes in road geometry.
Mesoscopic traffic simulation—this method combines microscopic and macroscopic traffic simulation, allowing simultaneous consideration of the details of individual vehicle movement and the overall flow and congestion of the road system. In the mesoscopic simulation, models consider individual system elements and group elements. It means mesoscopic simulation allows behavior analysis at individual and collective levels but with less accuracy than in microscopic simulation.
The phenomena occurring in traffic and their probabilistic nature make it impossible to map them with sufficient accuracy using analytical methods. Hence, it is now standard practice to use computer simulations. VISSIM, AIMSUN, PARAMICS, TRANSIMS, and SUMO are among the most well-known and popular traffic simulation programs. The latest versions of some of these programs use machine learning and artificial intelligence techniques.

Dataset and tests

Two data sets were used for the experiments:

1.
Actual measurement data provided by the California Department of Transportation (CalTrans) on the Caltrans Performance Measurement System (PeMS) website¹⁷. The use of PeMS data requires some caution, as it is quite common for individual measurement stations or circuits to fail to provide reliable measurement data due to damage or shutdowns. The technical condition of the measuring equipment and the evaluation of the reliability of the data can be checked on the PeMS website.
2.
The synthesized data set was obtained using Vissim microscopic simulation software³⁷.

For the training and validation of the machine learning models, data of May 2022 was selected and divided into a learning set of 4377 records, containing traffic volume data from May 1st to 16th, and a validation set of 1095 records, containing data from May 16th to 19th. In both cases, data were aggregated at a 5-min interval and came from station 772903 (I210-E).

The test data was generated using the Vissim simulator, calibrated with PeMS data from May 1st to 7th.

Description of experiments

The methodology used in the first phase of experiments was initially inspired by publications by Lv et al.¹ and Fu et al.¹⁰. Tests were performed based on three machine learning models: LSTM, GRU, and SAE. A formal description of these algorithms will not be included here, as they are repeated many times in the cited literature and many tutorials available on the Internet.

The adopted LSTM model in a three-layer configuration is shown in Fig. 1.

Four models containing one to four layers of LSTM were made for further testing.

The adopted GRU model in a three-layer configuration is shown in Fig. 2.

As with the LSTM, four models containing one to four GRU layers were made.

The SAE model used is shown in Fig. 3.

The SAE model was tested only in the three-layer configuration shown above. The reason for this was that pilot tests showed that for stacked autoencoders the average percentage error between the predicted and actual value for the validation set tends to be higher than for GRU and LSTM models. For this reason, we paid more attention to testing models based on recurrent networks rather than autoencoders. Like its predecessors, it was tested using synthetic data derived from the Vissim simulator after being trained on real data. The following metrics were used to evaluate the models: explained_variance_score, mean absolute percent error (MAPE), mean absolute error (MAE), mean squared error (MSE), rooted mean squared error (RMSE), and determination coefficient R2.

These metrics can be expressed as follows:

$$Explained\, variance\left(y, {\hat{\text{y}}} \right)=1- \frac{Var(y- {\hat{\text{y}}} )}{Var(y)}$$

(1)

where y—actual values, $\hat{y}$—predicted values, Var(y − $\hat{y}$)—variance of prediction errors, Var(y)—actual variance values

$$MAPE\left(y, {\hat{\text{y}}} \right)=\frac{1}{n}\sum_{i=1}^{n}\left|\frac{{y}_{i}-{ {\hat{\text{y}}} }_{i}}{{y}_{i}}\right|$$

(2)

$$MAE\left(y, {\hat{\text{y}}} \right)=\frac{1}{n}\sum_{i=1}^{n}\left|{y}_{i}-{ {\hat{\text{y}}} }_{i}\right|$$

(3)

$$MSE\left(y, {\hat{\text{y}}} \right)=\frac{1}{n}\sum_{i=1}^{n}{\left({y}_{i}-{ {\hat{\text{y}}} }_{i}\right)}^{2}$$

(4)

$$RMSE\left(y, {\hat{\text{y}}} \right)=\sqrt{\frac{1}{n}\sum_{i=1}^{n}{({y}_{i}-{ {\hat{\text{y}}} }_{i})}^{2}}$$

(5)

$$R2=\frac{\sum_{i=1}^{n}{\left({ {\hat{\text{y}}} }_{i}-\overline{y }\right)}^{2}}{\sum_{i=1}^{n}{\left({y}_{i}-\overline{y }\right)}^{2}}$$

(7)

$$\overline{y }=\frac{1}{n}\sum_{i=1}^{n}{y}_{i}$$

(8)

All calculations were performed on a computer with a six-core Intel Core i7-6800k processor @ 3.40 GHz, 64 GB RAM, with an NVIDIA GeForGTX 1070 graphics card, running MS Windows 10 Pro.

Figure 4 shows the average percentage error (mean_absolute_percentage_error) as a function of the number of learning loops for the validation set and four LSTM models, differing in the number of inner layers.

It can be seen that learning loss is not a linear function of the number of layers and that it is the smallest in cases where the number is an even number.

Similarly, Fig. 5 shows the average percentage error (Mean_absolute_percentage_error) as a function of the number of learning loops for a validation set and four GRU models, differing in the number of inner layers.

As with LSTM, models with an even number of inner layers show the lowest learning losses. However, for all GRU models, the losses appear higher than for LSTM. A comparison of losses for the three models tested is shown in Fig. 6.

While the learning losses of the GRU and LSTM models are comparable, with the LSTM indicated as the slightly more accurate model, the learning losses of the SAE model are significantly higher.

Vissim program calibration

The Vissim software provides two psychophysical models to describe driver behavior: Wiedemann 74³⁸ and Wiedemann 99^39,40. The general structure of both models in terms of car-following mode is similar and shown in Fig. 7. The Wiedemann 99 model is more sophisticated and has many more parameters that need to be set (calibrated). This model is designed for traffic analysis at higher speeds (highways, expressways). Simulations were carried out based on the Wiedemann 99 model.

Following the recommendations presented in the literature⁴¹, the following values of calibration parameters were adopted (parameter names according to Vissim):

CC0 (standstill distance):	1.5 m
CC1 (headway time):	0.9 s
CC2 ('following' distance oscillation):	4 m
CC3 (the threshold for entering ‘following’):	− 8
CC4 (negative 'following' threshold):	− 0.35
CC5 (positive 'following' threshold):	0.35
CC6 (speed dependency of oscillation):	11.44
CC7 (oscillation acceleration):	0.25 m/s²
CC8 (standstill acceleration):	3.5 m/s²
CC9 (acceleration at 80 km/h):	1.5 m/s²

It was assumed that only passenger cars (90.1%) and trucks (9.9%) participate in the traffic stream and that the permitted speeds, according to US regulations, are 70 mph and 55 mph, or 112.6 km/h and 88.5 km/h, respectively. Traffic volume distribution was defined based on PeMS data from May 1st to May 7th.

Results

Figures 4, 5, 6, 7, 8, 9, 10 and 11 illustrate the results obtained from the experiments, i.e., the prediction results obtained with the models described above. All of the following traffic volume graphs have been normalized to facilitate the evaluation of the results.

Figure 8 shows the prediction results for the single-layer LSTM and GRU models and the SAE model. As mentioned, the SAE model was tested only in the three-layer configuration described above. Therefore, in the remainder of this article, only the name SAE will be used, without mentioning the details of its architecture.

The resulting graph does not reveal significant differences between the models, but it can be seen that the upper sections of the curves do not coincide with the line drawn by the actual data. The numerical data of the parameters included in Table 1 equally do not show significant differences, except for MAPE (mean absolute percentage error—loss function), which for the SAE model is clearly more significant than for the other models.

Table 1 Error metrics for the prediction shown in Fig. 8.

Full size table

Figure 9 shows the prediction results for the two-layer LSTM and GRU models and the SAE model. It can be seen that the graph corresponding to the GRU model is clearly below the LSTM and SAE lines. The MAPE numerical values shown in Table 2 confirm this observation: the error for GRU is more significant.

Table 2 Error metrics for the prediction shown in Fig. 9.

Full size table

Figure 10 shows the prediction results for SAE and the three-layer LSTM and GRU models. In this case, no significant differences can be seen between the predictions, and the numerical values shown in Table 3 are similar, although the MAPE value for GRU is the largest.

Table 3 Error metrics for the prediction shown in Fig. 10.

Full size table

Figure 11 shows the prediction results for the SAE model and the four-layer LSTM and GRU models. As in previous cases, only the MAPE shows more significant differences, while the other parameters are very similar. For GRU, the MAPE takes on a value similar to that for the two-layer model, clearly more significant than that shown for LSTM and SAE (Table 4).

Table 4 Error metrics for the prediction shown in Fig. 11.

Full size table

Figures 12 and 13 show the prediction results of the three-layer models for shorter periods, 24 h and 8 h, respectively. It can be seen from the graphs that the predictions do not keep up with the faster changes in the actual data. It is particularly clear in the upper, approximately flat part of the graph shown in Fig. 8 and the previous ones, i.e., Figs. 4, 5, 6 and 7.

The prediction results of the SAE and four-layer LSTM I GRU models are shown in Figs. 14 and 15. As with the three-layer models, the predictions do not keep up with the rapid changes in real data.

Discussion

Preparing models, especially LSTM, proved to be a relatively time-consuming task. Reduction of calculation time proved to be feasible when the graphics card computational resources were included in learning the models, which reduced the required time by about 400 times. This problem could become significant if the trained models were implemented in devices with relatively small computational resources, e.q. in intelligent traffic signs¹⁹. Furthermore, the software must be updated due to changing traffic conditions. From this point of view, the SAE model proved to be the best, i.e., the least energy-intensive model, followed by GRU in second place and LSTM in third position. The conclusions of publication⁴² regarding the significantly lower computational cost of the GRU model compared to LSTM were confirmed in this way.

While computational needs may be necessary for specific applications, prediction accuracy is undoubtedly a more important criterion for evaluating model quality. Two observations should be mentioned at this point:

1.
At the learning stage, the dependence of losses on the type of internal architecture of the models, that is, on the number of inner layers, is clear. It can be observed that with an increase in the number of layers, the accuracy of models (GRU and LSTM) increases, although this is not a linear relationship. As mentioned above, the best learning results were achieved by models containing an even number of layers, i.e., two or four. Among them, four-layer models showed the lowest learning errors. This effect can be observed in Figs. 4 and 5.
2.
As seen in Fig. 6, the SAE model proved to be the least accurate of the tested models. The LSTM model, on the other hand, appears slightly more accurate than the GRU model with a similar architecture.

The resulting prediction accuracy is expressed by the metrics shown in Tables 1, 2, 3 and 4 as with the learning process, which depends on the architecture of the models. Single-layer LSTM and GRU models gave the best results (Table 1), which seems to contradict the above remarks on the learning process. On the other hand, LSTM I GRU models with an even number of inner layers gave significantly worse results than training with real data. Overall, the three-layer models proved to be the most accurate. The metrics shown in Table 3 are fairly even, but it can be seen that the GRU model performs least favorably in this comparison.

The trained models obtained in the experiments described above were made using the Keras library, allowing them to be saved in HDF5 binary format. They can be implemented in edge devices such as smart traffic signs. Suppose the target device has sufficient resources and environment (Linux, Tensorflow). In that case, copying the binary model into the new environment and loading it into memory using the appropriate function may be sufficient. For a Keras model stored as HDF5, the ‘keras.models.load_model (‘my_model.h5’)’ command can be used. In the case of limited resources or a different hardware platform, it will be necessary to convert the binary model to a format supported by the target device. This can be done using the TensorFlow Lite Converter tool, which can convert the TensorFlow and Keras models to the TensorFlow Lite (.tflite) format, designed to run on devices with limited computing resources and supported by different programming languages. An alternative solution is to use ONNX (Open Neural Network Exchange), an open standard for machine learning models supported by many libraries and platforms. The Keras2onnx library allows Keras models to be converted into ONNX format, which can then be loaded and run with the appropriate ONNX tool in various programming languages.

The example process of model implementation:

1.
Model conversion to TensorFlow Lite The first step is to convert the HDF5 model to TensorFlow Lite format (.tflite) using the TensorFlow Lite Converter tool. This can be done in Python using the appropriate TensorFlow functions.
1. 2.
  Implementation of TensorFlow Lite Micro TensorFlow provides a version of TensorFlow Lite Micro, which is specifically designed to run on microcontrollers. One needs to compile the library for your specific hardware platform to use TensorFlow Lite Micro on a microcontroller.
2. 3.
  Loading the model onto the microcontroller After compiling the TensorFlow Lite Micro library, one needs to configure the microcontroller to load the .tflite model. It may require converting the model into a byte array and saving it in a C++ header file, which is then included in the project.
3. 4.
  Invoking inference After loading the model, one needs to write code that feeds input data to the model and performs inference. For this, you use functions provided by the TensorFlow Lite Micro library.
4. 5.
  Updates and maintenance Once the model is implemented, regular updates and maintenance will be required. This may include re-training the model on new data, updating the intelligent road sign software, and monitoring model performance in real-time.

Extensive information on implementing trained models in embedded devices can be found at ^43,44,45.

One should remember the computational needs of implemented models. As mentioned above, LSTM models are characterized by the greatest needs in this regard, hence its implementation in edge devices can be problematic. In such a case, the more suitable choice is undoubtedly the GRU model. An interesting variant could also be the minimal gated unit (MGU) models, which were not covered in our research.

Conclusions

This paper compared traffic prediction results obtained using three machine learning models tested with two types of data: the first was real data provided by PeMS¹⁷, and the second came from traffic simulations conducted with the software simulator Vissim. The comparison was made by using the simulation results as a test set and re-generating predictions using the prepared machine learning models. The results obtained using data from the Vissim simulator do not differ (or differ slightly) from those obtained using real data. It leads to confirmation of the high reliability of the Vissim simulator. Still, on the other hand, it confirms the validity of the assumption that such a method, applicable in off-line mode, can be useful rather in situations where the acquisition of real data is difficult for various reasons. Meanwhile, machine learning models can operate in real-time mode. Furthermore, machine learning and simulations performed with specialized software are applicable for autonomous adaptive traffic control systems^19,46,47.

Data availability

All data generated or analyzed during this study are included in this published article (and its Supplementary Information files).

References

Yisheng, L. et al. Traffic flow prediction with big data: A deep learning approach. IEEE Trans. Intell. Transp. Syst. 16(2), 865–873. https://doi.org/10.1109/TITS.2014.2345663 (2015).
Article Google Scholar
Williams, B. L. & Hoel, A. Modeling and forecasting vehicular traffic flow as a seasonal ARIMA process: Theoretical basis and empirical results. J. Transp. Eng. 129(6), 664–672. https://doi.org/10.1061/(ASCE)0733-947X(2003)129:6(664) (2003).
Article Google Scholar
Karlaftis, M. G. & Vlahogianni, E. I. Statistical methods versus neural networks in transportation research: Differences, similarities, and some insights. Transp. Res. Part C Emerg. Technol. 19(3), 387–399. https://doi.org/10.1016/j.trc.2010.10.004 (2011).
Article Google Scholar
Yao, H., Tang, X., Wei, H., Zheng, G. & Li, Z. Revisiting spatial-temporal similarity: A deep learning framework for traffic prediction. Proc. AAAI Conf. Artif. Intell. 33(01), 5668–5675. https://doi.org/10.1609/aaai.v33i01.33015668 (2019).
Article Google Scholar
Liu, Y., Zheng, H., Feng, X. & Chen, Z. Short-term traffic flow prediction with Conv-LSTM. In 2017 9th International Conference on Wireless Communications and Signal Processing (WCSP), Nanjing, China, 1–6. https://doi.org/10.1109/WCSP.2017.8171119 (2017).
Chan, K. Y., Dillon, T. S., Singh, J. & Chang, E. Neural-network-based models for short-term traffic flow forecasting using a hybrid exponential smoothing and Levenberg–Marquardt algorithm. IEEE Trans. Intell. Transp. Syst. 13(2), 644–654. https://doi.org/10.1109/TITS.2011.2174051 (2012).
Article Google Scholar
Lu, S., Zhang, Q. & Chen, G. A combined method for short-term traffic flow prediction based on recurrent neural network. Dewen Seng Alex. Eng. J. 60(1), 87–94. https://doi.org/10.1016/j.aej.2020.06.008 (2021).
Article Google Scholar
Abadi, A., Rajabioun, T. & Ioannou, P. A. Traffic flow prediction for road transportation networks with limited traffic data. IEEE Trans. Intell. Transp. Syst. 16(2), 653–662. https://doi.org/10.1109/TITS.2014.2337238 (2015).
Article Google Scholar
Wei, C. et al. Combination predicting model of traffic congestion index in weekdays based on LightGBM-GRU. Sci. Rep. 12(1), 2912. https://doi.org/10.1038/s41598-022-06975-1 (2022).
Article ADS CAS Google Scholar
Fu, R., Zhang, Z. & Li, L. Using LSTM and GRU neural network methods for traffic flow prediction. In Chinese Association of Automation, 324–328. https://doi.org/10.1109/YAC.2016.7804912 (2017).
Cui, Z., Henrickson, K., Ke, R. & Wang, Y. Traffic graph convolutional recurrent neural network: A deep learning framework for network-scale traffic learning and forecasting. IEEE Trans. Intell. Transp. Syst. 21(11), 4883–4894. https://doi.org/10.1109/TITS.2019.2950416 (2020).
Article Google Scholar
Xu, X., Zhang, T., Xu, C., Cui, Z. & Yang, J. Spatial-temporal tensor graph convolutional network for traffic speed prediction. IEEE Trans. Intell. Transp. Syst. 24(1), 92–103. https://doi.org/10.1109/TITS.2022.3215613 (2023).
Article Google Scholar
Zhao, L. et al. T-GCN: A temporal graph convolutional network for traffic prediction. IEEE Trans. Intell. Transp. Syst. 21(9), 3848–3858. https://doi.org/10.1109/TITS.2019.2935152 (2020).
Article Google Scholar
Guo, S., Lin, Y., Feng, N., Song, C. & Wan, H. Attention-based spatial-temporal graph convolutional networks for traffic flow forecasting. Proc. AAAI Conf. Artif. Intell. 33(01), 922–929. https://doi.org/10.1609/aaai.v33i01.3301922 (2019).
Article CAS Google Scholar
Zhang, J., Zheng, Y., Sun, J. & Qi, D. Flow prediction in spatio-temporal networks based on multitask deep learning. IEEE Trans. Knowl. Data Eng. 32(3), 468–478. https://doi.org/10.1109/TKDE.2019.2891537 (2020).
Article Google Scholar
Luo, X., Li, D., Yang, Y. & Zhang, S. Spatiotemporal traffic flow prediction with KNN and LSTM. Hindawi J. Adv. Transp. 2019, 10. https://doi.org/10.1155/2019/4145353 (2019).
Article Google Scholar
Caltrans, Performance Measurement System (PeMS). http://pems.dot.ca.gov (2023).
PTV Planung Transport Verkehr GmbH. Manual PTV Vissim (2022).
Czyzewski, A., Sroczynski, A., Śmiałkowski, T. & Hoffmann, P. Development of intelligent road signs with V2X interface for adaptive traffic controlling. In Proceedings of the 6th International Conference on Models and Technologies for Intelligent Transportation Systems (MT-ITS), 1–7. https://doi.org/10.1109/MTITS.2019.8883369 (2019).
Li, Y., Yu, R., Shahabi, C. & Liu, Y. Diffusion convolutional recurrent neural network: Data-driven traffic forecasting. https://doi.org/10.48550/arXiv.1707.01926 (2018)
Mou, L., Zhao, P., Xie, H. & Chen, Y. T-LSTM: A long short-term memory neural network enhanced by temporal information for traffic flow prediction. IEEE Access 7, 98053–98060. https://doi.org/10.1109/ACCESS.2019.2929692 (2019).
Article Google Scholar
Ma, C., Zhao, Y., Dai, G., Xu, X. & Wong, S.-C. A novel STFSA-CNN-GRU hybrid model for short-term traffic speed prediction. IEEE Trans. Intell. Transp. Syst. 24(4), 3728–3737. https://doi.org/10.1109/TITS.2021.3117835 (2023).
Article Google Scholar
Yin, X. et al. Deep learning on traffic prediction: Methods, analysis, and future directions. IEEE Trans. Intell. Transp. Syst. 23(6), 4927–4943. https://doi.org/10.1109/TITS.2021.3054840 (2022).
Article Google Scholar
Ma, C., Dai, G. & Zhou, J. Short-term traffic flow prediction for urban road sections based on time series analysis and LSTM_BILSTM method. IEEE Trans. Intell. Transp. Syst. 23(6), 5615–5624. https://doi.org/10.1109/TITS.2021.3055258 (2022).
Article Google Scholar
Wang, Z., Su, X. & Ding, Z. Long-term traffic prediction based on LSTM encoder–decoder architecture. IEEE Trans. Intell. Transp. Syst. 22(10), 6561–6571. https://doi.org/10.1109/TITS.2020.2995546 (2021).
Article Google Scholar
Ghosh, B., Basu, B. & O’Mahony, M. Multivariate short-term traffic flow forecasting using time-series analysis. IEEE Trans. Intell. Transp. Syst. 10(2), 246–254. https://doi.org/10.1109/TITS.2009.2021448 (2009).
Article Google Scholar
Fei, L. & Han, B. Multi-object multi-camera tracking based on deep learning for intelligent transportation: A review. Sensors 23, 3852. https://doi.org/10.3390/s23083852(2023) (2023).
Article ADS PubMed PubMed Central Google Scholar
Qu, L., Li, W., Li, W., Ma, D. & Wang, Y. Daily long-term traffic flow forecasting based on a deep neural network. Expert Syst. Appl. 121, 304–312. https://doi.org/10.1016/j.eswa.2018.12.031 (2019).
Article Google Scholar
Trirat, P., Yoon, S. & Lee, J.-G. MG-TAR: multi-view graph convolutional networks for traffic accident risk prediction. IEEE Trans. Intell. Transp. Syst. 24(4), 3779–3794. https://doi.org/10.1109/TITS.2023.3237072 (2023).
Article Google Scholar
Ma, X. et al. Learning traffic as images: A deep convolutional neural network for large-scale transportation network speed prediction. Sensors 17(4), 818. https://doi.org/10.3390/s17040818 (2017).
Article ADS PubMed PubMed Central Google Scholar
Cui, Z., Lin, L., Pu, Z. & Wang, Y. Graph Markov network for traffic forecasting with missing data. Transp. Res. Part C Emerg. Technol. 117, 102671. https://doi.org/10.1016/j.trc.2020.102671 (2020).
Article Google Scholar
Wang, A., Ye, Y., Song, X., Zhang, S. & Yu, J. J. Q. Traffic prediction with missing data: A multi-task learning approach. IEEE Trans. Intell. Transp. Syst. 24(4), 4189–4202. https://doi.org/10.1109/TITS.2022.3233890 (2023).
Article Google Scholar
Lana, I., Del Ser, J., Velez, M. & Vlahogianni, E. I. Road traffic forecasting: recent advances and new challenges. IEEE Intell. Transp. Syst. Mag. 10(2), 93–109. https://doi.org/10.1109/MITS.2018.2806634 (2018).
Article Google Scholar
Abduljabbar, R. L., Dia, H. & Tsai, P.-W. Development and evaluation of bidirectional LSTM freeway traffic forecasting models using simulation data. Sci. Rep. 11(1), 1–16. https://doi.org/10.1038/s41598-021-03282-z (2021).
Article CAS Google Scholar
Chen, Q. et al. A deep neural network inverse solution to recover pre-crash impact data of car collisions. Transp. Res. Part C Emerg. Technol. 126, 103009. https://doi.org/10.1016/j.trc.2021.103009 (2021).
Article Google Scholar
Xie, Y., Wu, C. T., Li, B., Hu, X. & Li, S. A feed-forwarded neural network-based variational Bayesian learning approach for forensic analysis of traffic accident. Comput. Methods Appl. Mech. Eng. 397, 115148. https://doi.org/10.1016/j.cma.2022.115148 (2022).
Article ADS MathSciNet MATH Google Scholar
PTV Vissim-Multimodal Traffic Simulation Software. https://www.myptv.com/en/mobility-software/ptv-vissim (2023).
Wiedemann, R. Simulation des Strassenverkehrsflusses (University Karlsruhe, 1974) (in German).
Olstam, J. J. & Tapani, A. Comparison of car-following models. Swedish National Road and Transport Research Institute. ISSN 0347–6049. Available at https://vti.diva-portal.org/smash/get/diva2:673977/FULLTEXT01.pdf (2VTI meddelande 960A, 2004)
Wiedemann, R. & Reiter, U. Microscopic Traffic Simulation: The Simulation System MISSION, Background and Actual State. Project ICARUS (V1052) Final Report. Brussels, CEC. 2: Appendix A (1992)
Florida Department of Transportation. Traffic Analysis Handbook. Available at https://fdotwww.blob.core.windows.net/sitefinity/docs/default-source/planning/systems/systems-management/sm-old-files/traffic-analysis/traffic-analysis-handbook_march-2014.pdf?sfvrsn=51c88e22_0 (2014).
Jeong, M. H., Lee, T.-Y., Jeon, S.-B. & Youm, M. Highway speed prediction using gated recurrent unit neural networks. Appl. Sci. 11(7), 3059. https://doi.org/10.3390/app11073059 (2021).
Article CAS Google Scholar
Warden, P. & Situnayake, D. Tinyml: Machine Learning with Tensorflow Lite on Arduino and Ultra-low-power Microcontrollers (O’Reilly Media, 2019).
Google Scholar
Deploy Machine Learning Models on Mobile and Edge Devices. https://www.tensorflow.org/lite (2023).
Open Neural Network Exchange, The Open Standard for Machine Learning Interoperability. https://onnx.ai (2023).
Sroczyński, A. & Czyżewski, A. Examining the impact of distance between VSL road signs on vehicle speed variance. IEEE Access 11, 7521–7529. https://doi.org/10.1109/ACCESS.2023.3238578 (2023).
Article Google Scholar
Sroczynski, A., Kurowski, A., Zaporowski, S. & Czyżewski, A. Examining impact of speed recommendation algorithm operating in autonomous road signs on minimum distance between vehicles. Remote Sens. 14(12), 2803. https://doi.org/10.3390/rs14122803 (2022).
Article ADS Google Scholar

Download references

Acknowledgements

The Polish National Centre funded the research and development project (NCBR) from the European Regional Development Fund No. POIR.04.01.04-00-0089/16 entitled: “INZNAK: Intelligent Road Signs with V2X Interface for Adaptive Traffic Controlling”.

Author information

Authors and Affiliations

Multimedia Systems Department, Faculty of Electronics, Telecommunication and Informatics, Gdansk University of Technology, 80-233, Gdańsk, Poland
Andrzej Sroczyński & Andrzej Czyżewski

Authors

Andrzej Sroczyński
View author publications
You can also search for this author in PubMed Google Scholar
Andrzej Czyżewski
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.S.: planning, data collection, methodology, algorithm implementation, testing, data analysis, drafting of paper content. A.C.: the study concept, supervision, mentoring, reviewing, and editing.

Corresponding author

Correspondence to Andrzej Sroczyński.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Sroczyński, A., Czyżewski, A. Road traffic can be predicted by machine learning equally effectively as by complex microscopic model. Sci Rep 13, 14523 (2023). https://doi.org/10.1038/s41598-023-41902-y

Download citation

Received: 07 June 2023
Accepted: 01 September 2023
Published: 04 September 2023
DOI: https://doi.org/10.1038/s41598-023-41902-y

This article is cited by

Advanced transport systems: the future is sustainable and technology-enabled
- Yue Cao
- Sybil Derrible
- Haiping Du
Scientific Reports (2024)
A guided genetic algorithm-based ensemble voting of polynomial regression and LSTM (GGA-PolReg-LSTM) for congestion prediction using IoT and air quality data in sustainable cities
- Boutheina Jlifi
- Mahdi Medini
- Claude Duvallet
The Journal of Supercomputing (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.