Decomposition for Multivariant Traffic Time Series: Comparison
Please note this is a comparison between Version 2 by Alfred Zheng and Version 1 by Lei Zhang.

Data-driven modeling methods have been widely used in many applications or studies of traffic systems with complexity and chaos. The empirical mode decomposition (EMD) family provides a lightweight analytical method for non-stationary and non-linear data.  A large amount of traffic data in practice are usually multidimensional, so the EMD family cannot be used directly for those data.

  • multivariate traffic
  • empirical mode decomposition

1. Introduction

Because of the complexity and chaos of traffic systems, it is hard to investigate the system state estimates or mode identification via the method of directly modeling. Thus, data-driven modeling methods have been widely used in many applications or studies concerning intelligent traffic, and the research regarding digital traffic has become a branch of intelligent traffic, which includes traffic feature analysis, traffic flow estimate, bus arrival estimate, traffic jam identification, traffic events and accident analysis. On the other hand, the development of artificial intelligence empowers intelligent traffic and brings huge benefits. As one of the three key elements of artificial intelligence, data play an important role in applications and research.
A large number of studies of traffic modeling via time series have taken place. Chin and Quddus [1] use the random-effect negative binomial (RENB) model to investigate the elements appropriate for maintaining safety in road intersections. Brijs, Karlis and Wets [2] and Quddus [3] used integer-valued autoregressive Poisson models to model data of accidents and investigate the relation of accidents and some specific factors. Commandeur, Bijleveld, Bergel-Hayat et al. [4] and Saar [5] used auto-regression and moving average model to investigate the correlation of traffic accidents and some other factors. For panel data analysis, F. Chen, Ma and S. Chen [6,7][6][7] introduced random-effect tobit models to investigate the relationship between traffic crashes and several factors such as traffic states, weather and surface conditions. Later, methods such as full Bayesian hierarchical approach and multivariate Poisson lognormal models were used to investigate traffic crash modeling and factors regarding traffic accidents [8,9,10][8][9][10]. In recent years, while regression models have still been widely used as the foundation of time series data analysis, some machine learning methods have been introduced to improve the performance. Tuli, Mitra and Crews [11] employed a random-effect negative binomial (RENB) model to investigate the demand for shared bicycles. Barroso, Albuquerque-Oliveira and Oliveira-Neto [12] introduced clustering methods to define traffic profiles and the daily traffic periods in trip analyses based on OD data. Chang, Huang, Chan et al. [13] introduced long-memory properties to investigate road fatality factors.
Traffic time series data contain multiple mode characteristics, so mode decomposition of such data is essential for better analysis and modeling. Commonly used modal decomposition methods for time-series data include discrete wavelet transform (DWT), empirical mode decomposition (EMD) and variational mode decomposition (VMD). EMD, in particular, allows for adaptive decomposition of data and efficient handling of nonlinear and non-smooth data without significant computational burden. EMD has been utilized in many traffic and transportation applications such as traffic data denoising [14[14][15],15], traffic infrastructure healthy monitoring [16], traffic flow evolving dynamic evaluation [17] and time variant detection [18], as well as prediction of section traffic flow [19], traffic speed [20] and metro passenger flow [21]. However, current empirical mode decomposition methods such as EMD, EEMD, CEEMD and CEEMDAN are not equipped to handle multivariate data directly. Since the traffic system generates many multivariate time series data, such as trajectory data, there is a pressing need to extend classical empirical mode decomposition methods to deal with multivariate time series.

2. Decomposition for Multivariant Traffic Time Series 

Original empirical mode decomposition [22] provides a method to extract intrinsic mode functions from non-stationary time series signals. The conditions which the IMF satisfies and the procedures to extract IMFs are closely related to the extremum and envelope function. However, for multi-dimension signals, common extremum and envelope functions do not exist. So, the original EMD cannot be applied directory to multi-dimension signals. Several researchers have proposed decomposition methods for complex-valued data. Tanka and Mandic [23] decomposed complex-valued data into positive frequency components and negative frequency components. A band-pass filter was used so that both positive and negative frequency components are analytic signals, which means the real part of those components contains complete information of the original signal. Then the classical EMD was used to extract IMFs. This method made clever use of the band-pass filter and the characteristic of analytic signal, but the two sets of IMFs from positive and negative components cannot be linked intuitively to the original signal. Another idea is to extend the definition of the envelope function or extremum point. Bin Altaf, Gautama, Tanaka et al. [24] proposed a new definition of extremum of complex-valued data series in which the extremum points were found according to whether the first derivative changes its sign. Then the complex-valued envelope functions and the average can be computed. This method decomposes the complex-valued signal directory without separating the signal into two parts so that the results are more intuitive. Rilling, Flandrin, Goncalves et al. [25] extend the “oscillation” in two dimensions to the “rotation” in three dimensions so the task is to decompose the rotation modes, such as “rapid” and “slower” rotation, of complex-valued signals (data series). The extremum points were defined as the tangent points to the top, bottom, left and right. Those points were linked by a cubic spline to be the envelope functions. This method, from the perspective of subsequent studies, uses a fixed projection to calculate the extrema points and the envelope which may miss the combined effect of multiple variables. Rehman and Mandic [26] proposed an extension of EMD for trivariate signals in which projection directions were introduced to find the extrema points and calculate envelope curves. To choose those directions, a sphere in signal space was built and multiple longitudinal lines were uniformly chosen on that sphere. Then a series of equidistant points on each longitudinal line was taken as the projection directions. Along those projection directions, maximum points of the input signal were found and the envelope curves was obtained by interpolating those maximum points (along each direction). After that, the two authors proposed an advanced method for n-variate signals in which the projection directions were chosen more uniformly [27]. For n-variate input, the low-discrepancy pointsets were used to generate uniform points on the n-1 sphere as projection directions. This method was widely used in many subsequent studies. However, the computational effort of this method is very high. Inspired by some studies of non-temporal multidimensional empirical mode decomposition. Thirumalaisamy and Ansell [28] proposed a fast and adaptive multivariate EMD method in which order statistics filter was used to take the place of classical spline interpretation so that the computational cost could be reduced and Delaunay triangulation and sparable filters were used to reduce the computational cost of projection calculation. However, even though the method proposed was an improvement, the algorithm of the Multivariate EMD was still a bit complicated. Fleureau, Kachenoura, Albera et al. [29,30][29][30] proposed a method to obtain a signal’s mean trend by interpolating barycenter which was computed from identified elementary oscillations. A D+1 dimension tangent vector of a D dimension signal was defined. The oscillation extremum defined in this method was the point where the norm of the “tangent” reaches the local minimum. Then, rather than calculating envelope curves to obtain the mean curve, the concept oscillation barycenter was introduced to calculate the mean curve directly. An oscillation barycenter was defined to be a point between two oscillation extremum points. The time coordination of one barycenter point was set to the intermediate moment of its adjacent two extreme points. The variate values of the barycenter were defined to be the average of the signal variate integrals between the two extreme points. The authors improved this method later by changing the calculation of the mean curve [31]. The envelope curves were reintroduced to calculate the mean curve. The even oscillation extremum and odd oscillation extremum were interpolated separately to obtain two envelope curves. This method extended the original EMD to a multidimensional signal. However, the extremum identification algorithm may obtain false extremum points from discrete time series, for the differences of the signals are not continuous. For example, a one-dimension signal [0.1, 0.5, 0.7, 0.9, 0.7, 0.1] has minimum norm of “tangent” at the third and fourth points but the fourth point is the extremum point.

References

  1. Chin, H.C.; Quddus, M.A. Applying the random effect negative binomial model to examine traffic accident occurrence at signalized intersections. Accid. Anal. Prev. 2003, 35, 253–259.
  2. Brijs, T.; Karlis, D.; Wets, G. Studying the effect of weather conditions on daily crash counts using a discrete time-series model. Accid. Anal. Prev. 2008, 40, 1180–1190.
  3. Quddus, M.A. Time series count data models: An empirical application to traffic accidents. Accid. Anal. Prev. 2008, 40, 1732–1741.
  4. Commandeur, J.J.F.; Bijleveld, F.D.; Bergel-Hayat, R.; Antoniou, C.; Yannis, G.; Papadimitriou, E. On statistical inference in time series analysis of the evolution of road safety. Accid. Anal. Prev. 2013, 60, 424–434.
  5. Saar, I. Do Alcohol Excise Taxes Affect Traffic Accidents? Evidence From Estonia. Traffic Inj. Prev. 2015, 16, 213–218.
  6. Chen, F.; Ma, X.; Chen, S. Refined-scale panel data crash rate analysis using random-effects tobit model. Accid. Anal. Prev. 2014, 73, 323–332.
  7. Ma, X.; Chen, F.; Chen, S. Modeling Crash Rates for a Mountainous Highway by Using Refined-Scale Panel Data. Transp. Res. Rec. J. Transp. Res. Board 2015, 2515, 10–16.
  8. Ma, X.X.; Chen, S.R.; Chen, F. Correlated Random-Effects Bivariate Poisson Lognormal Model to Study Single-Vehicle and Multivehicle Crashes. J. Transp. Eng. 2016, 142, 04016049.
  9. Gill, G.S.; Cheng, W.; Xie, M.; Vo, T.; Jia, X.; Zhou, J. Evaluating Influence of Neighboring Structures on Spatial Crash Frequency Modeling and Site-Ranking Performance. Transp. Res. Rec. J. Transp. Res. Board 2017, 2659, 117–126.
  10. Cheng, W.; Gill, G.S.; Sakrani, T.; Dasu, M.; Zhou, J. Predicting motorcycle crash injury severity using weather data and alternative Bayesian multivariate crash frequency models. Accid. Anal. Prev. 2017, 108, 172–180.
  11. Tuli, F.M.; Mitra, S.; Crews, M.B. Factors influencing the usage of shared E-scooters in Chicago. Transp. Res. Part Policy Pract. 2021, 154, 164–185.
  12. Barroso, J.M.F.; Albuquerque-Oliveira, J.L.; Oliveira-Neto, F.M. Correlation analysis of day-to-day origin-destination flows and traffic volumes in urban networks. J. Transp. Geogr. 2020, 89, 102899.
  13. Chang, F.; Huang, H.; Chan, A.H.S.; Shing Man, S.; Gong, Y.; Zhou, H. Capturing long-memory properties in road fatality rate series by an autoregressive fractionally integrated moving average model with generalized autoregressive conditional heteroscedasticity: A case study of Florida, the United States, 1975–2018. J. Saf. Res. 2022, 81, 216–224.
  14. Chen, X.; Chen, H.; Yang, Y.; Wu, H.; Zhang, W.; Zhao, J.; Xiong, Y. Traffic flow prediction by an ensemble framework with data denoising and deep learning model. Phys. A-Stat. Mech. Appl. 2021, 565.
  15. Xu, L.; Wei, C.; Chingyao, C.; Bin, L.; Xianghui, S. Multi-sensor fusion methodology for enhanced land vehicle positioning. Inf. Fusion 2019, 46, 51–62.
  16. Aied, H.; Gonzalez, A.; Cantero, D. Identification of sudden stiffness changes in the acceleration response of a bridge to moving loads using ensemble empirical mode decomposition. Mech. Syst. Signal Process. 2016, 66-67, 314–338.
  17. Xiong, H.; Shang, P.; Bian, S. Detecting intrinsic dynamics of traffic flow with recurrence analysis and empirical mode decomposition. Phys. A-Stat. Mech. Appl. 2017, 474, 70–84.
  18. Chen, M.C.; Wei, Y. Exploring time variants for short-term passenger flow. J. Transp. Geogr. 2011, 19, 488–498.
  19. Pholsena, K.; Pan, L.; Zheng, Z. Mode decomposition based deep learning model for multi-section traffic prediction. World Wide-Web-Internet Web Inf. Syst. 2020, 23, 2513–2527.
  20. Wang, H.; Liu, L.; Dong, S.; Qian, Z.; Wei, H. A novel work zone short-term vehicle-type specific traffic speed prediction model through the hybrid EMD-ARIMA framework. Transp.-Transp. Dyn. 2016, 4, 159–186.
  21. Wei, Y.; Chen, M.C. Forecasting the short-term metro passenger flow with empirical mode decomposition and neural networks. Transp. Res. Part -Emerg. Technol. 2012, 21, 148–162.
  22. Huang, N.E.; Shen, Z.; Long, S.R.; Wu, M.C.; Shih, H.H.; Zheng, Q.; Yen, N.C.; Tung, C.C.; Liu, H.H. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proc. R. Soc. Math. Phys. Eng. Sci. 1998, 454, 903–995.
  23. Tanaka, T.; Mandic, D.P. Complex Empirical Mode Decomposition. IEEE Signal Process. Lett. 2007, 14, 101–104.
  24. Bin Altaf, M.U.; Gautama, T.; Tanaka, T.; Mandic, D.P. Rotation Invariant Complex Empirical Mode Decomposition. In Proceedings of the 2007 IEEE International Conference on Acoustics, Speech and Signal Processing—ICASSP ’07, Honolulu, HI, USA, 15–20 April 2007; Volume 3, pp. III-1009–III-1012.
  25. Rilling, G.; Flandrin, P.; Goncalves, P.; Lilly, J.M. Bivariate empirical mode decomposition. IEEE Signal Process. Lett. 2007, 14, 936–939.
  26. Rehman, N.; Mandic, D.P. Empirical Mode Decomposition for Trivariate Signals. IEEE Trans. Signal Process. 2010, 58, 1059–1068.
  27. Rehman, N.; Mandic, D.P. Multivariate empirical mode decomposition. Proc. R. Soc.-Math. Phys. Eng. Sci. 2010, 466, 1291–1302.
  28. Thirumalaisamy, M.R.; Ansell, P.J. Fast and Adaptive Empirical Mode Decomposition for Multidimensional, Multivariate Signals. IEEE Signal Process. Lett. 2018, 25, 1550–1554.
  29. Fleureau, J.; Nunes, J.C.; Kachenoura, A.; Albera, L.; Senhadji, L. Turning Tangent Empirical Mode Decomposition: A Framework for Mono- and Multivariate Signals. IEEE Trans. Signal Process. 2011, 59, 1309–1316.
  30. Fleureau, J.; Kachenoura, A.; Nunes, J.C.; Albera, L.; Senhadji, L. 3A-EMD: A generalized approach for monovariate and multivariate EMD. In Proceedings of the 10th International Conference on Information Science, Signal Processing and their Applications (ISSPA 2010), Kuala Lumpur, Malaysia, 10–13 May 2010; pp. 300–303.
  31. Fleureau, J.; Kachenoura, A.; Albera, L.; Nunes, J.C.; Senhadji, L. Multivariate empirical mode decomposition and application to multichannel filtering. Signal Process. 2011, 91, 2783–2792.
More
ScholarVision Creations